Ah interesting. Is your keyword-document map (aka term dict) too big to keep in memory permanently? My understanding is that at Google they just keep it in memory on every replica.
Edit: I should specify they shard the corpus by document so there isn't a replica with the entire term dict on it.
Could plausibly fit in RAM, it's only like ~100 GB in total. We'll see; I'll probably keep it mmapped at first to see what happens. It isn't the target of very many queries (relatively speaking) anyway, so either way is probably fine.
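A minimal sketch of what "mmap it and see what happens" could look like in Java (assuming JDK 22+ with the java.lang.foreign API; the MmapTermDict name, the fixed 16-byte hash-to-offset record layout, and the binary-search lookup are illustrative assumptions, not the actual on-disk format). The point is that the file is mapped read-only and searched in place, so the OS page cache decides how much of it effectively lives in RAM:

```java
import java.io.IOException;
import java.lang.foreign.Arena;
import java.lang.foreign.MemorySegment;
import java.lang.foreign.ValueLayout;
import java.nio.channels.FileChannel;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

/**
 * Hypothetical on-disk layout: fixed-width 16-byte records of
 * (term hash, posting-list offset), sorted by term hash.
 * The file is mmapped read-only; frequently hit pages stay in the
 * page cache, so hot terms behave much like an in-RAM dictionary.
 */
class MmapTermDict {
    private static final long RECORD = 16;
    private final MemorySegment dict;
    private final long records;

    MmapTermDict(Path file) throws IOException {
        try (FileChannel ch = FileChannel.open(file, StandardOpenOption.READ)) {
            // Arena.global() keeps the mapping alive for the process lifetime.
            dict = ch.map(FileChannel.MapMode.READ_ONLY, 0, ch.size(), Arena.global());
            records = ch.size() / RECORD;
        }
    }

    /** Binary search for a term hash; returns its posting-list offset, or -1 if absent. */
    long lookup(long termHash) {
        long lo = 0, hi = records - 1;
        while (lo <= hi) {
            long mid = (lo + hi) >>> 1;
            long hash = dict.get(ValueLayout.JAVA_LONG, mid * RECORD);
            if (hash < termHash)      lo = mid + 1;
            else if (hash > termHash) hi = mid - 1;
            else return dict.get(ValueLayout.JAVA_LONG, mid * RECORD + 8);
        }
        return -1;
    }
}
```

Whether that's good enough mostly comes down to how often the lookups miss the page cache, which is exactly the "see what happens" part.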
No, I mean that for every search query, keywords are mapped to trees of documents, and there are dozens if not hundreds of lookups into those trees in order to intersect the document lists.
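To illustrate the intersection step (the class and method names, and the assumption that each keyword's document list is a sorted array of doc ids, are mine for the sketch, not the actual implementation), here is the classic approach: intersect pairwise with a two-pointer merge, starting from the shortest list so the candidate set shrinks as early as possible.

```java
import java.util.Arrays;
import java.util.Comparator;

/** Sketch: intersect per-keyword document lists, assumed to be sorted doc ids. */
class PostingIntersection {

    static long[] intersectAll(long[][] postings) {
        // Shortest list first keeps the intermediate result small.
        long[][] byLength = postings.clone();
        Arrays.sort(byLength, Comparator.comparingInt(p -> p.length));
        long[] result = byLength[0];
        for (int i = 1; i < byLength.length && result.length > 0; i++) {
            result = intersect(result, byLength[i]);
        }
        return result;
    }

    /** Two-pointer merge intersection of two sorted arrays. */
    private static long[] intersect(long[] a, long[] b) {
        long[] out = new long[Math.min(a.length, b.length)];
        int i = 0, j = 0, n = 0;
        while (i < a.length && j < b.length) {
            if (a[i] < b[j]) i++;
            else if (a[i] > b[j]) j++;
            else { out[n++] = a[i]; i++; j++; }
        }
        return Arrays.copyOf(out, n);
    }

    public static void main(String[] args) {
        long[][] postings = {
            {1, 4, 9, 12, 30},   // docs containing keyword one
            {4, 9, 30, 77},      // docs containing keyword two
            {2, 4, 9, 30, 100},  // docs containing keyword three
        };
        System.out.println(Arrays.toString(intersectAll(postings))); // [4, 9, 30]
    }
}
```

Each keyword adds roughly one linear pass over the current candidate set, which is where the dozens-to-hundreds of lookups per query come from.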