RE: Lucene FieldCache memory requirements

Fuad Efendi Mon, 02 Nov 2009 17:28:16 -0800

Ok, my "naive" thinking about FieldCache: for each Term we can quickly
retrieve DocSet. What are memory requirements? Theoretically,
[maxdoc]x[4-bytes DocumentID], plus some (small) array to store terms
pointing to (large) arrays of DocumentIDs.


Mike suggested http://issues.apache.org/jira/browse/LUCENE-1990 to make this
memory requirement even lower... but please correct me if I am wrong with
formula, and I am unsure how it is currently implemented...


Thanks,
Fuad


> -----Original Message-----
> From: Fuad Efendi [mailto:f...@efendi.ca]
> Sent: November-02-09 8:21 PM
> To: solr-user@lucene.apache.org
> Subject: RE: Lucene FieldCache memory requirements
> 
> Mark,
> 
> I don't understand this:
> > so with a ton of docs and a few uniques, you get a temp boost in the RAM
> > reqs until it sizes it down.
> 
> Sizes down??? Why is it called Cache indeed? And how SOLR uses it if it is
> not cache?
> 
> 
> And this:
> > A pointer for each doc.
> 
> Why can't we use (int) DocumentID? For me, it is natural; 64-bit pointer
to
> an object in RAM is not natural (in Lucene world)...
> 
> 
> So, is it [maxdoc]x[4-bytes], or [maxdoc]x[8-bytes]?...
> -Fuad
> 
> 
> 
>

RE: Lucene FieldCache memory requirements

Reply via email to