On Fri, Jun 13, 2014 at 3:02 AM, Clemens Wyss DEV <clemens...@mysign.ch> wrote: >> limit how many fields have norms enabled > We have one index for approx 7000 pdfs (24GB). Of course no content is STOREd > (but ANALYZEd). This very index occupies 4GB on disk and the corresponding > IndexReader is 60MB. > Are norms per default enabled org.apache.lucene.document .TextField?
Yes. Norms are a good idea for "large text fields", e.g. body text or a catch all field, but usually not a good idea for tiny fields (e.g. title). >> use disk-based doc values not field cache > How is this done? Add XXXDocValuesField instead of e.g. StringField. >> etc. > such as? ;) Upgrade to the upcoming Lucene 4.9; there have been some improvements e.g. to norms compression. You can tune your terms index settings, but terms index usually doesn't use much RAM. You can fire up your up, get all searchers warmed, and take a heap dump and see what's using RAM. We can iterate from there. Mike McCandless http://blog.mikemccandless.com --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org