large set of memory consumed by array init

2016-12-15 Thread Vincent Sevel
Hello, I have seen an unexpected behavior when setting a limit too high in a search. I index log files in my system. Each week I create a new index. At the end of the week the index is around 35 Gb. when I do a search with no date, I would create a MultiReader built out of the readers from the w

Re: Altering Term Frequency in Similarity

2016-12-15 Thread Robert Muir
Maybe have a look at SynonymQuery: https://github.com/apache/lucene-solr/blob/master/lucene/core/src/java/org/apache/lucene/search/SynonymQuery.java I think it does a similar thing to what you want, it sums up the frequencies of the synonyms and passes that sum to the similarity class as TF. On W