[ https://issues.apache.org/jira/browse/SOLR-2462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13043992#comment-13043992 ]
Robert Muir commented on SOLR-2462:
-----------------------------------

{quote}
Maybe we can just put a hard limit on the number of possibilities it will evaluate? It could be really high like a million or something. We could make it a configurable parameter, something like "spellcheck.maxCollationPossibilitiesToEval", but then again that seems silly. Who would really change it if a million was the default?
{quote}

Well, I think this sounds much better than being time-based?

And you know, use your best judgement as a default, definitely I'm ok with it as long as it's configurable and has good defaults.

> Using spellcheck.collate can result in extremely high memory usage
> ------------------------------------------------------------------
>
>                 Key: SOLR-2462
>                 URL: https://issues.apache.org/jira/browse/SOLR-2462
>             Project: Solr
>          Issue Type: Bug
>          Components: spellchecker
>    Affects Versions: 3.1
>            Reporter: James Dyer
>            Priority: Critical
>             Fix For: 3.1.1, 4.0
>
>         Attachments: SOLR-2462.patch, SOLR-2462.patch, SOLR-2462.patch,
> SOLR-2462.patch, SOLR-2462.patch, SOLR-2462.patch, SOLR-2462.patch,
> SOLR-2462_3_1.patch
>
>
> When using "spellcheck.collate", class SpellPossibilityIterator creates a
> ranked list of *every* possible correction combination. But if returning
> several corrections per term, and if several words are misspelled, the
> existing algorithm uses a huge amount of memory.
> This bug was introduced with SOLR-2010. However, it is triggered anytime
> "spellcheck.collate" is used. It is not necessary to use any features that
> were added with SOLR-2010.
> We were in Production with Solr for 1 1/2 days and this bug started taking
> our Solr servers down with "infinite" GC loops. It was pretty easy for this
> to happen as occasionally a user will accidentally paste the URL into the
> Search box on our app. This URL results in a search with ~12 misspelled
> words. We have "spellcheck.count" set to 15.

--
This message is automatically generated by JIRA.
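To put rough numbers on the report above: if each of ~12 misspelled words really did come back with 15 suggestions (an upper bound, since spellcheck.count is a maximum), a list of every combination would hold 15^12 entries. The sketch below is not Solr's SpellPossibilityIterator and not the attached patch. The class name, method names, and the maxEvaluations argument are made up for illustration. It only counts the combinations and shows one way a hard cap on evaluated possibilities could work, walking them odometer-style and leaving out the ranking step.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch; names are illustrative and do not come from Solr.
final class CappedCollationSketch {

    /** Number of combinations when each term takes one of its suggestions (saturates at Long.MAX_VALUE). */
    static long countCombinations(int[] suggestionsPerTerm) {
        long total = 1;
        for (int n : suggestionsPerTerm) {
            if (n == 0) {
                return 0;
            }
            if (total > Long.MAX_VALUE / n) {
                return Long.MAX_VALUE; // would overflow; report "too many"
            }
            total *= n;
        }
        return total;
    }

    /**
     * Walks the combinations like an odometer (rightmost term changes fastest)
     * and stops once maxEvaluations combinations have been produced, so the
     * result size is bounded by the cap instead of by the full product.
     */
    static List<List<String>> enumerate(List<List<String>> suggestions, int maxEvaluations) {
        List<List<String>> out = new ArrayList<>();
        for (List<String> perTerm : suggestions) {
            if (perTerm.isEmpty()) {
                return out; // a term with no suggestions yields no combinations
            }
        }
        int[] idx = new int[suggestions.size()];
        while (out.size() < maxEvaluations) {
            List<String> combo = new ArrayList<>();
            for (int i = 0; i < idx.length; i++) {
                combo.add(suggestions.get(i).get(idx[i]));
            }
            out.add(combo);

            // Advance the odometer; carry leftwards when a position wraps.
            int pos = idx.length - 1;
            while (pos >= 0 && ++idx[pos] == suggestions.get(pos).size()) {
                idx[pos] = 0;
                pos--;
            }
            if (pos < 0) {
                break; // every combination has been visited
            }
        }
        return out;
    }
}
```

With 12 terms and 15 suggestions each, countCombinations returns 129746337890625, while enumerate with a cap of one million never holds more than one million combinations regardless of how many words were pasted into the search box.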