[ https://issues.apache.org/jira/browse/SOLR-2462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
James Dyer updated SOLR-2462:
-----------------------------
    Attachment: SOLR-2462.patch

I guess I should have run that one myself too. This test is very similar to the ones in SpellCheckCollatorTest. While the ones in SCCT test whether or not it can collate properly, TSCR checks that the response it sends back is proper.

In any case, this is just another one of my brittle tests! Because we're using a different comparator, results with tied scores don't come back in exactly the same order as before, so this test now needs more than 5 tries to find the 2nd valid collation. I upped it from 5 to 10 and now it passes.

> Using spellcheck.collate can result in extremely high memory usage
> ------------------------------------------------------------------
>
>                 Key: SOLR-2462
>                 URL: https://issues.apache.org/jira/browse/SOLR-2462
>             Project: Solr
>          Issue Type: Bug
>          Components: spellchecker
>    Affects Versions: 3.1
>            Reporter: James Dyer
>            Assignee: Robert Muir
>            Priority: Critical
>             Fix For: 3.1.1, 4.0
>
>         Attachments: SOLR-2462.patch, SOLR-2462.patch, SOLR-2462.patch,
>                      SOLR-2462.patch, SOLR-2462.patch, SOLR-2462.patch, SOLR-2462.patch,
>                      SOLR-2462.patch, SOLR-2462.patch, SOLR-2462_3_1.patch
>
>
> When using "spellcheck.collate", class SpellPossibilityIterator creates a
> ranked list of *every* possible correction combination. But if returning
> several corrections per term, and if several words are misspelled, the
> existing algorithm uses a huge amount of memory.
> This bug was introduced with SOLR-2010. However, it is triggered any time
> "spellcheck.collate" is used. It is not necessary to use any features that
> were added with SOLR-2010.
> We were in Production with Solr for 1 1/2 days and this bug started taking
> our Solr servers down with "infinite" GC loops. It was pretty easy for this
> to happen, as occasionally a user will accidentally paste the URL into the
> Search box on our app. This URL results in a search with ~12 misspelled words.
> We have "spellcheck.count" set to 15.
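
To put the numbers from the description in perspective: with roughly 12 misspelled words and 15 suggestions per word, a ranked list of every combination would hold 15^12 = 129,746,337,890,625 entries. The Java sketch below is only an illustration of that arithmetic and of one way to bound memory; it is not the code in the attached patch. The class name CollationSketch, the methods countCombinations and bestCombinations, the score matrix, and the maxEvaluations cap are all hypothetical names chosen for this example.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.PriorityQueue;

/** Illustrative sketch only; names and approach are assumptions, not Solr's code. */
public class CollationSketch {

    /** Combinations when each of `terms` words has `perTerm` suggestions. */
    static long countCombinations(int terms, int perTerm) {
        long total = 1;
        for (int i = 0; i < terms; i++) {
            total = Math.multiplyExact(total, perTerm); // throws on long overflow
        }
        return total;
    }

    /** Sum of the per-term scores selected by one combination. */
    static int total(int[][] scores, int[] combo) {
        int sum = 0;
        for (int t = 0; t < combo.length; t++) {
            sum += scores[t][combo[t]];
        }
        return sum;
    }

    /**
     * Walks combinations odometer-style, stops after maxEvaluations, and keeps
     * only the `keep` lowest-scoring ones in a bounded heap.
     * scores[t][s] is a hypothetical score (lower is better) of suggestion s
     * for term t. Assumes every term has at least one suggestion.
     */
    static List<int[]> bestCombinations(int[][] scores, int keep, int maxEvaluations) {
        Comparator<int[]> byTotal = Comparator.comparingInt((int[] c) -> total(scores, c));
        // Max-heap: the worst retained combination sits at the head for eviction.
        PriorityQueue<int[]> heap = new PriorityQueue<>(byTotal.reversed());
        int[] index = new int[scores.length];
        int evaluated = 0;
        boolean more = scores.length > 0;
        while (more && evaluated < maxEvaluations) {
            heap.add(index.clone());
            if (heap.size() > keep) {
                heap.poll(); // evict the current worst, so memory stays O(keep)
            }
            evaluated++;
            // Advance the odometer: bump the last position, carrying leftwards.
            int pos = scores.length - 1;
            while (pos >= 0 && ++index[pos] == scores[pos].length) {
                index[pos] = 0;
                pos--;
            }
            more = pos >= 0; // pos < 0 means every position wrapped: all visited
        }
        List<int[]> result = new ArrayList<>(heap);
        result.sort(byTotal);
        return result;
    }

    public static void main(String[] args) {
        System.out.println(countCombinations(12, 15));
    }
}
```

Memory in this sketch is bounded by `keep` rather than by the number of combinations, and the evaluation cap bounds the running time. Because the walk is in plain odometer order, a cap can cut off good combinations that appear late, so a real implementation would presumably want to visit the most likely combinations first.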