[ https://issues.apache.org/jira/browse/SOLR-2462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
James Dyer updated SOLR-2462: ----------------------------- Attachment: SOLR-2462.patch This is along the lines of what I initially intended on doing but didn't have time back when I first submitted this. I felt particularly guilty in gathering all these RankedSpellPossibility objects in cases where the user isn't even using new functionality from SOLR-2010 (upgrade from 1.4 then collate becomes more expensive!). Thank you for another opportunity to absolve my guilt. I ran these tests and they all pass: SpellPossibilityIteratorTest, SpellCheckCollatorTest, SpellCheckComponentTest & DistributedSpellCheckComponentTest > Using spellcheck.collate can result in extremely high memory usage > ------------------------------------------------------------------ > > Key: SOLR-2462 > URL: https://issues.apache.org/jira/browse/SOLR-2462 > Project: Solr > Issue Type: Bug > Components: spellchecker > Affects Versions: 3.1 > Reporter: James Dyer > Priority: Critical > Fix For: 3.1.1, 4.0 > > Attachments: SOLR-2462.patch, SOLR-2462.patch, SOLR-2462.patch, > SOLR-2462.patch, SOLR-2462_3_1.patch > > > When using "spellcheck.collate", class SpellPossibilityIterator creates a > ranked list of *every* possible correction combination. But if returning > several corrections per term, and if several words are misspelled, the > existing algorithm uses a huge amount of memory. > This bug was introduced with SOLR-2010. However, it is triggered anytime > "spellcheck.collate" is used. It is not necessary to use any features that > were added with SOLR-2010. > We were in Production with Solr for 1 1/2 days and this bug started taking > our Solr servers down with "infinite" GC loops. It was pretty easy for this > to happen as occasionally a user will accidently paste the URL into the > Search box on our app. This URL results in a search with ~12 misspelled > words. We have "spellcheck.count" set to 15. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org