[ https://issues.apache.org/jira/browse/SOLR-2462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13044125#comment-13044125 ]
Robert Muir commented on SOLR-2462:
-----------------------------------

Hi James, when applying the latest patch, I noticed a test fail:

{noformat}
[junit] Testsuite: org.apache.solr.client.solrj.response.TestSpellCheckResponse
[junit] Testcase: testSpellCheckCollationResponse(org.apache.solr.client.solrj.response.TestSpellCheckResponse): FAILED
[junit]
[junit] junit.framework.AssertionFailedError:
[junit]     at org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1348)
[junit]     at org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1266)
[junit]     at org.apache.solr.client.solrj.response.TestSpellCheckResponse.testSpellCheckCollationResponse(TestSpellCheckResponse.java:153)
{noformat}

This seemed odd... maybe a comparator is off somewhere?

> Using spellcheck.collate can result in extremely high memory usage
> ------------------------------------------------------------------
>
>                 Key: SOLR-2462
>                 URL: https://issues.apache.org/jira/browse/SOLR-2462
>             Project: Solr
>          Issue Type: Bug
>          Components: spellchecker
>    Affects Versions: 3.1
>            Reporter: James Dyer
>            Assignee: Robert Muir
>            Priority: Critical
>             Fix For: 3.1.1, 4.0
>
>         Attachments: SOLR-2462.patch, SOLR-2462.patch, SOLR-2462.patch,
> SOLR-2462.patch, SOLR-2462.patch, SOLR-2462.patch, SOLR-2462.patch,
> SOLR-2462.patch, SOLR-2462_3_1.patch
>
>
> When using "spellcheck.collate", class SpellPossibilityIterator creates a
> ranked list of *every* possible correction combination. But if returning
> several corrections per term, and if several words are misspelled, the
> existing algorithm uses a huge amount of memory.
> This bug was introduced with SOLR-2010. However, it is triggered anytime
> "spellcheck.collate" is used. It is not necessary to use any features that
> were added with SOLR-2010.
> We were in Production with Solr for 1 1/2 days and this bug started taking
> our Solr servers down with "infinite" GC loops.
> It was pretty easy for this to happen, as occasionally a user will
> accidentally paste the URL into the Search box on our app. This URL
> results in a search with ~12 misspelled words. We have "spellcheck.count"
> set to 15.
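
To put the numbers from the report in perspective: with about 12 misspelled words and "spellcheck.count" set to 15, a ranked list of every correction combination could hold up to 15^12 entries. The sketch below only does that arithmetic. It is not Solr code: the class and method names are hypothetical, and it assumes every misspelled term receives the full 15 suggestions, so the result is an upper bound rather than a measured figure.

```java
import java.math.BigInteger;

/**
 * Illustrative only, not part of Solr. Counts how many correction
 * combinations exist when each misspelled term has the same number
 * of candidate corrections (an assumed worst case).
 */
final class CollationCombinations {

    /** Returns suggestionsPerTerm raised to the power misspelledTerms. */
    static BigInteger countCombinations(int misspelledTerms, int suggestionsPerTerm) {
        if (misspelledTerms < 0 || suggestionsPerTerm < 0) {
            throw new IllegalArgumentException("arguments must be non-negative");
        }
        // One independent choice per term, so the counts multiply.
        return BigInteger.valueOf(suggestionsPerTerm).pow(misspelledTerms);
    }

    public static void main(String[] args) {
        // ~12 misspelled words, spellcheck.count = 15 (figures from the report)
        System.out.println(countCombinations(12, 15)); // 129746337890625
    }
}
```

At roughly 1.3 * 10^14 combinations, materializing the whole list is not feasible in any heap, which is consistent with the "infinite" GC loops described above.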