[ https://issues.apache.org/jira/browse/LUCENE-494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Grant Ingersoll reassigned LUCENE-494: -------------------------------------- Assignee: Grant Ingersoll > Analyzer for preventing overload of search service by queries with common > terms in large indexes > ------------------------------------------------------------------------------------------------ > > Key: LUCENE-494 > URL: https://issues.apache.org/jira/browse/LUCENE-494 > Project: Lucene - Java > Issue Type: New Feature > Components: Analysis > Reporter: Mark Harwood > Assignee: Grant Ingersoll > Priority: Minor > Attachments: QueryAutoStopWordAnalyzer.java, > QueryAutoStopWordAnalyzerTest.java > > > An analyzer used primarily at query time to wrap another analyzer and provide > a layer of protection > which prevents very common words from being passed into queries. For very > large indexes the cost > of reading TermDocs for a very common word can be high. This analyzer was > created after experience with > a 38 million doc index which had a term in around 50% of docs and was causing > TermQueries for > this term to take 2 seconds. > Use the various "addStopWords" methods in this class to automate the > identification and addition of > stop words found in an already existing index. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]