[ https://issues.apache.org/jira/browse/SOLR-10156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joel Bernstein updated SOLR-10156: ---------------------------------- Description: The significantTerms Streaming Expression will emit a set of terms from a *text field* within a doc frequency range for a specific query. It will also score the terms based on how many times the terms appear in the result set, and how many times the terms appear in the corpus, and return the top N terms based on this significance score. Syntax: {code} significantTerms(collection, q="any query", field="some_text_field", minDocFreq="5", //optional default is 5 documents maxDocFreq=".3", // optional default is no more then 30% of the index (.3) minTermlength="4", // optional default is 4 limit="50") // optional default is 20 {code} was: The significantTerms Streaming Expression will emit a set of terms from a *text field* within a doc frequency range for a specific query. It will also score the terms based on how many times the terms appear in the result set, and how many times the terms appear in the corpus, and return the top N terms based on this significance score. Syntax: {code} significantTerms(collection, q="any query", field="some_text_field", minDocFreq="5", //optional default is 5 documents maxDocFreq=".3", // optional default is no more then 30% of the index (.3) minTermlength="4", // optional default is 4 limit="50") // optional default is 20 {code} > Add significantTerms Streaming Expression > ----------------------------------------- > > Key: SOLR-10156 > URL: https://issues.apache.org/jira/browse/SOLR-10156 > Project: Solr > Issue Type: New Feature > Security Level: Public(Default Security Level. Issues are Public) > Reporter: Joel Bernstein > Assignee: Joel Bernstein > Fix For: 6.5 > > Attachments: SOLR-10156.patch, SOLR-10156.patch, SOLR-10156.patch > > > The significantTerms Streaming Expression will emit a set of terms from a > *text field* within a doc frequency range for a specific query. It will also > score the terms based on how many times the terms appear in the result set, > and how many times the terms appear in the corpus, and return the top N terms > based on this significance score. > Syntax: > {code} > significantTerms(collection, > q="any query", > field="some_text_field", > minDocFreq="5", //optional default is 5 documents > maxDocFreq=".3", // optional default is no more > then 30% of the index (.3) > minTermlength="4", // optional default is 4 > limit="50") // optional default is > 20 > {code} -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org