[ 
https://issues.apache.org/jira/browse/SOLR-10156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joel Bernstein updated SOLR-10156:
----------------------------------
    Description: 
The significantTerms Streaming Expression will emit a set of terms from a *text 
field* within a doc frequency range for a specific query. It will also score 
the terms based on how many times the terms appear in the result set, and how 
many times the terms appear in the corpus, and return the top N terms based on 
this significance score.

Syntax:

{code}
significantTerms(collection, 
                      q="any query", 
                           field="some_text_field", 
                           minDocFreq="5",   //optional default is 5 documents
                           maxDocFreq=".3", // optional default is no more then 
30% of the index (.3)
                           minTermlength="4",  // optional default is 4
                           limit="50")                // optional default is 20
{code}




  was:
The significantTerms Streaming Expression will emit a set of terms from a *text 
field* within a doc frequency range for a specific query. It will also score 
the terms based on how many times the terms appear in the result set, and how 
many times the terms appear in the corpus, and return the top N terms based on 
this significance score.

Syntax:

{code}
significantTerms(collection, 
                           q="any query", 
                           field="some_text_field", 
                           minDocFreq="5",   //optional default is 5 documents
                           maxDocFreq=".3", // optional default is no more then 
30% of the index (.3)
                           minTermlength="4",  // optional default is 4
                           limit="50")                // optional default is 20
{code}





> Add significantTerms Streaming Expression
> -----------------------------------------
>
>                 Key: SOLR-10156
>                 URL: https://issues.apache.org/jira/browse/SOLR-10156
>             Project: Solr
>          Issue Type: New Feature
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Joel Bernstein
>            Assignee: Joel Bernstein
>             Fix For: 6.5
>
>         Attachments: SOLR-10156.patch, SOLR-10156.patch, SOLR-10156.patch
>
>
> The significantTerms Streaming Expression will emit a set of terms from a 
> *text field* within a doc frequency range for a specific query. It will also 
> score the terms based on how many times the terms appear in the result set, 
> and how many times the terms appear in the corpus, and return the top N terms 
> based on this significance score.
> Syntax:
> {code}
> significantTerms(collection, 
>                       q="any query", 
>                            field="some_text_field", 
>                            minDocFreq="5",   //optional default is 5 documents
>                            maxDocFreq=".3", // optional default is no more 
> then 30% of the index (.3)
>                            minTermlength="4",  // optional default is 4
>                            limit="50")                // optional default is 
> 20
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to