[
https://issues.apache.org/jira/browse/SOLR-10156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joel Bernstein updated SOLR-10156:
----------------------------------
Description:
The significantTerms Streaming Expression emits, for a specific query, a set
of terms from a *text field* whose document frequency falls within a given
range. It scores each term based on how many times the term appears in the
result set and how many times it appears in the corpus, and returns the top N
terms by this significance score.
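As a rough illustration of the filtering and scoring described above, the following Python sketch works on in-memory token lists rather than a Solr index. The function name, its parameters, and in particular the scoring formula are assumptions made for this example; the description does not state the exact formula significantTerms uses.

```python
import math
from collections import Counter


def significant_terms(result_docs, corpus_docs, min_doc_freq, max_doc_freq, limit):
    """Toy analogue of significantTerms over lists of token lists.

    result_docs  -- documents matched by the query (the result set)
    corpus_docs  -- every document in the collection (the corpus)
    min_doc_freq -- minimum number of corpus documents a term must appear in
    max_doc_freq -- maximum fraction of the corpus a term may appear in
    limit        -- number of top-scoring terms to return
    """
    num_docs = len(corpus_docs)
    # Count each term once per document, in the corpus and in the result set.
    background = Counter(t for doc in corpus_docs for t in set(doc))
    foreground = Counter(t for doc in result_docs for t in set(doc))
    max_df = max_doc_freq * num_docs

    scored = []
    for term, fg in foreground.items():
        bg = background[term]
        # Keep only terms inside the document frequency range.
        if bg < min_doc_freq or bg > max_df:
            continue
        # ASSUMED score: rewards terms frequent in the result set and
        # penalizes terms frequent across the corpus. Not Solr's formula.
        score = math.log(fg + 1) * math.log((num_docs + 1) / (bg + 1))
        scored.append((term, score))

    # Highest score first; ties broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored[:limit]
```

The call returns at most `limit` `(term, score)` pairs, highest score first, where `result_docs` holds the documents matched by the query and `corpus_docs` holds the whole collection.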
Syntax:
{code}
significantTerms(collection,
                 q="any query",
                 field="some_text_field",
                 minDocFreq="5",    // optional, default is 5 documents
                 maxDocFreq=".3",   // optional, default is no more than 30% of the index (.3)
                 minTermLength="4", // optional, default is 4
                 limit="50")        // optional, default is 20
{code}
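A minimal sketch of how a client might assemble this expression and the URL for sending it to a collection's `/stream` handler as the `expr` parameter. The helper names, the base URL, and the collection name are hypothetical; the defaults mirror the ones listed in the syntax above, and values containing double quotes are rejected rather than escaped, since escaping rules are not covered here.

```python
from urllib.parse import urlencode


def significant_terms_expr(collection, q, field, min_doc_freq=5,
                           max_doc_freq=0.3, min_term_length=4, limit=20):
    """Build a significantTerms expression string (hypothetical helper)."""
    params = [
        ("q", q),
        ("field", field),
        ("minDocFreq", min_doc_freq),
        ("maxDocFreq", max_doc_freq),
        ("minTermLength", min_term_length),
        ("limit", limit),
    ]
    parts = [collection]
    for name, value in params:
        text = str(value)
        # Escaping is out of scope for this sketch, so refuse embedded quotes.
        if '"' in text:
            raise ValueError(f"{name} must not contain a double quote")
        parts.append(f'{name}="{text}"')
    return "significantTerms(" + ", ".join(parts) + ")"


def stream_url(solr_base_url, collection, expr):
    """Return a GET URL that passes the expression as the expr parameter."""
    query = urlencode({"expr": expr})
    return f"{solr_base_url.rstrip('/')}/{collection}/stream?{query}"


# Example values are placeholders, not taken from the issue.
expr = significant_terms_expr("collection1", "any query", "some_text_field",
                              limit=50)
url = stream_url("http://localhost:8983/solr", "collection1", expr)
```

Nothing is sent over the network here; `url` can be handed to any HTTP client.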
was:
The significantTerms Streaming Expression will emit a set of terms from a *text
field* within a doc frequency range for a specific query. It will also score
the terms based on how many times the terms appear in the result set, and how
many times the terms appear in the corpus, and return the top N terms based on
this significance score.
Syntax:
{code}
significantTerms(collection,
                 q="any query",
                 field="some_text_field",
                 minDocFreq="5",    // optional, default is 5 documents
                 maxDocFreq=".3",   // optional, default is no more than 30% of the index (.3)
                 minTermlength="4", // optional, default is 4
                 limit="50")        // optional, default is 20
{code}
> Add significantTerms Streaming Expression
> -----------------------------------------
>
> Key: SOLR-10156
> URL: https://issues.apache.org/jira/browse/SOLR-10156
> Project: Solr
> Issue Type: New Feature
> Security Level: Public(Default Security Level. Issues are Public)
> Reporter: Joel Bernstein
> Assignee: Joel Bernstein
> Fix For: 6.5
>
> Attachments: SOLR-10156.patch, SOLR-10156.patch, SOLR-10156.patch
>
>
> The significantTerms Streaming Expression emits, for a specific query, a set
> of terms from a *text field* whose document frequency falls within a given
> range. It scores each term based on how many times the term appears in the
> result set and how many times it appears in the corpus, and returns the top N
> terms by this significance score.
> Syntax:
> {code}
> significantTerms(collection,
>                  q="any query",
>                  field="some_text_field",
>                  minDocFreq="5",    // optional, default is 5 documents
>                  maxDocFreq=".3",   // optional, default is no more than 30% of the index (.3)
>                  minTermLength="4", // optional, default is 4
>                  limit="50")        // optional, default is 20
> {code}