[
https://issues.apache.org/jira/browse/SOLR-10716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joel Bernstein updated SOLR-10716:
----------------------------------
Description:
The termVectors Stream Evaluator returns tf-idf word vectors for a text field
in a list of tuples.
Syntax:
{code}
let(a=select(search(...), analyze(a, body) as terms),
b=termVectors(a, minDocFreq=".00", maxDocFreq="1.0"))
{code}
The code above performs a search, then uses the *select* stream and the
*analyze* evaluator to attach a list of terms to each document. The
*termVectors* evaluator then turns those term lists into tf-idf vectors.
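For readers unfamiliar with tf-idf, the idea can be sketched outside Solr in
a few lines of Python. This is only an illustration under stated assumptions,
not the code in the attached patch: the function name term_vectors is
hypothetical, the weight is the textbook count * log(N / df) rather than
whatever formula the evaluator uses, and min_doc_freq / max_doc_freq are
assumed to be fractions of the document set, as the values ".00" and "1.0"
in the syntax example suggest.

```python
import math
from collections import Counter


def term_vectors(docs, min_doc_freq=0.0, max_doc_freq=1.0):
    """Build tf-idf vectors from a list of term lists (one list per document).

    Hypothetical helper for illustration; not Solr's implementation.
    Returns (vocabulary, vectors), with one vector per document.
    """
    n = len(docs)
    if n == 0:
        return [], []

    # Document frequency: in how many documents each term occurs.
    df = Counter()
    for terms in docs:
        df.update(set(terms))

    # Assumption: the frequency bounds are fractions of the document set.
    vocab = sorted(
        t for t, count in df.items()
        if min_doc_freq <= count / n <= max_doc_freq
    )

    vectors = []
    for terms in docs:
        tf = Counter(terms)
        # Textbook weight: raw term count times log(N / df).
        vectors.append([tf[t] * math.log(n / df[t]) for t in vocab])
    return vocab, vectors


docs = [["solr", "stream"], ["solr", "vector", "vector"]]
vocab, vectors = term_vectors(docs)
print(vocab)
print(vectors)
```

With this weighting, a term that occurs in every document gets a weight of
zero, and lowering max_doc_freq removes such terms from the vocabulary
altogether.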
was:
The termVectors Stream Evaluator returns tf-idf word vectors for a text field
in a list of tuples. A Lucene analyzer can be specified to support flexible
word analysis.
The word vectors can then be used for various machine learning operations.
Syntax:
{code}
r = search(....)
v = termVectors(r, fieldA, analyzerField)
{code}
> Add termVectors Stream Evaluator
> --------------------------------
>
> Key: SOLR-10716
> URL: https://issues.apache.org/jira/browse/SOLR-10716
> Project: Solr
> Issue Type: New Feature
> Security Level: Public(Default Security Level. Issues are Public)
> Components: streaming expressions
> Reporter: Joel Bernstein
> Assignee: Joel Bernstein
> Fix For: 7.0
>
> Attachments: SOLR-10716.patch, SOLR-10716.patch