Joel Bernstein created SOLR-9240:
------------------------------------

             Summary: Add the partitionKeys parameter to the topic() Streaming Expression
                 Key: SOLR-9240
                 URL: https://issues.apache.org/jira/browse/SOLR-9240
             Project: Solr
          Issue Type: Improvement
            Reporter: Joel Bernstein


Currently the topic() function doesn't accept a partitionKeys parameter like 
the search() function does. This means the topic() function can't be wrapped by 
the parallel() function to run across worker nodes.
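
To illustrate why a partition key is needed before a stream can be split across workers, here is a minimal sketch of hash partitioning. It is a conceptual illustration only, not Solr's implementation: the names worker_for and partition, the CRC32 hash, and the sample documents are all assumptions made for this example.

```python
import zlib


def worker_for(key: str, num_workers: int) -> int:
    """Map a partition key value to a worker index using a stable hash.

    zlib.crc32 is used because it is deterministic across processes;
    the choice of hash is an assumption of this sketch.
    """
    return zlib.crc32(key.encode("utf-8")) % num_workers


def partition(docs, key_field, num_workers):
    """Split docs into one disjoint slice per worker, keyed on key_field."""
    slices = [[] for _ in range(num_workers)]
    for doc in docs:
        index = worker_for(str(doc[key_field]), num_workers)
        slices[index].append(doc)
    return slices


# Hypothetical batch of new documents, partitioned on "id" across 3 workers.
docs = [{"id": f"doc{i}"} for i in range(10)]
slices = partition(docs, "id", 3)
```

Because every document hashes to exactly one worker, the slices are disjoint and together cover the whole batch, so no document is processed twice or skipped.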

It would be useful to support parallelizing the topic function because it would 
provide a general purpose parallelized approach for processing batches of data 
as they enter the index.

For example, this would allow a classify() function to be wrapped around a 
topic() function to classify documents in parallel across worker nodes.

Sample syntax:

{code}
parallel(daemon(update(classify(topic(..., partitionKeys="id")))))
{code}

The example above would send a daemon out to the worker nodes, where it would 
classify all new documents returned by the topic() function. The update() 
function would send the output of classify() to a SolrCloud collection for 
indexing.
