[
https://issues.apache.org/jira/browse/SOLR-1045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12747070#action_12747070
]
Lance Norskog commented on SOLR-1045:
-------------------------------------
Map/Reduce would also be useful in the DataImportHandler. We're talking about
parallelizing analysis stacks that require a lot of CPU. I would rather push
this sort of thing out into the DIH - Solr Cell, for example. The DIH
declaration language could have something like the ANT parallelization
directives.
At this level of multi-threaded sophistication, Solr really wants to be an OSGi
application instead of a custom-built mini application server.
> Build Solr index using Hadoop MapReduce
> ---------------------------------------
>
> Key: SOLR-1045
> URL: https://issues.apache.org/jira/browse/SOLR-1045
> Project: Solr
> Issue Type: New Feature
> Reporter: Ning Li
> Attachments: SOLR-1045.0.patch
>
>
> The goal is a contrib module that builds Solr index using Hadoop MapReduce.
> It is different from the Solr support in Nutch. The Solr support in Nutch
> sends a document to a Solr server in a reduce task. Here, the goal is to
> build/update Solr index within map/reduce tasks. Also, it achieves better
> parallelism when the number of map tasks is greater than the number of reduce
> tasks, which is usually the case.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.