[
https://issues.apache.org/jira/browse/UIMA-3096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13715251#comment-13715251
]
Erik Faessler commented on UIMA-3096:
-------------------------------------
I also plan to do something similar for ElasticSearch (though I cannot
guarantee it yet). Should such extensions generally reside within LuCas?
We would certainly not place the ElasticSearch-related part in SolrCAS, which
is why I bring this up. A new project would be too heavyweight in my opinion.
So in the end, bundling the extensions within LuCas may be the best idea. Right?
> A LuCas extension that allows indexing the Lucene documents created by LuCas
> into a Solr server.
> ------------------------------------------------------------------------------------------------
>
> Key: UIMA-3096
> URL: https://issues.apache.org/jira/browse/UIMA-3096
> Project: UIMA
> Issue Type: Improvement
> Components: Sandbox-Lucas
> Affects Versions: 2.4.0Addons
> Reporter: Erik Faessler
> Priority: Minor
> Attachments: lucasToSolr.patch
>
> Original Estimate: 4h
> Remaining Estimate: 4h
>
> Add a UIMA component extending LuceneDocumentAE that converts the Lucene
> document instances created by LuCas into Solr's PreAnalyzed field format
> (http://wiki.apache.org/solr/PreAnalyzedField). The converted documents are
> then sent in batches to Solr using the SolrJ API.
> On the Solr side, PreAnalyzedUpdateProcessorFactory
> (http://lucene.apache.org/solr/4_3_1/solr-core/org/apache/solr/update/processor/FieldMutatingUpdateProcessorFactory.html)
> can be used to restrict the pre-analyzed field values to an existing Solr
> schema. If the LuCas mapping file matches the Solr schema, it is as
> easy as this:
> In solrconfig.xml, add this updateRequestProcessorChain:
> <updateRequestProcessorChain name="pre-analyzed-json">
>   <processor class="solr.PreAnalyzedUpdateProcessorFactory">
>     <str name="fieldRegex">.*</str>
>     <str name="parser">json</str>
>   </processor>
>   <processor class="solr.RunUpdateProcessorFactory" />
> </updateRequestProcessorChain>
> Then, add this chain to the default update handler:
> <requestHandler name="/update" class="solr.UpdateRequestHandler">
>   <lst name="defaults">
>     <str name="update.chain">pre-analyzed-json</str>
>   </lst>
> </requestHandler>
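
For illustration only, here is a rough sketch of what a pre-analyzed field
value for the "json" parser might look like, and how documents could be
grouped into batches before sending. It is not taken from lucasToSolr.patch.
The function names to_preanalyzed_json and batches are hypothetical, and the
JSON keys ("v", "str", "tokens", "t", "s", "e", "i") are assumed from the
PreAnalyzedField wiki page and should be checked against it. The patch itself
would do this in Java via SolrJ; Python is used here only to keep the sketch
short.

```python
import json
from itertools import islice


def to_preanalyzed_json(stored_text, tokens):
    """Serialize one field value in the assumed PreAnalyzed JSON layout.

    tokens is an iterable of (term, start_offset, end_offset, position_increment).
    The key names are an assumption based on the Solr PreAnalyzedField wiki page.
    """
    return json.dumps({
        "v": "1",             # format version (assumed)
        "str": stored_text,   # stored value of the field
        "tokens": [
            {"t": term, "s": start, "e": end, "i": pos_inc}
            for term, start, end, pos_inc in tokens
        ],
    })


def batches(items, size):
    """Yield consecutive lists of at most `size` items."""
    it = iter(items)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk
```

Each batch would then be handed to the Solr client in a single add request,
so that the number of round trips stays small.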
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira