[ https://issues.apache.org/jira/browse/SOLR-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17705140#comment-17705140 ]
Jan Høydahl commented on SOLR-1526:
-----------------------------------
I'm no longer convinced that a tight integration between SolrJ and Tika makes
sense. If you're a Java app using SolrJ, you can just as well add Tika
(tika-pipes?) as a separate dependency and then put the extracted text into
a SolrInputDocument however you want.
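
To make the suggestion concrete, here is a minimal sketch of client-side
extraction with plain Tika (not tika-pipes) feeding a SolrInputDocument. It
assumes tika-core, the Tika parser modules, and solr-solrj are on the
classpath. The class name, the method toSolrDocument, and the field names
"id", "content" and "meta_*" are made up for illustration and would have to
match your own schema.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

import org.apache.solr.common.SolrInputDocument;
import org.apache.tika.metadata.Metadata;
import org.apache.tika.parser.AutoDetectParser;
import org.apache.tika.parser.ParseContext;
import org.apache.tika.sax.BodyContentHandler;
import org.xml.sax.ContentHandler;
import org.xml.sax.helpers.DefaultHandler;

public class ClientSideExtraction {

    /**
     * Parses the stream locally with Tika and copies the result into a
     * SolrInputDocument. Field names here are hypothetical examples.
     */
    static SolrInputDocument toSolrDocument(String id, InputStream in, boolean metadataOnly)
            throws Exception {
        AutoDetectParser parser = new AutoDetectParser();
        // When only metadata is wanted, discard body text instead of buffering it.
        // BodyContentHandler(-1) disables the default write limit.
        ContentHandler handler = metadataOnly ? new DefaultHandler() : new BodyContentHandler(-1);
        Metadata metadata = new Metadata();
        parser.parse(in, handler, metadata, new ParseContext());

        SolrInputDocument doc = new SolrInputDocument();
        doc.addField("id", id);
        for (String name : metadata.names()) {
            // Tika names such as "Content-Type" or "dc:title" contain characters
            // that are awkward in Solr field names, so replace them with "_".
            String field = "meta_" + name.replaceAll("[^A-Za-z0-9_]", "_");
            for (String value : metadata.getValues(name)) {
                doc.addField(field, value);
            }
        }
        if (!metadataOnly) {
            doc.addField("content", handler.toString());
        }
        return doc;
    }

    public static void main(String[] args) throws Exception {
        byte[] bytes = "hello from the client".getBytes(StandardCharsets.UTF_8);
        try (InputStream in = new ByteArrayInputStream(bytes)) {
            SolrInputDocument doc = toSolrDocument("doc-1", in, false);
            System.out.println(doc);
            // The document can then be sent with any SolrClient,
            // e.g. client.add("mycollection", doc) followed by a commit.
        }
    }
}
```

Only the extracted text and metadata go over the wire, not the original
rich document, and the metadataOnly flag covers the "just metadata" case
from the issue description.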
Close this?
> Client Side Tika integration
> ----------------------------
>
> Key: SOLR-1526
> URL: https://issues.apache.org/jira/browse/SOLR-1526
> Project: Solr
> Issue Type: New Feature
> Components: clients - java, SolrJ
> Reporter: Grant Ingersoll
> Priority: Minor
> Fix For: 4.9, 6.0
>
> Attachments: clientextraction.tar.gz
>
>
> Oftentimes it is cost-prohibitive to send full, rich documents over the
> wire. The contrib/extraction library has server-side integration with Tika,
> but it would be nice to have a client-side implementation as well. It should
> support both metadata and content, or just metadata.