[
https://issues.apache.org/jira/browse/SOLR-7632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18017089#comment-18017089
]
Eric Pugh commented on SOLR-7632:
---------------------------------
Is there any way to mark Tika as "deprecated" in Solr 9, so that WHEN we land a
replacement in Solr 10, we can remove it in the 10x line? And NOT have to
support it through Solr 11.
I'm thinking about the fact that Tika does bring a lot of dependencies, and
that the big win of seperating it out is no longer having that dependency
management work. It would be nice to experience fewer dependencies in Solr 10,
and not wait till Solr 11...
> Change the ExtractingRequestHandler to use Tika-Server
> ------------------------------------------------------
>
> Key: SOLR-7632
> URL: https://issues.apache.org/jira/browse/SOLR-7632
> Project: Solr
> Issue Type: Improvement
> Components: contrib - Solr Cell (Tika extraction)
> Reporter: Chris A. Mattmann
> Assignee: Jan Høydahl
> Priority: Major
> Labels: gsoc2017, memex, pull-request-available
> Time Spent: 40m
> Remaining Estimate: 0h
>
> It's a pain to upgrade Tika's jars all the times when we release, and if Tika
> fails it messes up the ExtractingRequestHandler (e.g., the document type
> caused Tika to fail, etc). A more reliable way and also separated, and easier
> to deploy version of the ExtractingRequestHandler would make a network call
> to the Tika JAXRS server, and then call Tika on the Solr server side, get the
> results and then index the information that way. I have a patch in the works
> from the DARPA Memex project and I hope to post it soon.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]