[ https://issues.apache.org/jira/browse/TIKA-169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837750#action_12837750 ]
Ingo Renner commented on TIKA-169:
----------------------------------
A servlet makes a lot of sense to me. Think of Solr, but with only the
extraction request handler. That way you could run a central metadata and
text extraction server without having to install Java and Tika on every host
that might need it, for example in a replicated CMS environment.
The scenario would be that a CMS tries to extract text and metadata from a
file but has no local Tika at hand. It would then send the file to a Tika
server and get the results back as XML or JSON, as Solr does.
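A minimal sketch of what the CMS side of that scenario might look like, in
Python to underline that the client host needs neither Java nor Tika. The
endpoint URL, the raw-bytes POST, and the JSON response shape
({"metadata": ..., "content": ...}) are all assumptions made for illustration;
the attached servlet does not support uploads yet and defines none of them.

```python
import json
import urllib.request

# Hypothetical upload endpoint; TIKA-169 does not define one.
TIKA_SERVER_URL = "http://tika.example.org:8080/tikaServlet/extract"


def build_extract_request(data, server_url=TIKA_SERVER_URL):
    """Build a POST request carrying the raw file bytes.

    The content type is left generic so the server can detect the real type.
    """
    return urllib.request.Request(
        server_url,
        data=data,
        method="POST",
        headers={
            "Content-Type": "application/octet-stream",
            "Accept": "application/json",
        },
    )


def parse_extract_response(body):
    """Decode the (assumed) JSON response into metadata and content."""
    result = json.loads(body.decode("utf-8"))
    return {
        "metadata": result.get("metadata", {}),
        "content": result.get("content", ""),
    }


def extract_remote(path, server_url=TIKA_SERVER_URL):
    """Send a local file to the extraction server and return the parsed result."""
    with open(path, "rb") as fh:
        request = build_extract_request(fh.read(), server_url)
    with urllib.request.urlopen(request, timeout=30) as response:
        return parse_extract_response(response.read())
```

A CMS would then call something like extract_remote("/var/cms/uploads/report.pdf")
(a made-up path) and index the returned text and metadata itself.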
> Tika Web Service Servlet
> ------------------------
>
> Key: TIKA-169
> URL: https://issues.apache.org/jira/browse/TIKA-169
> Project: Tika
> Issue Type: New Feature
> Components: general
> Affects Versions: 0.2
> Reporter: Rida Benjelloun
> Priority: Minor
> Attachments: tikaServlet.war
>
>
> A Tika servlet that uses a file or directory path to build a list of XML documents.
> The next version will allow file upload.
> Usage:
> //Extract document content and metadata
> http://localhost:8080/tikaServlet/?filePath=C:\test&start=0&rows=10
> //Extract metadata
> http://localhost:8080/tikaServlet/?filePath=C:\test&start=0&rows=10&extract=metadata
> //Extract document content
> http://localhost:8080/tikaServlet/?filePath=C:\test&start=0&rows=10&extract=content