[ 
https://issues.apache.org/jira/browse/TIKA-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343566#comment-14343566
 ] 

Nick Burch commented on TIKA-944:
---------------------------------

With the server as-is, you can get the metadata as text, JSON and XMP. You can 
get the text of the document as plain text or XHTML. You can also get a list of 
parsers, detectors, mime types, etc.
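
As a rough sketch of what those calls might look like from a client (Python, 
standard library only): the endpoint paths (/meta, /tika, /parsers), the Accept 
values and the localhost:9998 address are assumptions on my part rather than 
anything stated above, so check them against the server version you're running. 
build_request and fetch are just illustrative helper names.

```python
from urllib.request import Request, urlopen

# Assumption: tika-server is running locally on port 9998.
BASE_URL = "http://localhost:9998"


def build_request(path, accept, body=None):
    """Build (but do not send) a request for a tika-server endpoint.

    When a document body is supplied it is uploaded with PUT; without a
    body the request is a plain GET (e.g. for listing parsers).
    The Accept header selects the output format.
    """
    method = "PUT" if body is not None else "GET"
    return Request(
        BASE_URL + path,
        data=body,
        method=method,
        headers={"Accept": accept},
    )


def fetch(request):
    """Send a prepared request and return the response body as text."""
    with urlopen(request) as response:
        return response.read().decode("utf-8")


# Assumed endpoint/format pairs matching the capabilities listed above:
#   metadata as JSON      -> build_request("/meta", "application/json", doc_bytes)
#   document text (plain) -> build_request("/tika", "text/plain", doc_bytes)
#   document text (XHTML) -> build_request("/tika", "text/html", doc_bytes)
#   list of parsers       -> build_request("/parsers", "text/plain")
```

Something like fetch(build_request("/meta", "application/json", doc_bytes)) 
would then return the metadata, where doc_bytes holds the raw bytes of the 
file to analyse. Building the request separately from sending it makes it easy 
to inspect the method and headers without a server running.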

I'm not sure about language detection, though. Still, I believe we're a lot 
closer to feature parity between the server and the CLI now than we were in 2012!

> Extend tika-server API to be consistent with tika-app CLI
> ---------------------------------------------------------
>
>                 Key: TIKA-944
>                 URL: https://issues.apache.org/jira/browse/TIKA-944
>             Project: Tika
>          Issue Type: New Feature
>          Components: server
>    Affects Versions: 1.1
>         Environment: Any
>            Reporter: Jason Judge
>            Assignee: Chris A. Mattmann
>              Labels: exposed-functionality, tika-server
>
> The tika-server API (web service) provides a limited set of functionality 
> compared to the tika-app command-line version. Notable things missing are:
> 1. Language recognition.
> 2. Output in various formats (JSON for metadata, XHTML for the extracted 
> text).
> Those are the two main things that would be useful to me, but ideally the 
> server should be able to provide all the functionality that the command-line 
> app does, taking the command-line as the model to follow.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
