[ 
https://issues.apache.org/jira/browse/TIKA-2882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16849116#comment-16849116
 ] 

Tim Allison commented on TIKA-2882:
-----------------------------------

The vision (and Bob’s excellent example) for Tika 2.0 is to offer more 
modularity. We had broken the parsers into families of file formats, but we 
might also (or maybe we did?) consider breaking them into modules based on 
their requirements: native code, remote resources, etc. Perhaps this issue 
could be the motivator to get moving in that direction...
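
As a rough sketch of what splitting by requirement could look like, the 
parser-side code might depend only on a small transport-agnostic interface, 
with any HTTP-backed implementation living in a separate module. Every name 
below (DelegationSketch, RemoteTextService, RemoteDelegatingParser, the 
endpoint URL) is hypothetical and not part of Tika; the "parser" is a 
simplified stand-in, not Tika's Parser interface.

```java
import java.util.Objects;

// Hypothetical sketch: none of these types exist in Tika.
public class DelegationSketch {

    /** Transport-agnostic seam; an HTTP-backed implementation would live in its own module. */
    @FunctionalInterface
    interface RemoteTextService {
        String process(String endpoint, String payload);
    }

    /** Parser-side code: knows what to send, not how it is sent. */
    static final class RemoteDelegatingParser {
        private final RemoteTextService service;
        private final String endpoint;

        RemoteDelegatingParser(RemoteTextService service, String endpoint) {
            this.service = Objects.requireNonNull(service);
            this.endpoint = Objects.requireNonNull(endpoint);
        }

        /** Hands the text to whichever transport was injected and returns its reply. */
        String parse(String text) {
            return service.process(endpoint, text);
        }
    }

    public static void main(String[] args) {
        // Stand-in transport for illustration; a real one might wrap CXF
        // or any other JAX-RS client, chosen by the application.
        RemoteTextService fake = (endpoint, payload) -> "parsed:" + payload;
        RemoteDelegatingParser parser =
                new RemoteDelegatingParser(fake, "http://localhost/hypothetical");
        System.out.println(parser.parse("hello"));
    }
}
```

With this shape, an application on a different JAX-RS stack would supply its 
own RemoteTextService, and the module holding the parser would carry no REST 
client dependency at all.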

> Parsers should not include HTTP client code
> -------------------------------------------
>
>                 Key: TIKA-2882
>                 URL: https://issues.apache.org/jira/browse/TIKA-2882
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.21
>            Reporter: Jonathan Essex
>            Priority: Major
>
> Folks, does it really make sense for a parser to have a REST client built in?
> The GROBID and NLTKNERecogniser parsers use the Apache CXF client directly.
>  
> Since I don't use CXF and my entire app is built on a different JAX-RS stack,
> this just dropped me straight into dependency hell.
> Surely it would make more sense to keep the parsers... well, parsers... and
> build support for delegating parsing to other services into some higher level
> in the stack (such as the server, where the CXF dependency is more benign).
>  



