[ 
https://issues.apache.org/jira/browse/TIKA-1540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14319525#comment-14319525
 ] 

Lewis John McGibbney commented on TIKA-1540:
--------------------------------------------

Hi [~aashish24]
 * regarding the use of two httpclient libraries, this needs to be cleaned up
 * I would suggest that we define a [Configuration-type 
object|https://github.com/apache/tika/blob/trunk/tika-parsers/src/main/java/org/apache/tika/parser/ocr/TesseractOCRConfig.java]
 for setting all of the external service characteristics such as URL and 
implementation specific parameters as expressed within the service call. An 
example can be seen on the 
[TesseractOCRParser.java|https://github.com/apache/tika/blob/trunk/tika-parsers/src/main/java/org/apache/tika/parser/ocr/TesseractOCRParser.java#L89].

On another level, I am thinking if this could be generalized?
I mean from looking at the code it is not obvious that it is a ImageParser, 
only you and I know this :) Does this make sense?


> New Tika plugin for image based feature extraction using computer vision 
> techniques
> -----------------------------------------------------------------------------------
>
>                 Key: TIKA-1540
>                 URL: https://issues.apache.org/jira/browse/TIKA-1540
>             Project: Tika
>          Issue Type: New Feature
>         Environment: cross platform
>            Reporter: Aashish Chaudhary
>            Assignee: Lewis John McGibbney
>              Labels: gsoc2015
>             Fix For: 1.8
>
>         Attachments: TIKA-vision.achaudhary.150209.patch.txt
>
>
> This will be a web-service client based parser to perform image feature 
> extraction using Computer Vision techniques. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to