[jira] [Commented] (TIKA-1445) Figure out how to add Image metadata extraction to Tesseract parser

Lewis John McGibbney (JIRA) Tue, 18 Nov 2014 20:42:08 -0800

    [ 
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14217409#comment-14217409
 ]


Lewis John McGibbney commented on TIKA-1445:
--------------------------------------------

One final thing to mention is that *every* Extractor within Any23 is 
accompanied by an ExtractorFactory e.g.
https://github.com/apache/any23/blob/master/core/src/main/java/org/apache/any23/extractor/csv/CSVExtractor.java
https://github.com/apache/any23/blob/master/core/src/main/java/org/apache/any23/extractor/csv/CSVExtractorFactory.java
This is where [~p_ansell] performed some magic within his work in refactoring 
some code in Any23.

> Figure out how to add Image metadata extraction to Tesseract parser
> -------------------------------------------------------------------
>
>                 Key: TIKA-1445
>                 URL: https://issues.apache.org/jira/browse/TIKA-1445
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.8
>
>         Attachments: TIKA-1445.Mattmann.101214.patch.txt, 
> TIKA-1445.Palsulich.102614.patch, TIKA-1445_tallison_20141027.patch.txt, 
> TIKA-1445_tallison_v2_20141027.patch, TIKA-1445_tallison_v3_20141027.patch
>
>
> Now that Tesseract is the default image parser in Tika for many image types, 
> consider how to add back in the metadata extraction capabilities by the other 
> Image parsers.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (TIKA-1445) Figure out how to add Image metadata extraction to Tesseract parser

Reply via email to