[ 
https://issues.apache.org/jira/browse/TIKA-2293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15943301#comment-15943301
 ] 

Luis Filipe Nassif commented on TIKA-2293:
------------------------------------------

Just to contribute to the discussion, my experience says windows users not 
always have the correct version of Microsoft Redistributable Package installed, 
and 2015 Redist is very huge, several GBs. And the fact Tess4j only includes 
native libs for Windows is a limitation, users will still have to install (or 
compile) tesseract software. The SQLite3Parser includes native libs for 
windows, linux and mac, and even with those it is an optional dependecy.

>  Tess4jOCRParser - A simpler Java version of TesseractOCRParser
> ---------------------------------------------------------------
>
>                 Key: TIKA-2293
>                 URL: https://issues.apache.org/jira/browse/TIKA-2293
>             Project: Tika
>          Issue Type: Improvement
>          Components: ocr
>            Reporter: Thejan Wijesinghe
>             Fix For: 1.15
>
>
> Right now, TesseractOCRParser calls tesseract and imagemagick from command 
> line. Intention of this new parser "Tess4jOCRParser" is to use the Tess4J API 
> instead of the runtime.exec way to executing tesseract out of process.  



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to