[jira] [Commented] (TIKA-1703) Can't Specify Tesseract Data Folder Distinct from Tesseract Executable Path

2015-08-05 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14654999#comment-14654999
 ] 

Hudson commented on TIKA-1703:
--

SUCCESS: Integrated in tika-trunk-jdk1.7 #812 (See 
[https://builds.apache.org/job/tika-trunk-jdk1.7/812/])
Fix for TIKA-1703: Can't Specify Tesseract Data Folder Distinct from Tesseract 
Executable Path Contributed by Christian Wolfe  this closes 
#56. (mattmann: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1694133)
* /tika/trunk/CHANGES.txt
* 
/tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/ocr/TesseractOCRConfig.java
* 
/tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/ocr/TesseractOCRParser.java
* 
/tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/ocr/TesseractOCRConfigTest.java
* 
/tika/trunk/tika-parsers/src/test/resources/test-properties/TesseractOCRConfig-full.properties


> Can't Specify Tesseract Data Folder Distinct from Tesseract Executable Path
> ---
>
> Key: TIKA-1703
> URL: https://issues.apache.org/jira/browse/TIKA-1703
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 1.9
>Reporter: Christian Wolfe
>Assignee: Chris A. Mattmann
>Priority: Minor
> Fix For: 1.11
>
>
> If a user specifies the path to the Tesseract executable using 
> {{TesseractOCRConfig.setTesseractPath}}, then Tika will assume that the 
> Tesseract config folder (usually referred to as the 'tessdata' folder) is in 
> the same location. This is usually true in a Windows environment, where 
> everything is installed into a central location.
> However, this is not necessarily the case in a Linux environment. If one were 
> to build Tesseract from source, for example, the config folder will be 
> installed in a different location than the Tesseract executable.
> One way to fix this would be to add a way to specify the location of the 
> Tesseract config folder separate from the path to the executable.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (TIKA-1703) Can't Specify Tesseract Data Folder Distinct from Tesseract Executable Path

2015-08-04 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14654652#comment-14654652
 ] 

ASF GitHub Bot commented on TIKA-1703:
--

Github user asfgit closed the pull request at:

https://github.com/apache/tika/pull/56


> Can't Specify Tesseract Data Folder Distinct from Tesseract Executable Path
> ---
>
> Key: TIKA-1703
> URL: https://issues.apache.org/jira/browse/TIKA-1703
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 1.9
>Reporter: Christian Wolfe
>Assignee: Chris A. Mattmann
>Priority: Minor
> Fix For: 1.11
>
>
> If a user specifies the path to the Tesseract executable using 
> {{TesseractOCRConfig.setTesseractPath}}, then Tika will assume that the 
> Tesseract config folder (usually referred to as the 'tessdata' folder) is in 
> the same location. This is usually true in a Windows environment, where 
> everything is installed into a central location.
> However, this is not necessarily the case in a Linux environment. If one were 
> to build Tesseract from source, for example, the config folder will be 
> installed in a different location than the Tesseract executable.
> One way to fix this would be to add a way to specify the location of the 
> Tesseract config folder separate from the path to the executable.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (TIKA-1703) Can't Specify Tesseract Data Folder Distinct from Tesseract Executable Path

2015-08-03 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14652940#comment-14652940
 ] 

ASF GitHub Bot commented on TIKA-1703:
--

GitHub user taidan19 opened a pull request:

https://github.com/apache/tika/pull/56

TIKA-1703 Add ability to specify Tesseract config path.

Link to Jira ticket - https://issues.apache.org/jira/browse/TIKA-1703

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/taidan19/tika TIKA-1703

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/tika/pull/56.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #56


commit 86e8fdf187af5051812e1164c4cc3fef737a0644
Author: Christian Wolfe 
Date:   2015-08-04T00:54:23Z

TIKA-1703 Add ability to specify Tesseract config path.




> Can't Specify Tesseract Data Folder Distinct from Tesseract Executable Path
> ---
>
> Key: TIKA-1703
> URL: https://issues.apache.org/jira/browse/TIKA-1703
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 1.9
>Reporter: Christian Wolfe
>Priority: Minor
> Fix For: 1.9
>
>
> If a user specifies the path to the Tesseract executable using 
> {{TesseractOCRConfig.setTesseractPath}}, then Tika will assume that the 
> Tesseract config folder (usually referred to as the 'tessdata' folder) is in 
> the same location. This is usually true in a Windows environment, where 
> everything is installed into a central location.
> However, this is not necessarily the case in a Linux environment. If one were 
> to build Tesseract from source, for example, the config folder will be 
> installed in a different location than the Tesseract executable.
> One way to fix this would be to add a way to specify the location of the 
> Tesseract config folder separate from the path to the executable.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)