[jira] [Commented] (TIKA-1703) Can't Specify Tesseract Data Folder Distinct from Tesseract Executable Path
[ https://issues.apache.org/jira/browse/TIKA-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14654999#comment-14654999 ] Hudson commented on TIKA-1703: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #812 (See [https://builds.apache.org/job/tika-trunk-jdk1.7/812/]) Fix for TIKA-1703: Can't Specify Tesseract Data Folder Distinct from Tesseract Executable Path Contributed by Christian Wolfe this closes #56. (mattmann: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1694133) * /tika/trunk/CHANGES.txt * /tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/ocr/TesseractOCRConfig.java * /tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/ocr/TesseractOCRParser.java * /tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/ocr/TesseractOCRConfigTest.java * /tika/trunk/tika-parsers/src/test/resources/test-properties/TesseractOCRConfig-full.properties > Can't Specify Tesseract Data Folder Distinct from Tesseract Executable Path > --- > > Key: TIKA-1703 > URL: https://issues.apache.org/jira/browse/TIKA-1703 > Project: Tika > Issue Type: Bug > Components: parser >Affects Versions: 1.9 >Reporter: Christian Wolfe >Assignee: Chris A. Mattmann >Priority: Minor > Fix For: 1.11 > > > If a user specifies the path to the Tesseract executable using > {{TesseractOCRConfig.setTesseractPath}}, then Tika will assume that the > Tesseract config folder (usually referred to as the 'tessdata' folder) is in > the same location. This is usually true in a Windows environment, where > everything is installed into a central location. > However, this is not necessarily the case in a Linux environment. If one were > to build Tesseract from source, for example, the config folder will be > installed in a different location than the Tesseract executable. > One way to fix this would be to add a way to specify the location of the > Tesseract config folder separate from the path to the executable. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-1703) Can't Specify Tesseract Data Folder Distinct from Tesseract Executable Path
[ https://issues.apache.org/jira/browse/TIKA-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14654652#comment-14654652 ] ASF GitHub Bot commented on TIKA-1703: -- Github user asfgit closed the pull request at: https://github.com/apache/tika/pull/56 > Can't Specify Tesseract Data Folder Distinct from Tesseract Executable Path > --- > > Key: TIKA-1703 > URL: https://issues.apache.org/jira/browse/TIKA-1703 > Project: Tika > Issue Type: Bug > Components: parser >Affects Versions: 1.9 >Reporter: Christian Wolfe >Assignee: Chris A. Mattmann >Priority: Minor > Fix For: 1.11 > > > If a user specifies the path to the Tesseract executable using > {{TesseractOCRConfig.setTesseractPath}}, then Tika will assume that the > Tesseract config folder (usually referred to as the 'tessdata' folder) is in > the same location. This is usually true in a Windows environment, where > everything is installed into a central location. > However, this is not necessarily the case in a Linux environment. If one were > to build Tesseract from source, for example, the config folder will be > installed in a different location than the Tesseract executable. > One way to fix this would be to add a way to specify the location of the > Tesseract config folder separate from the path to the executable. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-1703) Can't Specify Tesseract Data Folder Distinct from Tesseract Executable Path
[ https://issues.apache.org/jira/browse/TIKA-1703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14652940#comment-14652940 ] ASF GitHub Bot commented on TIKA-1703: -- GitHub user taidan19 opened a pull request: https://github.com/apache/tika/pull/56 TIKA-1703 Add ability to specify Tesseract config path. Link to Jira ticket - https://issues.apache.org/jira/browse/TIKA-1703 You can merge this pull request into a Git repository by running: $ git pull https://github.com/taidan19/tika TIKA-1703 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/tika/pull/56.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #56 commit 86e8fdf187af5051812e1164c4cc3fef737a0644 Author: Christian Wolfe Date: 2015-08-04T00:54:23Z TIKA-1703 Add ability to specify Tesseract config path. > Can't Specify Tesseract Data Folder Distinct from Tesseract Executable Path > --- > > Key: TIKA-1703 > URL: https://issues.apache.org/jira/browse/TIKA-1703 > Project: Tika > Issue Type: Bug > Components: parser >Affects Versions: 1.9 >Reporter: Christian Wolfe >Priority: Minor > Fix For: 1.9 > > > If a user specifies the path to the Tesseract executable using > {{TesseractOCRConfig.setTesseractPath}}, then Tika will assume that the > Tesseract config folder (usually referred to as the 'tessdata' folder) is in > the same location. This is usually true in a Windows environment, where > everything is installed into a central location. > However, this is not necessarily the case in a Linux environment. If one were > to build Tesseract from source, for example, the config folder will be > installed in a different location than the Tesseract executable. > One way to fix this would be to add a way to specify the location of the > Tesseract config folder separate from the path to the executable. -- This message was sent by Atlassian JIRA (v6.3.4#6332)