[jira] [Commented] (TIKA-2106) "hocr" case on Linux fails, but works on OSX. Related to TIKA-2093

2016-09-30 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15537818#comment-15537818 ] Hudson commented on TIKA-2106: -- SUCCESS: Integrated in Jenkins build tika-2.x #155 (See

[jira] [Commented] (TIKA-2106) "hocr" case on Linux fails, but works on OSX. Related to TIKA-2093

2016-09-30 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15537814#comment-15537814 ] Hudson commented on TIKA-2106: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1113 (See

tika-2.x-windows - Build # 59 - Still Failing

2016-09-30 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-2.x-windows (build #59) Status: Still Failing Check console output at https://builds.apache.org/job/tika-2.x-windows/59/ to view the results.

[jira] [Commented] (TIKA-2106) "hocr" case on Linux fails, but works on OSX. Related to TIKA-2093

2016-09-30 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15537507#comment-15537507 ] Hudson commented on TIKA-2106: -- FAILURE: Integrated in Jenkins build tika-2.x-windows #59 (See

[jira] [Resolved] (TIKA-2106) "hocr" case on Linux fails, but works on OSX. Related to TIKA-2093

2016-09-30 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2106. --- Resolution: Fixed Thank you! > "hocr" case on Linux fails, but works on OSX. Related to TIKA-2093 >

[jira] [Commented] (TIKA-2106) "hocr" case on Linux fails, but works on OSX. Related to TIKA-2093

2016-09-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15537423#comment-15537423 ] ASF GitHub Bot commented on TIKA-2106: -- Github user asfgit closed the pull request at:

[GitHub] tika pull request #136: TIKA-2106. Need to lowercase the output file to matc...

2016-09-30 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/tika/pull/136 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[jira] [Assigned] (TIKA-2106) "hocr" case on Linux fails, but works on OSX. Related to TIKA-2093

2016-09-30 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison reassigned TIKA-2106: - Assignee: Tim Allison > "hocr" case on Linux fails, but works on OSX. Related to TIKA-2093 >

[jira] [Commented] (TIKA-2106) "hocr" case on Linux fails, but works on OSX. Related to TIKA-2093

2016-09-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15536891#comment-15536891 ] ASF GitHub Bot commented on TIKA-2106: -- GitHub user epugh opened a pull request:

[GitHub] tika pull request #136: TIKA-2106. Need to lowercase the output file to matc...

2016-09-30 Thread epugh
GitHub user epugh opened a pull request: https://github.com/apache/tika/pull/136 TIKA-2106. Need to lowercase the output file to match the format passed to tesseract cmd line. You can merge this pull request into a Git repository by running: $ git pull
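The pull request title above describes the fix only in words. A minimal sketch of the idea follows, assuming the output file name was being built from an upper-case type name while the tesseract command line received the lower-case format name. On a case-sensitive Linux filesystem the two names refer to different files, whereas a default case-insensitive OSX filesystem would treat them as the same. The class, enum, and method names here are hypothetical and are not taken from the Tika source.

```java
import java.util.Locale;

// Hypothetical illustration only; these names do not come from the Tika code base.
class OcrOutputName {

    // Hypothetical stand-in for the parser's output type setting.
    enum OutputType { TXT, HOCR }

    // File extension that matches the lower-case format name
    // assumed to be passed on the tesseract command line.
    static String extensionFor(OutputType type) {
        return type.name().toLowerCase(Locale.ROOT);
    }

    // Name of the file the caller should read back after the OCR run.
    static String outputFileName(String outputBase, OutputType type) {
        return outputBase + "." + extensionFor(type);
    }

    public static void main(String[] args) {
        // Using the enum name directly yields an upper-case extension.
        String mismatched = "page1." + OutputType.HOCR.name();
        // Lowercasing first yields the extension in the same case as the format name.
        String matched = outputFileName("page1", OutputType.HOCR);

        System.out.println(mismatched); // page1.HOCR
        System.out.println(matched);    // page1.hocr
    }
}
```

`Locale.ROOT` keeps the lowercasing independent of the JVM's default locale.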

[jira] [Created] (TIKA-2106) "hocr" case on Linux fails, but works on OSX. Related to TIKA-2093

2016-09-30 Thread Eric Pugh (JIRA)
Eric Pugh created TIKA-2106: --- Summary: "hocr" case on Linux fails, but works on OSX. Related to TIKA-2093 Key: TIKA-2106 URL: https://issues.apache.org/jira/browse/TIKA-2106 Project: Tika Issue

[jira] [Resolved] (TIKA-2105) Unable to process documents with french accents in filenames

2016-09-30 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2105. --- Resolution: Not A Problem > Unable to process documents with french accents in filenames >

[jira] [Commented] (TIKA-2105) Unable to process documents with french accents in filenames

2016-09-30 Thread susserj (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15536541#comment-15536541 ] susserj commented on TIKA-2105: --- Thank you very much for your assistance. I was finally able to get it to

[jira] [Commented] (TIKA-2105) Unable to process documents with french accents in filenames

2016-09-30 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15536397#comment-15536397 ] Tim Allison commented on TIKA-2105: --- Try adding the -J flag, that will output to JSON and any exceptions

[jira] [Commented] (TIKA-2105) Unable to process documents with french accents in filenames

2016-09-30 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15536396#comment-15536396 ] Tim Allison commented on TIKA-2105: --- I got this to work from a .bat script. The trick was that I had to

[jira] [Issue Comment Deleted] (TIKA-2105) Unable to process documents with french accents in filenames

2016-09-30 Thread susserj (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] susserj updated TIKA-2105: -- Comment: was deleted (was: Hi Tim When I added the -I -o to my command line I got a bunch of zero byte files

[jira] [Commented] (TIKA-2105) Unable to process documents with french accents in filenames

2016-09-30 Thread susserj (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15536371#comment-15536371 ] susserj commented on TIKA-2105: --- Hi Tim When I added the -I -o to my command line I got a bunch of zero

[jira] [Commented] (TIKA-2105) Unable to process documents with french accents in filenames

2016-09-30 Thread susserj (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15536363#comment-15536363 ] susserj commented on TIKA-2105: --- Hi Tim I tried adding chcp 65001 which didn't work and then I tried chcp

[jira] [Commented] (TIKA-2105) Unable to process documents with french accents in filenames

2016-09-30 Thread susserj (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15536346#comment-15536346 ] susserj commented on TIKA-2105: --- Hi Tim When I added the -I -o to my command line I got a bunch of zero

[jira] [Commented] (TIKA-2105) Unable to process documents with french accents in filenames

2016-09-30 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15536286#comment-15536286 ] Tim Allison commented on TIKA-2105: --- Try this:

[jira] [Commented] (TIKA-2105) Unable to process documents with french accents in filenames

2016-09-30 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15536258#comment-15536258 ] Tim Allison commented on TIKA-2105: --- That's probably a Windows/bat scripting issue. I'll test some

[jira] [Created] (TIKA-2105) Unable to process documents with french accents in filenames

2016-09-30 Thread susserj (JIRA)
susserj created TIKA-2105: - Summary: Unable to process documents with french accents in filenames Key: TIKA-2105 URL: https://issues.apache.org/jira/browse/TIKA-2105 Project: Tika Issue Type: Bug

[jira] [Commented] (TIKA-2094) Error parsing .doc file with visio embed

2016-09-30 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15535839#comment-15535839 ] Tim Allison commented on TIKA-2094: --- Right, sorry. I was doing that from within the Tika project. In a

[jira] [Resolved] (TIKA-2093) Add hOCR output type to the TesseractOCRParser

2016-09-30 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2093. --- Resolution: Fixed W00t! Thank you, again, for the PR! > Add hOCR output type to the

[jira] [Commented] (TIKA-2099) Tar files without magic bytes are sporadically detected as text

2016-09-30 Thread Robin Schimpf (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15535154#comment-15535154 ] Robin Schimpf commented on TIKA-2099: - It seems like the ZipContainerDetector never gets called in