[jira] [Commented] (TIKA-2261) TikaOcr giving different result across platforms

2017-02-08 Thread Sandeepan (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15858992#comment-15858992 ] Sandeepan commented on TIKA-2261: [~talli...@mitre.org] Where do I find rotation.py? Can you please point

[jira] [Commented] (TIKA-2261) TikaOcr giving different result across platforms

2017-02-08 Thread Sandeepan (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15858988#comment-15858988 ] Sandeepan commented on TIKA-2261: [~talli...@mitre.org] I'll try that. Thanks. [~lfcnassif] Actually

[jira] [Updated] (TIKA-2261) TikaOcr giving different result across platforms

2017-02-08 Thread Sandeepan (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandeepan updated TIKA-2261: Attachment: 4.png. This file's output on Mac vs. Ubuntu (only the first two lines). Mac [-~-] With the

[jira] [Closed] (TIKA-2260) Batch Processing fails -- causeForTermination='MAIN_LOOP_EXCEPTION_NO_RESTART'

2017-02-08 Thread Ravi kanth Goud Machapur (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ravi kanth Goud Machapur closed TIKA-2260. Resolution: Fixed. Looks like it was a permissions issue on Linux. > Batch

[jira] [Commented] (TIKA-2261) TikaOcr giving different result across platforms

2017-02-08 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15858588#comment-15858588 ] Luis Filipe Nassif commented on TIKA-2261: Have you checked if the TESSDATA_PREFIX environment

FW: FINAL REMINDER: CFP for ApacheCon closes February 11th

2017-02-08 Thread Allison, Timothy B.
Alright, looks like we didn't get around to pushing for a content track...is anyone else planning to present? Tika could still fit within: Big Data "Search and Databases" "Success stories, failures..." Data Science and Engineering "Organizing and structuring data for analysis" "Analysis

[jira] [Commented] (TIKA-2261) TikaOcr giving different result across platforms

2017-02-08 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15858405#comment-15858405 ] Tim Allison commented on TIKA-2261: Is there a chance that you have {{rotation.py}} or {{ImageMagick}}

[jira] [Commented] (TIKA-2260) Batch Processing fails -- causeForTermination='MAIN_LOOP_EXCEPTION_NO_RESTART'

2017-02-08 Thread Ravi kanth Goud Machapur (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15858142#comment-15858142 ] Ravi kanth Goud Machapur commented on TIKA-2260: Here is the log: INFO about to start

FINAL REMINDER: CFP for ApacheCon closes February 11th

2017-02-08 Thread Rich Bowen
Dear Apache Enthusiast, This is your FINAL reminder that the Call for Papers (CFP) for ApacheCon Miami is closing this weekend - February 11th. This is your final opportunity to submit a talk for consideration at this event. This year, we are running several mini conferences in conjunction with

[jira] [Created] (TIKA-2261) TikaOcr giving different result across platforms

2017-02-08 Thread Sandeepan (JIRA)
Sandeepan created TIKA-2261: Summary: TikaOcr giving different result across platforms Key: TIKA-2261 URL: https://issues.apache.org/jira/browse/TIKA-2261 Project: Tika Issue Type: Bug

[jira] [Commented] (TIKA-1422) org.apache.tika.parser.mail.RFC822ParserTest fails

2017-02-08 Thread Sandeepan (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15858006#comment-15858006 ] Sandeepan commented on TIKA-1422: [~thaichat04] I am also getting different results when using Tesseract

[jira] [Comment Edited] (TIKA-2038) A more accurate facility for detecting Charset Encoding of HTML documents

2017-02-08 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15857926#comment-15857926 ] Tim Allison edited comment on TIKA-2038 at 2/8/17 12:29 PM: bq. Since it seems

[jira] [Updated] (TIKA-2038) A more accurate facility for detecting Charset Encoding of HTML documents

2017-02-08 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-2038: Attachment: tld_text_html.xlsx bq. Since it seems that in this test the potential charset in meta