[jira] [Created] (TIKA-3388) Ole10Native attachments with non-ASCII filenames extracted with garbled names

2021-05-07 Thread Ross Johnson (Jira)
Ross Johnson created TIKA-3388: Summary: Ole10Native attachments with non-ASCII filenames extracted with garbled names Key: TIKA-3388 URL: https://issues.apache.org/jira/browse/TIKA-3388 Project: Tika

[jira] [Commented] (TIKA-3361) Improve intelligence of OCRStrategy=AUTO

2021-05-07 Thread Peter Kronenberg (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17341027#comment-17341027 ] Peter Kronenberg commented on TIKA-3361: So I'd like to try to restart this conver

[jira] [Resolved] (TIKA-3365) RTFParser to XMLContentHandler incorrectly interprets en dash.

2021-05-07 Thread Gordon Allen (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gordon Allen resolved TIKA-3365. Resolution: Not A Problem The output was being viewed in a non-UTF character set database, so there

[jira] [Commented] (TIKA-3164) Upgrade to POI 5.0.0 when available

2021-05-07 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17340873#comment-17340873 ] Tim Allison commented on TIKA-3164: NPE in wmf: https://bz.apache.org/bugzilla/show_bug

[jira] [Comment Edited] (TIKA-3164) Upgrade to POI 5.0.0 when available

2021-05-07 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17340862#comment-17340862 ] Tim Allison edited comment on TIKA-3164 at 5/7/21, 2:17 PM: Th

[jira] [Commented] (TIKA-3164) Upgrade to POI 5.0.0 when available

2021-05-07 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17340862#comment-17340862 ] Tim Allison commented on TIKA-3164: There was a multithreading, erm, feature in the Tik

[jira] [Commented] (TIKA-3387) Unexpected RuntimeException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser

2021-05-07 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17340777#comment-17340777 ] Tim Allison commented on TIKA-3387: Is more of the stacktrace available? Can you share

[jira] [Updated] (TIKA-3387) Unexpected RuntimeException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser

2021-05-07 Thread Manojkumar M (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manojkumar M updated TIKA-3387: Description: org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tik

[jira] [Updated] (TIKA-3387) Unexpected RuntimeException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser

2021-05-07 Thread Manojkumar M (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manojkumar M updated TIKA-3387: Description: org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tik

[jira] [Created] (TIKA-3387) Unexpected RuntimeException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser

2021-05-07 Thread Manojkumar M (Jira)
Manojkumar M created TIKA-3387: Summary: Unexpected RuntimeException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser Key: TIKA-3387 URL: https://issues.apache.org/jira/browse/TIKA-3387 Project: