[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-09-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17204920#comment-17204920 ] Tim Allison commented on TIKA-3094: --- I think this is resolved, and the fix will come out

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-09-30 Thread Abhijit Rajwade (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17204502#comment-17204502 ] Abhijit Rajwade commented on TIKA-3094: --- [~tallison] [~bob] [~hudson] I don't know

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-06-02 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17123914#comment-17123914 ] Hudson commented on TIKA-3094: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #339 (See

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-07 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17102182#comment-17102182 ] Hudson commented on TIKA-3094: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1813 (See [

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-07 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17102137#comment-17102137 ] Bob Paulin commented on TIKA-3094: -- Looks like the jaxb error is not so much an issue wit

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-05 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17100054#comment-17100054 ] Hudson commented on TIKA-3094: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1812 (See [

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17099964#comment-17099964 ] Tim Allison commented on TIKA-3094: --- Thank you, [~bob]! On 3, that was my idiocy in not

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-05 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17099868#comment-17099868 ] Bob Paulin commented on TIKA-3094: -- Thanks [~tallison]. For #2 JAXB was removed from th

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17099859#comment-17099859 ] Tim Allison commented on TIKA-3094: --- Hi [~bob], I'll take #3. On 2, if you comment out

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-05 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17099848#comment-17099848 ] Bob Paulin commented on TIKA-3094: -- Hey [~tallison] I ran a build on Java 8 and Java 11 a

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-04 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17099485#comment-17099485 ] Hudson commented on TIKA-3094: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1811 (See [

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-04 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17099475#comment-17099475 ] Tim Allison commented on TIKA-3094: --- Thank you [~bob]! For kicks, I ran the osgi'd Tika

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-03 Thread Abhishek Chauhan (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17098690#comment-17098690 ] Abhishek Chauhan commented on TIKA-3094: Hello [~tallison] [~bob], I have incre

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-30 Thread Abhijit Rajwade (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17096324#comment-17096324 ] Abhijit Rajwade commented on TIKA-3094: --- Yes [~bob] thanks a lot for the prompt fix.

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-30 Thread Abhishek Chauhan (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17096319#comment-17096319 ] Abhishek Chauhan commented on TIKA-3094: Really thankful to [~bob] for resolving t

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-29 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17095964#comment-17095964 ] Hudson commented on TIKA-3094: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #337 (See

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-29 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17095917#comment-17095917 ] Bob Paulin commented on TIKA-3094: -- Fixed with https://github.com/apache/tika/commit/678

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-29 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17095883#comment-17095883 ] Bob Paulin commented on TIKA-3094: -- Embedding SparseBitSet in Embed-Dependency fixes the

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-29 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17095342#comment-17095342 ] Tim Allison commented on TIKA-3094: --- Y, exactly right. > Apache Tika fails to extract t

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Abhijit Rajwade (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17095133#comment-17095133 ] Abhijit Rajwade commented on TIKA-3094: --- I am working with [~abchauha] on this issue

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17094735#comment-17094735 ] Tim Allison commented on TIKA-3094: --- Thank you, [~bob]! > Apache Tika fails to extract

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Abhishek Chauhan (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17094638#comment-17094638 ] Abhishek Chauhan commented on TIKA-3094: Glad! Thanks for sharing this [~bob].

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17094614#comment-17094614 ] Bob Paulin commented on TIKA-3094: -- Thanks [~abchauha]. The build process adds OSGi spe

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Abhishek Chauhan (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17094594#comment-17094594 ] Abhishek Chauhan commented on TIKA-3094: [~bob] Please find the .pptx file attache

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17094517#comment-17094517 ] Bob Paulin commented on TIKA-3094: -- If SparseBitSet is embedded in the tika-bundle that t

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17094466#comment-17094466 ] Tim Allison commented on TIKA-3094: --- [~bobpaulin], is this something we can fix within T

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Abhishek Chauhan (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17094407#comment-17094407 ] Abhishek Chauhan commented on TIKA-3094: [~tallison] We are calling using OSGi bun

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17094385#comment-17094385 ] Tim Allison commented on TIKA-3094: --- How are you calling Tika? Are you using the osgi bu