[jira] [Commented] (TIKA-2823) Remove printstacktrace in XMLReaderUtils
[ https://issues.apache.org/jira/browse/TIKA-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16755278#comment-16755278 ] Hudson commented on TIKA-2823: -- UNSTABLE: Integrated in Jenkins build tika-2.x-windows #378 (See [https://builds.apache.org/job/tika-2.x-windows/378/]) TIKA-2823 (tallison: rev a14e645dc7d754ccdb7bba52dd43df42483368e7) * (edit) tika-core/src/main/java/org/apache/tika/utils/XMLReaderUtils.java > Remove printstacktrace in XMLReaderUtils > > > Key: TIKA-2823 > URL: https://issues.apache.org/jira/browse/TIKA-2823 > Project: Tika > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Tim Allison >Priority: Major > Fix For: 1.21 > > > Many apologies... -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (TIKA-2823) Remove printstacktrace in XMLReaderUtils
[ https://issues.apache.org/jira/browse/TIKA-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16755256#comment-16755256 ] Hudson commented on TIKA-2823: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #157 (See [https://builds.apache.org/job/tika-branch-1x/157/]) TIKA-2823 (tallison: [https://github.com/apache/tika/commit/4a0c26be148ddabaa2d90fa2fe17f3f4c762b8ff]) * (edit) tika-core/src/main/java/org/apache/tika/utils/XMLReaderUtils.java > Remove printstacktrace in XMLReaderUtils > > > Key: TIKA-2823 > URL: https://issues.apache.org/jira/browse/TIKA-2823 > Project: Tika > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Tim Allison >Priority: Major > Fix For: 1.21 > > > Many apologies... -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (TIKA-2823) Remove printstacktrace in XMLReaderUtils
[ https://issues.apache.org/jira/browse/TIKA-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16755253#comment-16755253 ] Hudson commented on TIKA-2823: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1622 (See [https://builds.apache.org/job/Tika-trunk/1622/]) TIKA-2823 (tallison: [https://github.com/apache/tika/commit/a14e645dc7d754ccdb7bba52dd43df42483368e7]) * (edit) tika-core/src/main/java/org/apache/tika/utils/XMLReaderUtils.java > Remove printstacktrace in XMLReaderUtils > > > Key: TIKA-2823 > URL: https://issues.apache.org/jira/browse/TIKA-2823 > Project: Tika > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Tim Allison >Priority: Major > Fix For: 1.21 > > > Many apologies... -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Resolved] (TIKA-2823) Remove printstacktrace in XMLReaderUtils
[ https://issues.apache.org/jira/browse/TIKA-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2823. --- Resolution: Fixed Fix Version/s: 1.21 The horror... Sorry, again. :( > Remove printstacktrace in XMLReaderUtils > > > Key: TIKA-2823 > URL: https://issues.apache.org/jira/browse/TIKA-2823 > Project: Tika > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Tim Allison >Priority: Major > Fix For: 1.21 > > > Many apologies... -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (TIKA-2823) Remove printstacktrace in XMLReaderUtils
Tim Allison created TIKA-2823: - Summary: Remove printstacktrace in XMLReaderUtils Key: TIKA-2823 URL: https://issues.apache.org/jira/browse/TIKA-2823 Project: Tika Issue Type: Improvement Reporter: Tim Allison Assignee: Tim Allison Many apologies... -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (TIKA-2147) ClassCastException on a valid Word template
[ https://issues.apache.org/jira/browse/TIKA-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16755219#comment-16755219 ] Tim Allison commented on TIKA-2147: --- Try bumping your version to 1.20, and you should be good. > ClassCastException on a valid Word template > --- > > Key: TIKA-2147 > URL: https://issues.apache.org/jira/browse/TIKA-2147 > Project: Tika > Issue Type: Bug > Components: parser >Affects Versions: 1.13 > Environment: Windows 7 x64, JVM 1.8.0_101 >Reporter: Seva Alekseyev >Priority: Major > Labels: sax_docx_fixes > Fix For: 1.20 > > Attachments: Forefront Fax.dotx, basicresume.docx > > > On the attached document template, which opens fine in Word, the Tika parser > throws the following error: > java.lang.ClassCastException: org.apache.poi.POIXMLDocumentPart cannot be > cast to org.apache.poi.xwpf.usermodel.XWPFDocument > at > org.apache.poi.xwpf.usermodel.XWPFFootnotes.getXWPFDocument(XWPFFootnotes.java:162) > at > org.apache.poi.xwpf.usermodel.XWPFFootnote.(XWPFFootnote.java:47) > at > org.apache.poi.xwpf.usermodel.XWPFFootnotes.onDocumentRead(XWPFFootnotes.java:95) > at > org.apache.poi.POIXMLDocumentPart._invokeOnDocumentRead(POIXMLDocumentPart.java:658) > at > org.apache.poi.xwpf.usermodel.XWPFDocument.onDocumentRead(XWPFDocument.java:235) > at org.apache.poi.POIXMLDocument.load(POIXMLDocument.java:160) > at > org.apache.poi.xwpf.usermodel.XWPFDocument.(XWPFDocument.java:124) > at > org.apache.poi.xwpf.extractor.XWPFWordExtractor.(XWPFWordExtractor.java:58) > at > org.apache.poi.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:237) > at > org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:86) > at > org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:87) -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (TIKA-2147) ClassCastException on a valid Word template
[ https://issues.apache.org/jira/browse/TIKA-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-2147: -- Fix Version/s: (was: 1.15) 1.20 > ClassCastException on a valid Word template > --- > > Key: TIKA-2147 > URL: https://issues.apache.org/jira/browse/TIKA-2147 > Project: Tika > Issue Type: Bug > Components: parser >Affects Versions: 1.13 > Environment: Windows 7 x64, JVM 1.8.0_101 >Reporter: Seva Alekseyev >Priority: Major > Labels: sax_docx_fixes > Fix For: 1.20 > > Attachments: Forefront Fax.dotx, basicresume.docx > > > On the attached document template, which opens fine in Word, the Tika parser > throws the following error: > java.lang.ClassCastException: org.apache.poi.POIXMLDocumentPart cannot be > cast to org.apache.poi.xwpf.usermodel.XWPFDocument > at > org.apache.poi.xwpf.usermodel.XWPFFootnotes.getXWPFDocument(XWPFFootnotes.java:162) > at > org.apache.poi.xwpf.usermodel.XWPFFootnote.(XWPFFootnote.java:47) > at > org.apache.poi.xwpf.usermodel.XWPFFootnotes.onDocumentRead(XWPFFootnotes.java:95) > at > org.apache.poi.POIXMLDocumentPart._invokeOnDocumentRead(POIXMLDocumentPart.java:658) > at > org.apache.poi.xwpf.usermodel.XWPFDocument.onDocumentRead(XWPFDocument.java:235) > at org.apache.poi.POIXMLDocument.load(POIXMLDocument.java:160) > at > org.apache.poi.xwpf.usermodel.XWPFDocument.(XWPFDocument.java:124) > at > org.apache.poi.xwpf.extractor.XWPFWordExtractor.(XWPFWordExtractor.java:58) > at > org.apache.poi.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:237) > at > org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:86) > at > org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:87) -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (TIKA-2147) ClassCastException on a valid Word template
[ https://issues.apache.org/jira/browse/TIKA-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16754903#comment-16754903 ] Jawahar commented on TIKA-2147: --- Facing this exception while trying to extract content from above .docx and .dotx files using tika-app-1.19.1 jar . This is the command used by me - "java -jar tika-app-1.19.1.jar basicresume.docx --text" Exception in thread "main" org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@10d68fcd at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:282) at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143) at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:209) at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:496) at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:149) Caused by: java.lang.ClassCastException: org.apache.poi.ooxml.POIXMLDocumentPart cannot be cast to org.apache.poi.xwpf.usermodel.XWPFDocument at org.apache.poi.xwpf.usermodel.XWPFAbstractFootnotesEndnotes.getXWPFDocument(XWPFAbstractFootnotesEndnotes.java:73) at org.apache.poi.xwpf.usermodel.XWPFAbstractFootnoteEndnote.(XWPFAbstractFootnoteEndnote.java:70) at org.apache.poi.xwpf.usermodel.XWPFFootnote.(XWPFFootnote.java:42) at org.apache.poi.xwpf.usermodel.XWPFFootnotes.onDocumentRead(XWPFFootnotes.java:129) at org.apache.poi.ooxml.POIXMLDocumentPart._invokeOnDocumentRead(POIXMLDocumentPart.java:720) at org.apache.poi.xwpf.usermodel.XWPFDocument.onDocumentRead(XWPFDocument.java:262) at org.apache.poi.ooxml.POIXMLDocument.load(POIXMLDocument.java:184) at org.apache.poi.xwpf.usermodel.XWPFDocument.(XWPFDocument.java:138) at org.apache.poi.xwpf.extractor.XWPFWordExtractor.(XWPFWordExtractor.java:60) at org.apache.poi.ooxml.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:228) at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:116) at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:110) at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) ... 5 more > ClassCastException on a valid Word template > --- > > Key: TIKA-2147 > URL: https://issues.apache.org/jira/browse/TIKA-2147 > Project: Tika > Issue Type: Bug > Components: parser >Affects Versions: 1.13 > Environment: Windows 7 x64, JVM 1.8.0_101 >Reporter: Seva Alekseyev >Priority: Major > Labels: sax_docx_fixes > Fix For: 1.15 > > Attachments: Forefront Fax.dotx, basicresume.docx > > > On the attached document template, which opens fine in Word, the Tika parser > throws the following error: > java.lang.ClassCastException: org.apache.poi.POIXMLDocumentPart cannot be > cast to org.apache.poi.xwpf.usermodel.XWPFDocument > at > org.apache.poi.xwpf.usermodel.XWPFFootnotes.getXWPFDocument(XWPFFootnotes.java:162) > at > org.apache.poi.xwpf.usermodel.XWPFFootnote.(XWPFFootnote.java:47) > at > org.apache.poi.xwpf.usermodel.XWPFFootnotes.onDocumentRead(XWPFFootnotes.java:95) > at > org.apache.poi.POIXMLDocumentPart._invokeOnDocumentRead(POIXMLDocumentPart.java:658) > at > org.apache.poi.xwpf.usermodel.XWPFDocument.onDocumentRead(XWPFDocument.java:235) > at org.apache.poi.POIXMLDocument.load(POIXMLDocument.java:160) > at > org.apache.poi.xwpf.usermodel.XWPFDocument.(XWPFDocument.java:124) > at > org.apache.poi.xwpf.extractor.XWPFWordExtractor.(XWPFWordExtractor.java:58) > at > org.apache.poi.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:237) > at > org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:86) > at > org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:87) -- This message was sent by Atlassian JIRA (v7.6.3#76005)