[jira] [Commented] (TIKA-2823) Remove printstacktrace in XMLReaderUtils

2019-01-29 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/TIKA-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16755278#comment-16755278
 ] 

Hudson commented on TIKA-2823:
--

UNSTABLE: Integrated in Jenkins build tika-2.x-windows #378 (See 
[https://builds.apache.org/job/tika-2.x-windows/378/])
TIKA-2823 (tallison: rev a14e645dc7d754ccdb7bba52dd43df42483368e7)
* (edit) tika-core/src/main/java/org/apache/tika/utils/XMLReaderUtils.java


> Remove printstacktrace in XMLReaderUtils
> ---
>
>          Key: TIKA-2823
>          URL: https://issues.apache.org/jira/browse/TIKA-2823
>      Project: Tika
>   Issue Type: Improvement
>     Reporter: Tim Allison
>     Assignee: Tim Allison
>     Priority: Major
>      Fix For: 1.21
>
> Many apologies...



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (TIKA-2823) Remove printstacktrace in XMLReaderUtils

2019-01-29 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/TIKA-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16755256#comment-16755256
 ] 

Hudson commented on TIKA-2823:
--

SUCCESS: Integrated in Jenkins build tika-branch-1x #157 (See 
[https://builds.apache.org/job/tika-branch-1x/157/])
TIKA-2823 (tallison: 
[https://github.com/apache/tika/commit/4a0c26be148ddabaa2d90fa2fe17f3f4c762b8ff])
* (edit) tika-core/src/main/java/org/apache/tika/utils/XMLReaderUtils.java







[jira] [Commented] (TIKA-2823) Remove printstacktrace in XMLReaderUtils

2019-01-29 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/TIKA-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16755253#comment-16755253
 ] 

Hudson commented on TIKA-2823:
--

SUCCESS: Integrated in Jenkins build Tika-trunk #1622 (See 
[https://builds.apache.org/job/Tika-trunk/1622/])
TIKA-2823 (tallison: 
[https://github.com/apache/tika/commit/a14e645dc7d754ccdb7bba52dd43df42483368e7])
* (edit) tika-core/src/main/java/org/apache/tika/utils/XMLReaderUtils.java







[jira] [Resolved] (TIKA-2823) Remove printstacktrace in XMLReaderUtils

2019-01-29 Thread Tim Allison (JIRA)


 [ 
https://issues.apache.org/jira/browse/TIKA-2823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison resolved TIKA-2823.
---
   Resolution: Fixed
Fix Version/s: 1.21

The horror... Sorry, again. :(






[jira] [Created] (TIKA-2823) Remove printstacktrace in XMLReaderUtils

2019-01-29 Thread Tim Allison (JIRA)
Tim Allison created TIKA-2823:
-

     Summary: Remove printstacktrace in XMLReaderUtils
         Key: TIKA-2823
         URL: https://issues.apache.org/jira/browse/TIKA-2823
     Project: Tika
  Issue Type: Improvement
    Reporter: Tim Allison
    Assignee: Tim Allison


Many apologies...





[jira] [Commented] (TIKA-2147) ClassCastException on a valid Word template

2019-01-29 Thread Tim Allison (JIRA)


[ 
https://issues.apache.org/jira/browse/TIKA-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16755219#comment-16755219
 ] 

Tim Allison commented on TIKA-2147:
---

Try bumping your version to 1.20, and you should be good.

> ClassCastException on a valid Word template
> ---
>
>              Key: TIKA-2147
>              URL: https://issues.apache.org/jira/browse/TIKA-2147
>          Project: Tika
>       Issue Type: Bug
>       Components: parser
> Affects Versions: 1.13
>      Environment: Windows 7 x64, JVM 1.8.0_101
>         Reporter: Seva Alekseyev
>         Priority: Major
>           Labels: sax_docx_fixes
>          Fix For: 1.20
>
>      Attachments: Forefront Fax.dotx, basicresume.docx
>
> On the attached document template, which opens fine in Word, the Tika parser
> throws the following error:
> java.lang.ClassCastException: org.apache.poi.POIXMLDocumentPart cannot be cast to org.apache.poi.xwpf.usermodel.XWPFDocument
>   at org.apache.poi.xwpf.usermodel.XWPFFootnotes.getXWPFDocument(XWPFFootnotes.java:162)
>   at org.apache.poi.xwpf.usermodel.XWPFFootnote.<init>(XWPFFootnote.java:47)
>   at org.apache.poi.xwpf.usermodel.XWPFFootnotes.onDocumentRead(XWPFFootnotes.java:95)
>   at org.apache.poi.POIXMLDocumentPart._invokeOnDocumentRead(POIXMLDocumentPart.java:658)
>   at org.apache.poi.xwpf.usermodel.XWPFDocument.onDocumentRead(XWPFDocument.java:235)
>   at org.apache.poi.POIXMLDocument.load(POIXMLDocument.java:160)
>   at org.apache.poi.xwpf.usermodel.XWPFDocument.<init>(XWPFDocument.java:124)
>   at org.apache.poi.xwpf.extractor.XWPFWordExtractor.<init>(XWPFWordExtractor.java:58)
>   at org.apache.poi.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:237)
>   at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:86)
>   at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:87)





[jira] [Updated] (TIKA-2147) ClassCastException on a valid Word template

2019-01-29 Thread Tim Allison (JIRA)


 [ 
https://issues.apache.org/jira/browse/TIKA-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison updated TIKA-2147:
--
Fix Version/s:     (was: 1.15)
                   1.20






[jira] [Commented] (TIKA-2147) ClassCastException on a valid Word template

2019-01-29 Thread Jawahar (JIRA)


[ 
https://issues.apache.org/jira/browse/TIKA-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16754903#comment-16754903
 ] 

Jawahar commented on TIKA-2147:
---

I am hitting this exception while trying to extract content from the .docx and
.dotx files attached above, using the tika-app-1.19.1 jar.
This is the command I used: "java -jar tika-app-1.19.1.jar basicresume.docx --text"

Exception in thread "main" org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@10d68fcd
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:282)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
    at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:209)
    at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:496)
    at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:149)
Caused by: java.lang.ClassCastException: org.apache.poi.ooxml.POIXMLDocumentPart cannot be cast to org.apache.poi.xwpf.usermodel.XWPFDocument
    at org.apache.poi.xwpf.usermodel.XWPFAbstractFootnotesEndnotes.getXWPFDocument(XWPFAbstractFootnotesEndnotes.java:73)
    at org.apache.poi.xwpf.usermodel.XWPFAbstractFootnoteEndnote.<init>(XWPFAbstractFootnoteEndnote.java:70)
    at org.apache.poi.xwpf.usermodel.XWPFFootnote.<init>(XWPFFootnote.java:42)
    at org.apache.poi.xwpf.usermodel.XWPFFootnotes.onDocumentRead(XWPFFootnotes.java:129)
    at org.apache.poi.ooxml.POIXMLDocumentPart._invokeOnDocumentRead(POIXMLDocumentPart.java:720)
    at org.apache.poi.xwpf.usermodel.XWPFDocument.onDocumentRead(XWPFDocument.java:262)
    at org.apache.poi.ooxml.POIXMLDocument.load(POIXMLDocument.java:184)
    at org.apache.poi.xwpf.usermodel.XWPFDocument.<init>(XWPFDocument.java:138)
    at org.apache.poi.xwpf.extractor.XWPFWordExtractor.<init>(XWPFWordExtractor.java:60)
    at org.apache.poi.ooxml.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:228)
    at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:116)
    at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:110)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    ... 5 more


> ClassCastException on a valid Word template
> ---
>
>              Key: TIKA-2147
>              URL: https://issues.apache.org/jira/browse/TIKA-2147
>          Project: Tika
>       Issue Type: Bug
>       Components: parser
> Affects Versions: 1.13
>      Environment: Windows 7 x64, JVM 1.8.0_101
>         Reporter: Seva Alekseyev
>         Priority: Major
>           Labels: sax_docx_fixes
>          Fix For: 1.15
>
>      Attachments: Forefront Fax.dotx, basicresume.docx
>
> On the attached document template, which opens fine in Word, the Tika parser
> throws the following error:
> java.lang.ClassCastException: org.apache.poi.POIXMLDocumentPart cannot be cast to org.apache.poi.xwpf.usermodel.XWPFDocument
>   at org.apache.poi.xwpf.usermodel.XWPFFootnotes.getXWPFDocument(XWPFFootnotes.java:162)
>   at org.apache.poi.xwpf.usermodel.XWPFFootnote.<init>(XWPFFootnote.java:47)
>   at org.apache.poi.xwpf.usermodel.XWPFFootnotes.onDocumentRead(XWPFFootnotes.java:95)
>   at org.apache.poi.POIXMLDocumentPart._invokeOnDocumentRead(POIXMLDocumentPart.java:658)
>   at org.apache.poi.xwpf.usermodel.XWPFDocument.onDocumentRead(XWPFDocument.java:235)
>   at org.apache.poi.POIXMLDocument.load(POIXMLDocument.java:160)
>   at org.apache.poi.xwpf.usermodel.XWPFDocument.<init>(XWPFDocument.java:124)
>   at org.apache.poi.xwpf.extractor.XWPFWordExtractor.<init>(XWPFWordExtractor.java:58)
>   at org.apache.poi.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:237)
>   at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:86)
>   at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:87)


