Few of RTF files not extracting properly ----------------------------------------
Key: TIKA-642 URL: https://issues.apache.org/jira/browse/TIKA-642 Project: Tika Issue Type: Bug Components: parser Affects Versions: 0.9, 1.0 Environment: All Reporter: Manish Few of the RTF files dont get extracted properly. This is the stack trace: org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.rtf.RTFParser@616d071a at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:203) at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197) at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135) Caused by: java.io.IOException: Too many close-groups in RTF text at javax.swing.text.rtf.RTFParser.write(RTFParser.java:156) at javax.swing.text.rtf.RTFParser.writeSpecial(RTFParser.java:101) at javax.swing.text.rtf.AbstractFilter.write(AbstractFilter.java:158) at javax.swing.text.rtf.AbstractFilter.readFromStream(AbstractFilter.java:88) at javax.swing.text.rtf.RTFEditorKit.read(RTFEditorKit.java:65) at org.apache.tika.parser.rtf.RTFParser.parse(RTFParser.java:112) at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197) Where should i attached the document? -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira