IllegalArgumentException Parsing MS Word 97 - 2003 --------------------------------------------------
Key: TIKA-707 URL: https://issues.apache.org/jira/browse/TIKA-707 Project: Tika Issue Type: Bug Components: parser Affects Versions: 0.1-incubating Reporter: Pablo Queixalos http://www.ac-nancy-metz.fr/enseign/physique/nouvcoll/4-matiere/Exemple%20s%C3%A9ance%20TIC%20et%20Prisme.doc Caused by: java.lang.IllegalArgumentException: charStart (3102) > charEnd (3091) at org.apache.poi.hwpf.model.BytePropertyNode.<init>(BytePropertyNode.java:61) at org.apache.poi.hwpf.model.CHPX.<init>(CHPX.java:53) at org.apache.poi.hwpf.model.CHPFormattedDiskPage.<init>(CHPFormattedDiskPage.java:91) at org.apache.poi.hwpf.model.CHPBinTable.<init>(CHPBinTable.java:101) at org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:280) at org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.java:67) at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:196) at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242) ... 41 more -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira