[ 
https://issues.apache.org/jira/browse/TIKA-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14013647#comment-14013647
 ] 

Michael McCandless commented on TIKA-1305:
------------------------------------------

Net/net, the RTF is corrupted, right?

But we want to make a best effort to "gloss over" the corruption and still
extract what we can?  I think that makes sense.

+1 for the simple solution, maybe w/ a comment explaining it's best effort when 
we see a corrupted doc?
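
A minimal sketch of what such a guard with a best-effort comment could look like. This is not the actual RTFParser code: the class name, the method name, and the list of entries are all hypothetical, and it only assumes the failure is an out-of-range index (such as the -1 reported in the issue) reaching an indexed lookup.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch; not Tika's actual RTFParser code.
class BestEffortLookupSketch {

    // Returns the entry at the given index, or an empty string when the
    // index is out of range.
    static String lookupOrEmpty(List<String> entries, int index) {
        // Best effort: a corrupted document can hand us an index such as -1.
        // Rather than throwing ArrayIndexOutOfBoundsException and losing the
        // whole document, skip this entry and keep extracting what we can.
        if (index < 0 || index >= entries.size()) {
            return "";
        }
        return entries.get(index);
    }

    public static void main(String[] args) {
        List<String> entries = Arrays.asList("1.", "a)", "i.");
        System.out.println("[" + lookupOrEmpty(entries, 1) + "]");   // valid index
        System.out.println("[" + lookupOrEmpty(entries, -1) + "]");  // corrupted index
    }
}
```

The trade-off is that the corruption is silently skipped, so the comment is what tells the next reader that the fallback is deliberate.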

> New list processing changes appear to be causing RTFParser exception
> --------------------------------------------------------------------
>
>                 Key: TIKA-1305
>                 URL: https://issues.apache.org/jira/browse/TIKA-1305
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.6
>         Environment: Mac OSX 10.7.5
> Tika 1.6-SNAPSHOT
>            Reporter: Chris Bamford
>            Priority: Minor
>              Labels: newbie
>         Attachments: rtfparsererror_2.rtf
>
>
> Some RTFs cause RTFParser to throw a RuntimeException:
> Unexpected RuntimeException from org.apache.tika.parser.rtf.RTFParser@425e60f2
> When tracing in the debugger (surfaces in CompositeParser.parse() where it 
> catches the RuntimeException, line 244 in my copy), the exception (e) is:
> java.lang.ArrayIndexOutOfBoundsException: -1
> A committer (Tim Allison) believes that it is being caused by recent list 
> processing changes.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
