[ 
https://issues.apache.org/jira/browse/TIKA-935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

John Mastarone updated TIKA-935:
--------------------------------

    Attachment: ArParserTest.java
                TIKA-935.patch

Patch uploaded which corrects the error in the *.ar file detection, along with 
new unit test class that makes use of existing .ar files in the test-documents 
folder.  With this patch, parsing occurs successfully in a latest build.  The 
unit tests pass.
                
> TikaException thrown when trying to parse archive (*.ar) files
> --------------------------------------------------------------
>
>                 Key: TIKA-935
>                 URL: https://issues.apache.org/jira/browse/TIKA-935
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.2
>         Environment: Windows 7
>            Reporter: John Mastarone
>         Attachments: ArParserTest.java, TIKA-935.patch
>
>
> A TikaException is thrown when trying to drop either of the two .ar files 
> from the parsers' test-documents folder into Tika-GUI.  From looking at this: 
> http://stuff.mit.edu/afs/athena/software/cygwin/cygwin_v1.3.2/usr/share/magic.mime
>    the archive detection is not done correctly for these types of files in 
> the PackageExtractor class, and a TarArchiveInputStream is chosen by default, 
> incorrectly.  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to