[ 
https://issues.apache.org/jira/browse/TIKA-1191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14703807#comment-14703807
 ] 

Eric Biggers commented on TIKA-1191:
------------------------------------

I am using Tika 1.7 and I encountered this problem while testing ForkParser on 
the files in the test-documents directory distributed with the Tika sources.  
An example of a file that causes the problem is "testBinControlWord.rtf".  
Applying the ClassLoaderProxy.java.patch attached to this ticket appears to 
solve the problem (or at least work around it, since the packages won't be 
defined with their full original metadata).

The stacktrace given above for Tika 1.8-SNAPSHOT looks like an unrelated 
problem.

> ForkParser / ClassLoaderProxy does not define package
> -----------------------------------------------------
>
>                 Key: TIKA-1191
>                 URL: https://issues.apache.org/jira/browse/TIKA-1191
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.4, 1.5
>            Reporter: Nicolas Belisle
>         Attachments: ClassLoaderProxy.java.patch, Test.java, test.eml
>
>
> ForkParser will throw an Exception in some cases : 
> org.apache.tika.exception.TikaException: Invalid embedded resource
>       at 
> org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.handleEmbeddedOfficeDoc(AbstractPOIFSExtractor.java:189)
>       at 
> org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.java:135)
>       at 
> org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:186)
>       at 
> org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:161)
>       at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>       at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>       at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>       at java.lang.reflect.Method.invoke(Method.java:597)
>       at org.apache.tika.fork.ForkServer.call(ForkServer.java:144)
>       at org.apache.tika.fork.ForkServer.processRequests(ForkServer.java:124)
>       at org.apache.tika.fork.ForkServer.main(ForkServer.java:69)
> Caused by: java.lang.NullPointerException
>       at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:136)
>       at 
> org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:499)
>       at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:60)
>       at org.apache.tika.config.TikaConfig.<init>(TikaConfig.java:169)
>       at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:268)
>       at 
> org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.getTikaConfig(AbstractPOIFSExtractor.java:72)
>       at 
> org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.getDetector(AbstractPOIFSExtractor.java:79)
>       at 
> org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.handleEmbeddedOfficeDoc(AbstractPOIFSExtractor.java:176)
>       ... 10 more
> A patch will follow



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to