Nicolas Belisle created TIKA-1191:
-------------------------------------

             Summary: ForkParser / ClassLoaderProxy does not define package
                 Key: TIKA-1191
                 URL: https://issues.apache.org/jira/browse/TIKA-1191
             Project: Tika
          Issue Type: Bug
          Components: parser
    Affects Versions: 1.4
            Reporter: Nicolas Belisle


ForkParser will throw an Exception in some cases : 

org.apache.tika.exception.TikaException: Invalid embedded resource
        at 
org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.handleEmbeddedOfficeDoc(AbstractPOIFSExtractor.java:189)
        at 
org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.java:135)
        at 
org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:186)
        at 
org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:161)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
        at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.apache.tika.fork.ForkServer.call(ForkServer.java:144)
        at org.apache.tika.fork.ForkServer.processRequests(ForkServer.java:124)
        at org.apache.tika.fork.ForkServer.main(ForkServer.java:69)
Caused by: java.lang.NullPointerException
        at 
org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:136)
        at 
org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:499)
        at 
org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:60)
        at org.apache.tika.config.TikaConfig.<init>(TikaConfig.java:169)
        at 
org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:268)
        at 
org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.getTikaConfig(AbstractPOIFSExtractor.java:72)
        at 
org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.getDetector(AbstractPOIFSExtractor.java:79)
        at 
org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.handleEmbeddedOfficeDoc(AbstractPOIFSExtractor.java:176)
        ... 10 more


A patch will follow



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to