[ https://issues.apache.org/jira/browse/TIKA-1191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072104#comment-14072104 ]
Nicolas Belisle commented on TIKA-1191: --------------------------------------- I was able to reproduce a similar issue with another file using Tika 1.5. See attached eml.test and the test (Test.java). The exception : Exception in thread "main" org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.mail.RFC822Parser@6743bc0f at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244) at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242) at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) at java.lang.reflect.Method.invoke(Unknown Source) at org.apache.tika.fork.ForkServer.call(ForkServer.java:144) at org.apache.tika.fork.ForkServer.processRequests(ForkServer.java:124) at org.apache.tika.fork.ForkServer.main(ForkServer.java:69) Caused by: java.lang.NullPointerException at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:158) at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:516) at org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:60) at org.apache.tika.config.TikaConfig.<init>(TikaConfig.java:169) at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:268) at org.apache.tika.parser.AutoDetectParser.<init>(AutoDetectParser.java:51) at org.apache.tika.parser.mail.RFC822Parser.adaptedExtractMultipart(RFC822Parser.java:167) at org.apache.tika.parser.mail.RFC822Parser.adaptedExtractMultipart(RFC822Parser.java:156) at org.apache.tika.parser.mail.RFC822Parser.parse(RFC822Parser.java:101) at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242) ... 9 more > ForkParser / ClassLoaderProxy does not define package > ----------------------------------------------------- > > Key: TIKA-1191 > URL: https://issues.apache.org/jira/browse/TIKA-1191 > Project: Tika > Issue Type: Bug > Components: parser > Affects Versions: 1.4 > Reporter: Nicolas Belisle > Attachments: ClassLoaderProxy.java.patch, Test.java, test.eml > > > ForkParser will throw an Exception in some cases : > org.apache.tika.exception.TikaException: Invalid embedded resource > at > org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.handleEmbeddedOfficeDoc(AbstractPOIFSExtractor.java:189) > at > org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.java:135) > at > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:186) > at > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:161) > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) > at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) > at java.lang.reflect.Method.invoke(Method.java:597) > at org.apache.tika.fork.ForkServer.call(ForkServer.java:144) > at org.apache.tika.fork.ForkServer.processRequests(ForkServer.java:124) > at org.apache.tika.fork.ForkServer.main(ForkServer.java:69) > Caused by: java.lang.NullPointerException > at > org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:136) > at > org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:499) > at > org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:60) > at org.apache.tika.config.TikaConfig.<init>(TikaConfig.java:169) > at > org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:268) > at > org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.getTikaConfig(AbstractPOIFSExtractor.java:72) > at > org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.getDetector(AbstractPOIFSExtractor.java:79) > at > org.apache.tika.parser.microsoft.AbstractPOIFSExtractor.handleEmbeddedOfficeDoc(AbstractPOIFSExtractor.java:176) > ... 10 more > A patch will follow -- This message was sent by Atlassian JIRA (v6.2#6252)