[ https://issues.apache.org/jira/browse/TIKA-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14699635#comment-14699635 ]
Sergey Beryozkin commented on TIKA-1712: ---------------------------------------- Hi Chris The problem is META-INF/cxf/bus-extensions.txt in tika-app.jar is incomplete, it only contains what is available inside cxf-rt-transports-http, but has no content available in cxf-core/bus-extensions.txt, after updating the file in tika-app manually I got the GROBID example working. The solution is to have the content of all META-INF/cxf/bus-extensions.txt files available in tika-app, in a single file. Not sure how this can be realized though Sergey > GROBID parser fails in tika-app > ------------------------------- > > Key: TIKA-1712 > URL: https://issues.apache.org/jira/browse/TIKA-1712 > Project: Tika > Issue Type: Bug > Components: cli, server > Reporter: Chris A. Mattmann > Assignee: Sergey Beryozkin > Fix For: 1.11 > > > Hey Sergey do you have any idea why CXF's 3.0.3 rt-client would work fine in > tika-server, but fail in tika-app? I'm seeing that with the GROBID parser. > See: > https://issues.apache.org/jira/browse/CXF-6545 > Try calling the GROBID parser from Tika app: > java -classpath > $HOME/git/grobidparser-resources/:target/tika-app-1.11-SNAPSHOT.jar > org.apache.tika.cli.TikaCLI > --config=$HOME/git/grobidparser-resources/tika-config.xml -J > $HOME/git/grobid/papers/ICSE06.pdf > After following this guide: > https://wiki.apache.org/tika/GrobidJournalParser > Works fine in Tika-Server - dies in Tika-app with: > {noformat} > java.lang.NullPointerException > at > org.apache.cxf.jaxrs.client.AbstractClient.setupOutInterceptorChain(AbstractClient.java:849) > at > org.apache.cxf.jaxrs.client.AbstractClient.createMessage(AbstractClient.java:923) > at > org.apache.cxf.jaxrs.client.WebClient.finalizeMessage(WebClient.java:1125) > at > org.apache.cxf.jaxrs.client.WebClient.doChainedInvocation(WebClient.java:1098) > at org.apache.cxf.jaxrs.client.WebClient.doInvoke(WebClient.java:894) > at org.apache.cxf.jaxrs.client.WebClient.doInvoke(WebClient.java:865) > at org.apache.cxf.jaxrs.client.WebClient.invoke(WebClient.java:331) > at org.apache.cxf.jaxrs.client.WebClient.post(WebClient.java:340) > at > org.apache.tika.parser.journal.GrobidRESTParser.parse(GrobidRESTParser.java:82) > at > org.apache.tika.parser.journal.JournalParser.parse(JournalParser.java:67) > at > org.apache.tika.parser.ParserDecorator.parse(ParserDecorator.java:177) > at > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) > at > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) > at > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120) > at > org.apache.tika.parser.RecursiveParserWrapper.parse(RecursiveParserWrapper.java:158) > at org.apache.tika.cli.TikaCLI.handleRecursiveJson(TikaCLI.java:504) > at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:484) > at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:139) > java.lang.NullPointerException > at > org.apache.tika.parser.journal.GrobidRESTParser.parse(GrobidRESTParser.java:89) > at > org.apache.tika.parser.journal.JournalParser.parse(JournalParser.java:67) > at > org.apache.tika.parser.ParserDecorator.parse(ParserDecorator.java:177) > at > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) > at > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) > at > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120) > at > org.apache.tika.parser.RecursiveParserWrapper.parse(RecursiveParserWrapper.java:158) > at org.apache.tika.cli.TikaCLI.handleRecursiveJson(TikaCLI.java:504) > at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:484) > at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:139) > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)