[ 
https://issues.apache.org/jira/browse/TIKA-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14699635#comment-14699635
 ] 

Sergey Beryozkin commented on TIKA-1712:
----------------------------------------

Hi Chris

The problem is that META-INF/cxf/bus-extensions.txt in tika-app.jar is 
incomplete: it contains only the entries from cxf-rt-transports-http and none 
of the content of the bus-extensions.txt in cxf-core. After updating the file 
in tika-app manually, I got the GROBID example working.

The solution is to have the content of all META-INF/cxf/bus-extensions.txt 
files available in tika-app, merged into a single file.
I'm not sure how this can be realized, though.
Sergey
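
To make the "single file" idea concrete, here is a minimal sketch of such a 
merge. It is not taken from the Tika or CXF sources: the class name 
BusExtensionsMerger and its methods are hypothetical, and it assumes that 
bus-extensions.txt holds one entry per line, so that concatenating the files 
and dropping blank or repeated lines is a valid merge. In a real build this 
step would normally belong to the packaging tool (for example, a 
resource-appending transformer in the jar-shading step); whether tika-app's 
build can do that is not established in this thread.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.net.URL;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Enumeration;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper, for illustration only.
final class BusExtensionsMerger {

    // Resource path named in the comment above.
    static final String RESOURCE = "META-INF/cxf/bus-extensions.txt";

    // Concatenates the lines of several files, skipping blank lines and
    // exact duplicates while keeping first-seen order.
    // Assumption: one entry per line.
    static List<String> mergeLines(List<List<String>> files) {
        Set<String> merged = new LinkedHashSet<>();
        for (List<String> lines : files) {
            for (String line : lines) {
                String trimmed = line.trim();
                if (!trimmed.isEmpty()) {
                    merged.add(trimmed);
                }
            }
        }
        return new ArrayList<>(merged);
    }

    // Reads every line of one classpath resource.
    static List<String> readLines(URL url) throws IOException {
        List<String> lines = new ArrayList<>();
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(url.openStream(), StandardCharsets.UTF_8))) {
            String line;
            while ((line = reader.readLine()) != null) {
                lines.add(line);
            }
        }
        return lines;
    }

    // Lists every copy of the resource visible on the classpath (to stderr)
    // and prints the merged content (to stdout).
    public static void main(String[] args) throws IOException {
        ClassLoader loader = BusExtensionsMerger.class.getClassLoader();
        Enumeration<URL> urls = loader.getResources(RESOURCE);
        List<List<String>> files = new ArrayList<>();
        while (urls.hasMoreElements()) {
            URL url = urls.nextElement();
            System.err.println("found " + url);
            files.add(readLines(url));
        }
        for (String line : mergeLines(files)) {
            System.out.println(line);
        }
    }
}
```

Run with the individual CXF jars on the classpath, this should list one URL 
per jar that ships the file; run against tika-app.jar alone, it would show 
only the single copy that survived packaging, which is a quick way to confirm 
the diagnosis above.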
 

> GROBID parser fails in tika-app
> -------------------------------
>
>                 Key: TIKA-1712
>                 URL: https://issues.apache.org/jira/browse/TIKA-1712
>             Project: Tika
>          Issue Type: Bug
>          Components: cli, server
>            Reporter: Chris A. Mattmann
>            Assignee: Sergey Beryozkin
>             Fix For: 1.11
>
>
> Hey Sergey do you have any idea why CXF's 3.0.3 rt-client would work fine in 
> tika-server, but fail in tika-app? I'm seeing that with the GROBID parser. 
> See:
> https://issues.apache.org/jira/browse/CXF-6545
> Try calling the GROBID parser from Tika app:
> java -classpath $HOME/git/grobidparser-resources/:target/tika-app-1.11-SNAPSHOT.jar org.apache.tika.cli.TikaCLI --config=$HOME/git/grobidparser-resources/tika-config.xml -J $HOME/git/grobid/papers/ICSE06.pdf
> After following this guide:
> https://wiki.apache.org/tika/GrobidJournalParser
> Works fine in Tika-Server - dies in Tika-app with:
> {noformat}
> java.lang.NullPointerException
>       at org.apache.cxf.jaxrs.client.AbstractClient.setupOutInterceptorChain(AbstractClient.java:849)
>       at org.apache.cxf.jaxrs.client.AbstractClient.createMessage(AbstractClient.java:923)
>       at org.apache.cxf.jaxrs.client.WebClient.finalizeMessage(WebClient.java:1125)
>       at org.apache.cxf.jaxrs.client.WebClient.doChainedInvocation(WebClient.java:1098)
>       at org.apache.cxf.jaxrs.client.WebClient.doInvoke(WebClient.java:894)
>       at org.apache.cxf.jaxrs.client.WebClient.doInvoke(WebClient.java:865)
>       at org.apache.cxf.jaxrs.client.WebClient.invoke(WebClient.java:331)
>       at org.apache.cxf.jaxrs.client.WebClient.post(WebClient.java:340)
>       at org.apache.tika.parser.journal.GrobidRESTParser.parse(GrobidRESTParser.java:82)
>       at org.apache.tika.parser.journal.JournalParser.parse(JournalParser.java:67)
>       at org.apache.tika.parser.ParserDecorator.parse(ParserDecorator.java:177)
>       at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>       at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>       at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
>       at org.apache.tika.parser.RecursiveParserWrapper.parse(RecursiveParserWrapper.java:158)
>       at org.apache.tika.cli.TikaCLI.handleRecursiveJson(TikaCLI.java:504)
>       at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:484)
>       at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:139)
> java.lang.NullPointerException
>       at org.apache.tika.parser.journal.GrobidRESTParser.parse(GrobidRESTParser.java:89)
>       at org.apache.tika.parser.journal.JournalParser.parse(JournalParser.java:67)
>       at org.apache.tika.parser.ParserDecorator.parse(ParserDecorator.java:177)
>       at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>       at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
>       at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
>       at org.apache.tika.parser.RecursiveParserWrapper.parse(RecursiveParserWrapper.java:158)
>       at org.apache.tika.cli.TikaCLI.handleRecursiveJson(TikaCLI.java:504)
>       at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:484)
>       at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:139)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
