Hi, As far as I know, currently, the NLP stack within Stanbol doesn't include any information in the Enhancement structure (output), probably because that information could be large and verbose. In my opinion, a flag for requesting it in the request would be nice to have, but it is not implemented right now. Actually the NLP stage is used as a necessary step for further analysis engines.
Cheers, Rafa On Tue, Jul 19, 2016 at 10:18 AM mzl <m...@informatik.uni-kiel.de> wrote: > Hello, > > how do I get the AnalysedText content part when using the REST API? > > I'm trying to execute an enhancement chain via the REST API by calling > each enhancement engine with the result of the preceeding. This works > for the langdetect engine but when I'm calling some other engines, like > the opennlp-sentence engine, the result seems contain no additional > information. The error log is clean: > "19.07.2016 10:03:51.190 *INFO* [qtp298757786-36] > org.apache.stanbol.enhancer.jobmanager.event.impl.EventJobManagerImpl > Execution of Chain opennlp-sentenceChain finished after 3ms for > ContentItem > <urn:content-item-sha1-ccfad800c413a3ba0297c202badb0eaebb4a57ce> > 19.07.2016 10:03:51.190 *INFO* [qtp298757786-36] > org.apache.stanbol.enhancer.servicesapi.EnhancementJobManager > > processed ContentItem > <urn:content-item-sha1-ccfad800c413a3ba0297c202badb0eaebb4a57ce> with > Chain 'opennlp-sentenceChain' in 2ms | chain:[opennlp-sentence: 2ms > (100%)], concurrency: 1.0 (0%)" > But the result seems to not contain the AnalysedText part which causes > errors in later steps. > > > > The engines I try to call: > 1. Tika > 2. langdetect > 3. opennlp-sentence > 4. opennlp-token > 5. opennlp-pos > 6. opennlp-chunker > > > > What I did so far: > - Get Stanbol code from https://svn.apache.org/repos/asf/stanbol/trunk/ > - Build Stanbol with mvn clean install -DskipTests > - Start Stanbol with java -Xmx1g -jar > ./org.apache.stanbol.launchers.stable-1.0.0-SNAPSHOT.jar > - Use Firefox HttpRequester Plugin to send a Requests to the > langdetect-Engine at > http://localhost:8080/enhancer/engine/langdetect?outputContent=*/* > with "Accept multipart/form-data" Header > - Received enhanced content (see below) > - Send this with content type "multipart/form-data; charset=UTF-8; > boundary=contentItem-U9u25OIBks0JM-j1GP" to > http://localhost:8080/enhancer/engine/opennlp-pos?outputContent=*/* > - Received the response shown below > - Send the response to the opennlp-chunker at > "http://localhost:8080/enhancer/engine/opennlp-chunker?outputContent=*/*" > with "multipart/form-data; charset=UTF-8; > boundary=contentItem-KFLGIdIWg8rZZ7AF_" > - Receive response with status 200 OK and content equal to the content > of the request > - error log: > > 19.07.2016 10:14:24.360 *WARN* [Thread-9] > org.apache.stanbol.enhancer.nlp.utils.NlpEngineHelper The Enhancement > Engine 'opennlp-chunker (impl: OpenNlpChunkingEngine)' CAN NOT enhance > ContentItem InMemoryContentItem > uri=[urn:content-item-sha1-ccfad800c413a3ba0297c202badb0eaebb4a57ce], > content=[size:204 bytes;;mime-type:text/plain], metadata=[8 triples], > > parts=[<urn:content-item-sha1-ccfad800c413a3ba0297c202badb0eaebb4a57ce_main>, > <urn:apache.org:stanbol.enhancer:request.properties>, > < > http://stanbol.apache.org/ontology/enhancer/executionmetadata#ChainExecution > >] > because the AnalysedText ContentPart is missing. Users might want to add > an EnhancementEngine that creates the AnalysedText ContentPart such as > the POSTaggingEngine (o.a.stanbol.enhancer.engines.opennlp.pos)! > > > > > ########################################################################################### > Content from langdetect: > > ########################################################################################### > > --contentItem-U9u25OIBks0JM-j1GP > Content-Disposition: form-data; name="metadata"; > filename="urn:content-item-sha1-ccfad800c413a3ba0297c202badb0eaebb4a57ce" > Content-Type: application/ld+json; charset=UTF-8 > Content-Transfer-Encoding: 8bit > > [ { > "@id" : "urn:enhancement-25ded47b-bdd2-cef8-abee-7ea258c03390", > "http://fise.iks-project.eu/ontology/confidence" : [ { > "@value" : "0.9999959994881431", > "@type" : "http://www.w3.org/2001/XMLSchema#double" > } ], > "http://fise.iks-project.eu/ontology/extracted-from" : [ { > "@id" : > "urn:content-item-sha1-ccfad800c413a3ba0297c202badb0eaebb4a57ce" > } ], > "http://purl.org/dc/terms/created" : [ { > "@value" : "2016-07-14T15:12:09.795Z", > "@type" : "http://www.w3.org/2001/XMLSchema#dateTime" > } ], > "http://purl.org/dc/terms/creator" : [ { > "@value" : > > "org.apache.stanbol.enhancer.engines.langdetect.LanguageDetectionEnhancementEngine" > } ], > "http://purl.org/dc/terms/language" : [ { > "@value" : "en" > } ], > "http://purl.org/dc/terms/type" : [ { > "@id" : "http://purl.org/dc/terms/LinguisticSystem" > } ], > "@type" : [ "http://fise.iks-project.eu/ontology/Enhancement", > "http://fise.iks-project.eu/ontology/TextAnnotation" ] > } ] > --contentItem-U9u25OIBks0JM-j1GP > Content-Disposition: form-data; name="content" > Content-Type: multipart/alternate; > boundary=contentParts-U9u25OIBks0JM-j1GP > Content-Transfer-Encoding: 8bit > > --contentParts-U9u25OIBks0JM-j1GP > Content-Disposition: form-data; > name="urn:content-item-sha1-ccfad800c413a3ba0297c202badb0eaebb4a57ce_main" > Content-Type: text/plain > Content-Transfer-Encoding: binary > > The Stanbol enhancer can detect famous cities such as Paris and people > such as Bob Marley. It is hard to use its REST-API. We will see if the > language detection works, if there are more than one sentence. > --contentParts-U9u25OIBks0JM-j1GP-- > > --contentItem-U9u25OIBks0JM-j1GP-- > > > ########################################################################################### > Content from opennlp-pos: > > ########################################################################################### > > --contentItem-KFLGIdIWg8rZZ7AF_ > Content-Disposition: form-data; name="metadata"; > filename="urn:content-item-sha1-ccfad800c413a3ba0297c202badb0eaebb4a57ce" > Content-Type: application/ld+json; charset=UTF-8 > Content-Transfer-Encoding: 8bit > > [ { > "@id" : "urn:enhancement-25ded47b-bdd2-cef8-abee-7ea258c03390", > "http://fise.iks-project.eu/ontology/confidence" : [ { > "@value" : "0.9999959994881431", > "@type" : "http://www.w3.org/2001/XMLSchema#double" > } ], > "http://fise.iks-project.eu/ontology/extracted-from" : [ { > "@id" : > "urn:content-item-sha1-ccfad800c413a3ba0297c202badb0eaebb4a57ce" > } ], > "http://purl.org/dc/terms/created" : [ { > "@value" : "2016-07-14T15:12:09.795Z", > "@type" : "http://www.w3.org/2001/XMLSchema#dateTime" > } ], > "http://purl.org/dc/terms/creator" : [ { > "@value" : > > "org.apache.stanbol.enhancer.engines.langdetect.LanguageDetectionEnhancementEngine" > } ], > "http://purl.org/dc/terms/language" : [ { > "@value" : "en" > } ], > "http://purl.org/dc/terms/type" : [ { > "@id" : "http://purl.org/dc/terms/LinguisticSystem" > } ], > "@type" : [ "http://fise.iks-project.eu/ontology/Enhancement", > "http://fise.iks-project.eu/ontology/TextAnnotation" ] > } ] > --contentItem-KFLGIdIWg8rZZ7AF_ > Content-Disposition: form-data; name="content" > Content-Type: multipart/alternate; > boundary=contentParts-KFLGIdIWg8rZZ7AF_ > Content-Transfer-Encoding: 8bit > > --contentParts-KFLGIdIWg8rZZ7AF_ > Content-Disposition: form-data; > name="urn:content-item-sha1-ccfad800c413a3ba0297c202badb0eaebb4a57ce_main" > Content-Type: text/plain > Content-Transfer-Encoding: binary > > The Stanbol enhancer can detect famous cities such as Paris and people > such as Bob Marley. It is hard to use its REST-API. We will see if the > language detection works, if there are more than one sentence. > --contentParts-KFLGIdIWg8rZZ7AF_-- > > --contentItem-KFLGIdIWg8rZZ7AF_-- >