Hello,
I am trying to configure the Entityhub linking engine to use an Entityhub
site with vocabulary from the Disease Ontology (http://disease-ontology.org/),
but when I enhance text with it, labels from the ontology are not being
annotated in the text. I am looking for advice on how to debug this. Here
is what I've tried so far:
- I used the genericrdf indexing tool to import the Disease Ontology into a
new Entityhub site. When I used the Entityhub /find API endpoint to search
for the name "dengue hemorrhagic fever", a result from the Disease Ontology
was returned.
- I configured and built an EntityhubLinkingEngine and a WeightedChain
containing the linking engine. Both show up on the Stanbol admin site and in
the Felix console. These are the config files:
https://github.com/ecohealthalliance/t11/tree/master/ansible/roles/stanbol/templates/enhancer
- When I used the following API call to enhance text containing the same
term I was able to find using the /find endpoint, the language detected is
the only annotation returned.

curl -X POST -H "Accept: application/json" -H "Content-type: text/plain" \
  --data "Avoid dengue hemorrhagic fever." \
  http://54.197.175.163:3000/enhancer/chain/doidEnhancerChain
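
For anyone who wants to reproduce the two requests side by side (the /find
lookup from the first step and the enhancer call above), here is a minimal
Python sketch. The site id `doid` is a hypothetical placeholder, and the
`/entityhub/site/<id>/find` path with `name`, `lang` and `limit` form
parameters is an assumption based on the Entityhub REST documentation, not
something taken from the configuration linked above. The script only builds
and prints the requests; it does not send them.

```python
from urllib.parse import urlencode
from urllib.request import Request

BASE_URL = "http://54.197.175.163:3000"  # host from the curl command above
SITE_ID = "doid"                         # hypothetical site id, replace with the real one
CHAIN = "doidEnhancerChain"              # chain name from the curl command above


def build_find_request(name, lang=None, limit=5):
    """Build a form-encoded POST for the site's /find endpoint (assumed path)."""
    params = {"name": name, "limit": str(limit)}
    if lang is not None:
        params["lang"] = lang
    return Request(
        f"{BASE_URL}/entityhub/site/{SITE_ID}/find",
        data=urlencode(params).encode("utf-8"),
        headers={
            "Accept": "application/json",
            "Content-Type": "application/x-www-form-urlencoded",
        },
        method="POST",
    )


def build_enhance_request(text):
    """Build the plain-text POST that the curl command sends to the chain."""
    return Request(
        f"{BASE_URL}/enhancer/chain/{CHAIN}",
        data=text.encode("utf-8"),
        headers={"Accept": "application/json", "Content-Type": "text/plain"},
        method="POST",
    )


find_req = build_find_request("dengue hemorrhagic fever", lang="en")
enhance_req = build_enhance_request("Avoid dengue hemorrhagic fever.")

# Print both requests for inspection; nothing is sent over the network here.
for req in (find_req, enhance_req):
    print(req.get_method(), req.full_url, req.data.decode("utf-8"))
```

To actually send either request, pass it to `urllib.request.urlopen(req)` and
read the response body.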

This appears in the Stanbol error.log when the enhancement runs:

```
31.05.2016 12:05:06.204 *INFO* [Thread-5]
org.apache.stanbol.enhancer.engines.langid.LangIdEnhancementEngine language
identified as no
31.05.2016 12:05:06.206 *INFO* [Thread-5]
org.apache.stanbol.enhancer.engines.opennlp.impl.NamedEntityExtractionEnhancementEngine
No NER Model for person and language no available!
31.05.2016 12:05:06.206 *INFO* [Thread-5]
org.apache.stanbol.enhancer.engines.opennlp.impl.NamedEntityExtractionEnhancementEngine
No NER Model for organization and language no available!
31.05.2016 12:05:06.207 *INFO* [Thread-5]
org.apache.stanbol.enhancer.engines.opennlp.impl.NamedEntityExtractionEnhancementEngine
No NER Model for location and language no available!
31.05.2016 12:05:06.210 *INFO* [Thread-5]
org.apache.stanbol.enhancer.engines.entitylinking.engine.EntityLinkingEngine
EntityLinking Statistics:
31.05.2016 12:05:06.210 *INFO* [Thread-5]
org.apache.stanbol.enhancer.engines.entitylinking.engine.EntityLinkingEngine
    - overal: 2ms (text processing: 4%, lookup: 127%, matching 0%, ranking
0%, other -31%)
31.05.2016 12:05:06.210 *INFO* [Thread-5]
org.apache.stanbol.enhancer.engines.entitylinking.engine.EntityLinkingEngine
  - Text Processing: 0.071543ms [count: 4 | time: 0.01788575ms
(max:0.051031, min:0.005928)]
31.05.2016 12:05:06.211 *INFO* [Thread-5]
org.apache.stanbol.enhancer.engines.entitylinking.engine.EntityLinkingEngine
  - Vocabulary Lookup: 2.541598ms [count: 3 | time: 0.8471993333333333ms
(max:1.190281, min:0.667284)]
31.05.2016 12:05:06.211 *INFO* [Thread-5]
org.apache.stanbol.enhancer.engines.entitylinking.engine.EntityLinkingEngine
    - cache hits: 1 (33.333332%)
31.05.2016 12:05:06.211 *INFO* [Thread-5]
org.apache.stanbol.enhancer.engines.entitylinking.engine.EntityLinkingEngine
      - 0 query results (0 filtered - NaN%)
31.05.2016 12:05:06.211 *INFO* [Thread-5]
org.apache.stanbol.enhancer.engines.entitylinking.engine.EntityLinkingEngine
  - Label Matching: 0.00218ms [count: 3 | time: 7.266666666666667E-4ms
(max:7.55E-4, min:7.04E-4)]
31.05.2016 12:05:06.211 *INFO* [Thread-5]
org.apache.stanbol.enhancer.engines.entitylinking.engine.EntityLinkingEngine
  - Suggestion Ranking: 0.0ms [count: 0 | time: NaNms (max:-1.0E-6,
min:9.223372036854775E12)]
31.05.2016 12:05:06.214 *INFO* [qtp1118916813-38]
org.apache.stanbol.enhancer.jobmanager.event.impl.EventJobManagerImpl
Execution of Chain doidEnhancerChain finished after 14ms for ContentItem
<urn:content-item-sha1-d2851c0b02e12cc3b42bb6608fa2e1d50c43b17f>
31.05.2016 12:05:06.215 *INFO* [qtp1118916813-38]
org.apache.stanbol.enhancer.servicesapi.EnhancementJobManager > processed
ContentItem
<urn:content-item-sha1-d2851c0b02e12cc3b42bb6608fa2e1d50c43b17f> with Chain
'doidEnhancerChain' in 14ms | chain:[tika: 1ms (7%), langid: 3ms (21%),
opennlp-sentence: 0ms (0%), opennlp-token: 0ms (0%), opennlp-pos: 1ms (7%),
opennlp-ner: 1ms (7%), entityhubExtraction: 4ms (29%), doidEnhancer: 7ms
(50%), dbpediaLinking: 0ms (0%)], concurrency: 1.0 (0%)
```
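
Two values in this log look most relevant to the missing annotations: the
language reported by the langid engine (`no`, which is the ISO 639-1 code for
Norwegian) and the number of vocabulary query results (0). A minimal Python
sketch for pulling those two values out of a log excerpt while trying
different inputs is below. The function name `summarize_log` is hypothetical,
and the regular expressions assume the message wording shown in the log above.

```python
import re


def summarize_log(log_text):
    """Extract the detected language and the lookup result count from a log excerpt."""
    # Log messages are wrapped across lines, so collapse all whitespace first.
    flat = " ".join(log_text.split())
    lang = re.search(r"language identified as (\S+)", flat)
    results = re.search(r"(\d+) query results", flat)
    return {
        "language": lang.group(1) if lang else None,
        "query_results": int(results.group(1)) if results else None,
    }


# Excerpt copied from the log above.
sample = """
31.05.2016 12:05:06.204 *INFO* [Thread-5]
org.apache.stanbol.enhancer.engines.langid.LangIdEnhancementEngine language
identified as no
31.05.2016 12:05:06.211 *INFO* [Thread-5]
org.apache.stanbol.enhancer.engines.entitylinking.engine.EntityLinkingEngine
      - 0 query results (0 filtered - NaN%)
"""

print(summarize_log(sample))  # {'language': 'no', 'query_results': 0}
```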

The Ansible playbook here performs all the steps I have been using to set up
Stanbol: https://github.com/ecohealthalliance/t11/tree/master/ansible

Thanks,
-Nathan Breit

