[ https://issues.apache.org/jira/browse/STANBOL-47?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Fabian Christ updated STANBOL-47:
---------------------------------
Priority: Minor (was: Major)
> Split up the EntityMentionEnhancementEngine into two separate ones
> ------------------------------------------------------------------
>
> Key: STANBOL-47
> URL: https://issues.apache.org/jira/browse/STANBOL-47
> Project: Stanbol
> Issue Type: Improvement
> Components: FISE
> Reporter: Fabian Christ
> Assignee: Olivier Grisel
> Priority: Minor
>
> Reported by project member rupert.westenthaler, Jun 16, 2010
> Currently the EntityMentionEnhancementEngine does two things:
> - First, it extracts named entities (currently Persons, Organisations and
> Places) from the content, using the OpenNLP framework.
> - Second, it recommends up to three entities, as defined in Wikipedia, for
> the named entities found, using the autotagger component.
> Expected Result:
> 1) An engine that provides named entity extraction based on OpenNLP. This
> engine creates TextAnnotation type enhancements.
> 2) An engine that provides recommendations for entries as defined in DBpedia.
> This engine consumes TextAnnotation type enhancements and produces EntityType
> enhancements.
> This would also make it possible to:
> - use another natural language processing framework for the named entity
> extraction
> - use other engines to calculate entity recommendations for text annotations.
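To make the proposed split concrete, here is a minimal sketch of the two stages as separate, swappable components. All type and method names below (TextAnnotationSketch, NamedEntityExtractor, EntityRecommender, SplitPipelineSketch) are hypothetical stand-ins invented for illustration and are not the FISE API. The cap on recommendations per annotation mirrors the "up to three" behaviour described above and is passed in as a parameter.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a TextAnnotation enhancement (not the FISE type).
final class TextAnnotationSketch {
    final String selectedText;
    final String type; // e.g. "Person", "Organisation", "Place"

    TextAnnotationSketch(String selectedText, String type) {
        this.selectedText = selectedText;
        this.type = type;
    }
}

// Hypothetical stand-in for an entity recommendation linked to its annotation.
final class EntityRecommendationSketch {
    final TextAnnotationSketch source;
    final String entityUri;

    EntityRecommendationSketch(TextAnnotationSketch source, String entityUri) {
        this.source = source;
        this.entityUri = entityUri;
    }
}

// Stage 1: named entity extraction. Any NLP framework could sit behind this.
interface NamedEntityExtractor {
    List<TextAnnotationSketch> extract(String content);
}

// Stage 2: entity recommendation. It only sees text annotations, never raw NLP state.
interface EntityRecommender {
    List<EntityRecommendationSketch> recommend(TextAnnotationSketch annotation);
}

final class SplitPipelineSketch {
    // Runs stage 1, then feeds every annotation to stage 2,
    // keeping at most maxPerAnnotation recommendations per annotation.
    static List<EntityRecommendationSketch> enhance(String content,
                                                    NamedEntityExtractor extractor,
                                                    EntityRecommender recommender,
                                                    int maxPerAnnotation) {
        List<EntityRecommendationSketch> result = new ArrayList<>();
        for (TextAnnotationSketch annotation : extractor.extract(content)) {
            List<EntityRecommendationSketch> recs = recommender.recommend(annotation);
            int limit = Math.min(maxPerAnnotation, recs.size());
            result.addAll(recs.subList(0, limit));
        }
        return result;
    }
}
```

Because the two stages only share the annotation type, either side can be replaced without touching the other, which is the point of the two bullets above.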
> Comment 2 by project member rupert.westenthaler, Jun 17, 2010
> Current state:
> There are two new Engines:
> - eu.iksproject.fise.engines.opennlp.impl.NamedEntityExtractionEnhancementEngine:
> This engine uses OpenNLP to perform named entity extraction. It still has a
> dependency on the configured autotagging provider, because the OpenNLP models
> are loaded via that bundle's context.
> - eu.iksproject.fise.engines.autotagging.impl.EntityMentionEnhancementEngine:
> This engine uses the autotagger to calculate entity recommendations for
> TextAnnotations.
> Both EnhancementEngines implement the ServiceProperties interface to provide
> information about their execution order.
> The NamedEntityExtractionEnhancementEngine needs to run first, because it
> produces the TextAnnotations consumed by the EntityMentionEnhancementEngine.
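The ordering requirement can be illustrated with a small sketch in which each engine exposes an ordering value through a properties map and the caller sorts on it. The interface, the property key "sketch.engine.order", and the rule that higher values run first are all assumptions made for this example; they are not taken from the actual ServiceProperties interface.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;

// Hypothetical engine contract; not the real ServiceProperties interface.
interface OrderedEngineSketch {
    // Hypothetical property key used only in this sketch.
    String ORDERING_KEY = "sketch.engine.order";

    String name();

    Map<String, Object> getServiceProperties();
}

final class SimpleEngineSketch implements OrderedEngineSketch {
    private final String name;
    private final int order;

    SimpleEngineSketch(String name, int order) {
        this.name = name;
        this.order = order;
    }

    @Override
    public String name() {
        return name;
    }

    @Override
    public Map<String, Object> getServiceProperties() {
        return Collections.<String, Object>singletonMap(ORDERING_KEY, order);
    }
}

final class EngineOrderingSketch {
    // Reads the ordering value; engines without one default to 0.
    static int orderOf(OrderedEngineSketch engine) {
        Object value = engine.getServiceProperties().get(OrderedEngineSketch.ORDERING_KEY);
        return (value instanceof Integer) ? (Integer) value : 0;
    }

    // Returns a new list sorted so that higher ordering values run first
    // (an assumption of this sketch). The sort is stable for equal values.
    static List<OrderedEngineSketch> sort(List<OrderedEngineSketch> engines) {
        List<OrderedEngineSketch> sorted = new ArrayList<>(engines);
        sorted.sort((a, b) -> Integer.compare(orderOf(b), orderOf(a)));
        return sorted;
    }
}
```

With this convention, giving the extraction engine a higher value than the recommendation engine ensures the TextAnnotations exist before anything tries to consume them, regardless of registration order.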
> TODOs:
> - Remove the dependency of the opennlp-ner bundle on the configured
> autotagging provider bundle.
> - Split up the unit tests as well.