Rupert Westenthaler created STANBOL-717:
-------------------------------------------

             Summary: Add explicit default EnhancementChain configuration to 
the Stanbol Launchers
                 Key: STANBOL-717
                 URL: https://issues.apache.org/jira/browse/STANBOL-717
             Project: Stanbol
          Issue Type: Bug
            Reporter: Rupert Westenthaler
            Assignee: Rupert Westenthaler
            Priority: Minor


As the Stanbol Community maintains more and more Enhancement Engines, the 
current default Enhancement Chain (all active EnhancementEngines) becomes 
increasingly problematic.

To give only some recent examples: 

* STANBOL-707 added a second language identification engine. As a result, two 
Engines now add language annotations when the current default chain is used.

* STANBOL-706 will bring support for DBpedia Spotlight. This includes Engines 
for Spotting, Entity Candidates and full DBpedia Spotlight annotations. With 
the current default chain all of those Engines would be included (typically one 
would want only one of them within a single Enhancement Chain). In addition, 
the results of those Engines are expected to be mostly duplicates of those 
produced by the NER and EntityTagging Engines working with the DBpedia default 
data included with the Stanbol Launcher.

To work around this, the proposal is to:

1. explicitly configure the default EnhancementChain used by the Stanbol 
Launchers
2. keep the current chain that includes all active EnhancementEngines, but 
ensure that it is no longer used as the default. This chain should be named 
"all-active".

These configuration changes should be provided by the 
"org.apache.stanbol.data.defaultconfig" module.

Default Chain configuration:

The Default Chain configuration should include the following Engines:

    metaxa;optional
    tika;optional
    langid
    ner
    dbpediaLinking
    entityhubExtraction

This represents the typical configuration as it already was in the 
0.9.0-incubating release.
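
To illustrate how such an engine list could be interpreted, here is a minimal 
Java sketch. It is not the Stanbol implementation: the class ChainSpec and its 
methods are hypothetical names made up for this example, and it assumes that 
the ";optional" suffix marks an Engine whose absence should not cause the 
chain to fail.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical helper, NOT part of Apache Stanbol: parses an engine list
// such as the one proposed above into (name, optional) entries.
public class ChainSpec {

    /** One chain entry: the engine name and whether it may be missing. */
    public static final class Entry {
        public final String engine;
        public final boolean optional;

        Entry(String engine, boolean optional) {
            this.engine = engine;
            this.optional = optional;
        }
    }

    /** Parses lines like "metaxa;optional" or "langid", keeping their order. */
    public static List<Entry> parse(List<String> lines) {
        List<Entry> chain = new ArrayList<>();
        for (String line : lines) {
            String trimmed = line.trim();
            if (trimmed.isEmpty()) {
                continue; // ignore blank lines
            }
            String[] parts = trimmed.split(";");
            boolean optional = false;
            // Assumption: any parameter equal to "optional" marks the engine
            // as optional; other parameters are ignored in this sketch.
            for (int i = 1; i < parts.length; i++) {
                if (parts[i].trim().equals("optional")) {
                    optional = true;
                }
            }
            chain.add(new Entry(parts[0].trim(), optional));
        }
        return chain;
    }

    /** Returns the required engines of the chain that are not active. */
    public static List<String> missingRequired(List<Entry> chain,
                                               Set<String> activeEngines) {
        List<String> missing = new ArrayList<>();
        for (Entry entry : chain) {
            if (!entry.optional && !activeEngines.contains(entry.engine)) {
                missing.add(entry.engine);
            }
        }
        return missing;
    }
}
```

Under these assumptions, a launcher without metaxa and tika would still be 
able to run the default chain, because missingRequired() reports only the 
non-optional Engines (langid, ner, dbpediaLinking, entityhubExtraction) when 
they are not active.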
