Extendable indexing infrastructure for the Entityhub
----------------------------------------------------
Key: STANBOL-187
URL: https://issues.apache.org/jira/browse/STANBOL-187
Project: Stanbol
Issue Type: Improvement
Components: Entity Hub
Reporter: Rupert Westenthaler
Assignee: Rupert Westenthaler
Currently the Entityhub includes some utilities to create Indexes for dbPedia,
geonames and dblp. There exists also an generic RDF indexer that is used by the
dbPedia and dblp however also this implementation is not extendable and not
really suitable to add features requested by issues like STANBOL-92, STANBOL-93
and STANBOL-163.
The goal is to create an infrastructure that provides an implementation of
- the indexing workflow
- configuration and initialization
and defines Interfaces that allows to plug in
- different Data Sources
- entity ranking implementations
- entity data mapper (e.g. filtering some fields, schema translations ...)
- indexing targets (the Yard that stores the indexed entities)
The existing Indexing utilities need to be moved to use the new Infrastructure
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira