[
https://issues.apache.org/jira/browse/STANBOL-187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13033965#comment-13033965
]
Rupert Westenthaler commented on STANBOL-187:
---------------------------------------------
Status Update
The new Indexing Infrastructure is functional. The Indexer for DBLP and DBpedia
are already ported.
The old generic RDF indexer was deleted and is no longer used.
Still open:
(1) Port the geonames.org indexer: It does not use RDF, but directly
reads/processes the DB dumps. Therefore a customized Indexing Source has to be
implemented based on the current implementation
(2) Add support to the Solr Yard destination to create a OSGI configuration
file for the SolrYard that loads the index based on the Solr Archive or Solr
Archive Reference (the two files already created in the distribution folder)
> Extendable indexing infrastructure for the Entityhub
> ----------------------------------------------------
>
> Key: STANBOL-187
> URL: https://issues.apache.org/jira/browse/STANBOL-187
> Project: Stanbol
> Issue Type: Improvement
> Components: Entity Hub
> Reporter: Rupert Westenthaler
> Assignee: Rupert Westenthaler
>
> Currently the Entityhub includes some utilities to create Indexes for
> dbPedia, geonames and dblp. There exists also an generic RDF indexer that is
> used by the dbPedia and dblp however also this implementation is not
> extendable and not really suitable to add features requested by issues like
> STANBOL-92, STANBOL-93 and STANBOL-163.
> The goal is to create an infrastructure that provides an implementation of
> - the indexing workflow
> - configuration and initialization
> and defines Interfaces that allows to plug in
> - different Data Sources
> - entity ranking implementations
> - entity data mapper (e.g. filtering some fields, schema translations ...)
> - indexing targets (the Yard that stores the indexed entities)
> The existing Indexing utilities need to be moved to use the new Infrastructure
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira