[ 
https://issues.apache.org/jira/browse/STANBOL-187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13033965#comment-13033965
 ] 

Rupert Westenthaler commented on STANBOL-187:
---------------------------------------------

Status Update

The new Indexing Infrastructure is functional. The Indexer for DBLP and DBpedia 
are already ported.
The old generic RDF indexer was deleted and is no longer used.

Still open:
 
(1) Port the geonames.org indexer: It does not use RDF, but directly 
reads/processes the DB dumps. Therefore a customized Indexing Source has to be 
implemented based on the current implementation
(2) Add support to the Solr Yard destination to create a OSGI configuration 
file for the SolrYard that loads the index based on the Solr Archive or Solr 
Archive Reference (the two files already created in the distribution folder)



> Extendable indexing infrastructure for the Entityhub
> ----------------------------------------------------
>
>                 Key: STANBOL-187
>                 URL: https://issues.apache.org/jira/browse/STANBOL-187
>             Project: Stanbol
>          Issue Type: Improvement
>          Components: Entity Hub
>            Reporter: Rupert Westenthaler
>            Assignee: Rupert Westenthaler
>
> Currently the Entityhub includes some utilities to create Indexes for 
> dbPedia, geonames and dblp. There exists also an generic RDF indexer that is 
> used by the dbPedia and dblp however also this implementation is not 
> extendable and not really suitable to add features requested by issues like 
> STANBOL-92, STANBOL-93 and STANBOL-163.
> The goal is to create an infrastructure that provides an implementation of
>  - the indexing workflow
>  - configuration and initialization
> and defines Interfaces that allows to plug in
>  - different Data Sources
>  - entity ranking implementations
>  - entity data mapper (e.g. filtering some fields, schema translations ...)
>  - indexing targets (the Yard that stores the indexed entities)
> The existing Indexing utilities need to be moved to use the new Infrastructure

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to