[ 
https://issues.apache.org/jira/browse/STANBOL-187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13043060#comment-13043060
 ] 

Rupert Westenthaler commented on STANBOL-187:
---------------------------------------------

The indexing utilities now also create a bundle that, when installed, adds and 
configures all the components needed to use the indexed data with the Stanbol 
Entityhub.

The two files needed are in the "/indexing/dist" folder.

To install an index:

1. Copy the "{name}.solrindex.zip" file to the "/sling/datafiles" folder within 
the home directory of your running Apache Stanbol instance.
2. Install the bundle "org.apache.stanbol.data.site.{name}-1.0.0.jar":
 * Go to the OSGi web console (http://{host}:{port}/system/console/bundles)
 * Click on "Install/update…"
 * Add the bundle to the dialog and activate the "Start Bundle" option
 * Reload the page. You should now see a bundle with the name "Apache Stanbol 
Data: iptc (org.apache.stanbol.data.site.{name})" and the status "Active"
 * The indexed dataset is now available as a ReferencedSite at 
"http://{host}:{port}/entityhub/site/{name}"
3. If you want, you can now delete the "{name}.solrindex.zip" file in the 
"/sling/datafiles" folder.
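
The file copy in step 1 and the URLs used in step 2 could be scripted roughly 
as follows. This is only a sketch: the helper names and the "dist_dir" and 
"stanbol_home" parameters are made up for illustration and are not part of 
Stanbol. The bundle itself is still installed by hand through the web console.

```python
import shutil
from pathlib import Path


def datafiles_dir(stanbol_home):
    """Return the "/sling/datafiles" folder below the Stanbol home directory."""
    return Path(stanbol_home) / "sling" / "datafiles"


def copy_index(dist_dir, stanbol_home, name):
    """Step 1: copy "{name}.solrindex.zip" into the datafiles folder.

    Assumption: dist_dir points at the "/indexing/dist" folder of the
    indexing utility that produced the index.
    """
    source = Path(dist_dir) / f"{name}.solrindex.zip"
    target_dir = datafiles_dir(stanbol_home)
    target_dir.mkdir(parents=True, exist_ok=True)
    # shutil.copy2 returns the path of the newly created file
    return Path(shutil.copy2(source, target_dir))


def bundle_file_name(name):
    """File name of the bundle to upload in step 2."""
    return f"org.apache.stanbol.data.site.{name}-1.0.0.jar"


def console_url(host, port):
    """URL of the bundle page where the bundle is installed."""
    return f"http://{host}:{port}/system/console/bundles"


def site_url(host, port, name):
    """URL under which the indexed dataset is available afterwards."""
    return f"http://{host}:{port}/entityhub/site/{name}"
```

For example, for an index named "iptc", copy_index("indexing/dist", ".", "iptc") 
would place "iptc.solrindex.zip" in "./sling/datafiles", and 
site_url("localhost", 8080, "iptc") gives the address of the new site, assuming 
Stanbol runs on localhost:8080.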



> Extendable indexing infrastructure for the Entityhub
> ----------------------------------------------------
>
>                 Key: STANBOL-187
>                 URL: https://issues.apache.org/jira/browse/STANBOL-187
>             Project: Stanbol
>          Issue Type: Improvement
>          Components: Entity Hub
>            Reporter: Rupert Westenthaler
>            Assignee: Rupert Westenthaler
>
> Currently the Entityhub includes some utilities to create indexes for 
> dbPedia, geonames and dblp. There is also a generic RDF indexer that is 
> used by the dbPedia and dblp utilities; however, this implementation is 
> neither extendable nor really suitable for adding features requested by 
> issues like STANBOL-92, STANBOL-93 and STANBOL-163.
> The goal is to create an infrastructure that provides an implementation of
>  - the indexing workflow
>  - configuration and initialization
> and defines interfaces that allow plugging in
>  - different data sources
>  - entity ranking implementations
>  - entity data mappers (e.g. filtering some fields, schema translations ...)
>  - indexing targets (the Yard that stores the indexed entities)
> The existing indexing utilities need to be moved to the new infrastructure.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
