[
https://issues.apache.org/jira/browse/STANBOL-187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13043060#comment-13043060
]
Rupert Westenthaler commented on STANBOL-187:
---------------------------------------------
The Indexing Utils now create also a Bundle that - when installed - adds and
configures all the components needed to use the indexed data with the Stanbol
Entityhub:
The two files needed are within the "/indexing/dist" folder
To install an index
1. copy the "{name}.solrindex.zip" to the "/sling/datafiles" folder within the
home directory of your running Apache Stanbol instance.
2. Install the bundle "org.apache.stanbol.data.site.{name}-1.0.0.jar" by
* Go to the OSGI Webconsole (http://{host}:{port}/system/console/bundles)
* Click on "Install/update…"
* Add this Bundle to the Dialog and activate the "Start Bundle" option
* Reload the page. Now you should see a Bundle with the Name "Apache Stanbol
Data: iptc (org.apache.stanbol.data.site.{name}) "and the Satus "Active"
* The indexed dataset is now available as ReferencedSite at
"http://{host}:{port}/entityhub/site/{name}"
3. If you want you can not delete the "{name}.solrindex.zip" in the
"/sling/datafiles" folder.
> Extendable indexing infrastructure for the Entityhub
> ----------------------------------------------------
>
> Key: STANBOL-187
> URL: https://issues.apache.org/jira/browse/STANBOL-187
> Project: Stanbol
> Issue Type: Improvement
> Components: Entity Hub
> Reporter: Rupert Westenthaler
> Assignee: Rupert Westenthaler
>
> Currently the Entityhub includes some utilities to create Indexes for
> dbPedia, geonames and dblp. There exists also an generic RDF indexer that is
> used by the dbPedia and dblp however also this implementation is not
> extendable and not really suitable to add features requested by issues like
> STANBOL-92, STANBOL-93 and STANBOL-163.
> The goal is to create an infrastructure that provides an implementation of
> - the indexing workflow
> - configuration and initialization
> and defines Interfaces that allows to plug in
> - different Data Sources
> - entity ranking implementations
> - entity data mapper (e.g. filtering some fields, schema translations ...)
> - indexing targets (the Yard that stores the indexed entities)
> The existing Indexing utilities need to be moved to use the new Infrastructure
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira