[
https://issues.apache.org/jira/browse/STANBOL-1148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rupert Westenthaler updated STANBOL-1148:
-----------------------------------------
Description:
The dbpedia default data index will be updated to:
* use the current dbpedia.org version. Currently dbpedia 3.6 is used by the
default index. The new one will be based on dbpedia 3.8.
* do no longer index entities that are redirects. Rather generate a
'dbp-ont:surfaceForm' field that indexes all labels of entities that redirect
to indexed one (e.g 'US', 'USA', 'U.S.A' … -> 'United States')
* make the index compatible to the FST linking engine
* include generated FST models
As a lot of unit tests and integration test do depend on the data contained in
the index this will also require to adapt those test.
was:
The dbpedia default data index should be updated so that it can be used with
the FST linking engine.
This is an own issue as this change in data will most likely also require to
change existing unit and integration test that relay on the current data
present in the dbpedia default data index.
The current dbpedia default data index is based on dbpedia version 3.6. The new
one will use version 3.8
> Update the dbpedia default data
> -------------------------------
>
> Key: STANBOL-1148
> URL: https://issues.apache.org/jira/browse/STANBOL-1148
> Project: Stanbol
> Issue Type: Improvement
> Components: Enhancement Engines
> Reporter: Rupert Westenthaler
> Assignee: Rupert Westenthaler
>
> The dbpedia default data index will be updated to:
> * use the current dbpedia.org version. Currently dbpedia 3.6 is used by the
> default index. The new one will be based on dbpedia 3.8.
> * do no longer index entities that are redirects. Rather generate a
> 'dbp-ont:surfaceForm' field that indexes all labels of entities that redirect
> to indexed one (e.g 'US', 'USA', 'U.S.A' … -> 'United States')
> * make the index compatible to the FST linking engine
> * include generated FST models
> As a lot of unit tests and integration test do depend on the data contained
> in the index this will also require to adapt those test.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira