Rafa Haro created STANBOL-1125:
----------------------------------

             Summary: Create a lightweight EntityHub Indexing Tool for Freebase
                 Key: STANBOL-1125
                 URL: https://issues.apache.org/jira/browse/STANBOL-1125
             Project: Stanbol
          Issue Type: Improvement
          Components: Entityhub
            Reporter: Rafa Haro


Due to the enormous size of the dumps, current Freebase indexing tool in 
Stanbol can't barely work in machines without several gigas of RAM and/or SSD 
disks. JenaTDB importer has been identified as the bootle neck of the indexing 
process. To use an RDF database is mandatory in order to, for instance, use 
LDPath programs at indexing time.

The idea is to develop a lightweight indexing tool that stream data from the 
dumps and push it directly to Solr. Despite losing some functionality, it is 
possible for any user to generate Freebase EntityHub indexes from any dump.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to