Rajan Shah created STANBOL-1420:
-----------------------------------
Summary: Indexer for the Freebase knowledge base
Key: STANBOL-1420
URL: https://issues.apache.org/jira/browse/STANBOL-1420
Project: Stanbol
Issue Type: Bug
Components: Entityhub
Affects Versions: entityhub-0.11.0
Environment: Mac OS Yosemite 10.10.3
Reporter: Rajan Shah
Priority: Blocker
Fix For: 1.0.0
Hi,
I am working on the HEAD branch of the stanbol repo and observed following
while building freebase entityhub indexing.
1. target directory contains following jars
directory: stanbol/entityhub/indexing/freebase/target
org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT-sources.jar
org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT.jar
original-org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT.jar
2. per documentation copy these jars to some place where freebase data resides
For ex. /tmp/freebase/
3. Initialize configuration by issuing following command
The configuration can be initialized with the defaults by calling
java -jar org.apache.stanbol.entityhub.indexing.freebase-*.jar init
The above results into an error stating
"no main manifest attribute, in
org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT-sources.jar"
If I change it to "java -jar
org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT.jar init" it
works however it might not be generating all the files as it generates errors
in next step.
4. java -jar -Xmx32g
org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT.jar index => This
command generates error as follows:
Exception in thread "Thread-2" 00:02:35,705 [Thread-0] INFO
solryard.SolrYardIndexingDestination - ... copy Solr Configuration form
/private/tmp/freebase/freebase-index/indexing/config/freebase to
/private/tmp/freebase/freebase-index/indexing/destination/indexes/default/freebase
java.lang.IllegalStateException: The file with the Entity Scores is missing
at
org.apache.stanbol.entityhub.indexing.core.source.LineBasedEntityIterator.initialise(LineBasedEntityIterator.java:476)
at
org.apache.stanbol.entityhub.indexing.core.impl.IndexingSourceInitialiser.run(IndexingSourceInitialiser.java:43)
at java.lang.Thread.run(Thread.java:745)
Caused by: java.io.FileNotFoundException:
/private/tmp/freebase/freebase-index/indexing/resources/incoming_links.txt (No
such file or directory)
at java.io.FileInputStream.open(Native Method)
at java.io.FileInputStream.<init>(FileInputStream.java:146)
at
org.apache.stanbol.entityhub.indexing.core.source.LineBasedEntityIterator.initialise(LineBasedEntityIterator.java:474)
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)