Rajan Shah created STANBOL-1420:
-----------------------------------

             Summary: Indexer for the Freebase knowledge base
                 Key: STANBOL-1420
                 URL: https://issues.apache.org/jira/browse/STANBOL-1420
             Project: Stanbol
          Issue Type: Bug
          Components: Entityhub
    Affects Versions: entityhub-0.11.0
         Environment: Mac OS  Yosemite 10.10.3 
            Reporter: Rajan Shah
            Priority: Blocker
             Fix For: 1.0.0


Hi,

I am working on the HEAD branch of the stanbol repo and observed following 
while building freebase entityhub indexing.

1. target directory contains following jars

directory: stanbol/entityhub/indexing/freebase/target
org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT-sources.jar
org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT.jar
original-org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT.jar

2. per documentation copy these jars to some place where freebase data resides 
For ex. /tmp/freebase/

3. Initialize configuration by issuing following command
The configuration can be initialized with the defaults by calling

    java -jar org.apache.stanbol.entityhub.indexing.freebase-*.jar init

The above results into an error stating 

"no main manifest attribute, in 
org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT-sources.jar"

If I change it to "java -jar 
org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT.jar init" it 
works however it might not be generating all the files as it generates errors 
in next step.

4. java -jar -Xmx32g 
org.apache.stanbol.entityhub.indexing.freebase-1.0.0-SNAPSHOT.jar index => This 
command generates error as follows:

Exception in thread "Thread-2" 00:02:35,705 [Thread-0] INFO  
solryard.SolrYardIndexingDestination -  ... copy Solr Configuration form 
/private/tmp/freebase/freebase-index/indexing/config/freebase to 
/private/tmp/freebase/freebase-index/indexing/destination/indexes/default/freebase
java.lang.IllegalStateException: The file with the Entity Scores is missing
        at 
org.apache.stanbol.entityhub.indexing.core.source.LineBasedEntityIterator.initialise(LineBasedEntityIterator.java:476)
        at 
org.apache.stanbol.entityhub.indexing.core.impl.IndexingSourceInitialiser.run(IndexingSourceInitialiser.java:43)
        at java.lang.Thread.run(Thread.java:745)
Caused by: java.io.FileNotFoundException: 
/private/tmp/freebase/freebase-index/indexing/resources/incoming_links.txt (No 
such file or directory)
        at java.io.FileInputStream.open(Native Method)
        at java.io.FileInputStream.<init>(FileInputStream.java:146)
        at 
org.apache.stanbol.entityhub.indexing.core.source.LineBasedEntityIterator.initialise(LineBasedEntityIterator.java:474)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to