Hi everyone,
I'm getting a hard time to run the Wikidata language links extractor
(WikidataLLExtractor).
It currently produces empty datasets.
I run it as usual, with:
cd dump
../run extraction {PROPERTIES_FILE}
Using the the following PROPERTIES_FILE:
base-dir=/home/dbpedia/data
require-download-complete=true
languages=wikidata
extractors=.WikidataLLExtractor
ontology=../ontology.xml
mappings=../mappings
uri-policy.uri=uri:en; generic:en; xml-safe-predicates:*; reject-long:*
uri-policy.iri=generic:en; xml-safe-predicates:*; reject-long:*
format.ttl.gz=turtle-triples;uri-policy.iri
The base-dir contains both the complete wikidata dump, i.e.:
wikidatawiki/{DATE}/wikidatawiki-{DATE}-pages-articles.xml.bz2
and the wikidatawiki-{DATE}-download-complete file
When running, I keep getting these exceptions, logged with the following
message, where WIKIDATA_ENTITY is for instance Q290:
feb 13, 2015 5:04:46 PM
org.dbpedia.extraction.dump.extract.ExtractionJob$$anonfun$1 apply
WARNING: error processing page
'title={WIKIDATA_ENTITY};ns=0/Main/;language:wiki=wikidata,locale=en':
org.json.JSONException: JSONObject["claims"] is not a JSONArray.
I'm using the freshly pulled master branch.
Any clue of what I'm doing wrong?
Thanks!
Cheers,
--
Marco Fossati
http://about.me/marco.fossati
Twitter: @hjfocs
Skype: hell_j
------------------------------------------------------------------------------
Dive into the World of Parallel Programming. The Go Parallel Website,
sponsored by Intel and developed in partnership with Slashdot Media, is your
hub for all things parallel software development, from weekly thought
leadership blogs to news, videos, case studies, tutorials and more. Take a
look and join the conversation now. http://goparallel.sourceforge.net/
_______________________________________________
Dbpedia-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-developers