Hi Ali,

Is the extractor in your branch working? I'm using the 20150113 Wikidata dump.
If so, I would love to check it out for the new release of the Italian chapter.
Let me know.

Cheers!

On 2/13/15 5:24 PM, Ali Ismayilov wrote:
> Hi Marco,
>
> We have not merged the latest code into the main branch. It is still on my branch.
> There was also a problem with the latest Wikidata dump. We need some tests,
> and after that we will announce our code and merge it into the main branch.
>
> --
> Best regards,
> Ali Ismayilov
>
> On Fri, Feb 13, 2015 at 5:09 PM, Marco Fossati <[email protected]> wrote:
>
>     Hi everyone,
>
>     I'm having a hard time running the Wikidata language links extractor
>     (WikidataLLExtractor).
>     It currently produces empty datasets.
>
>     I run it as usual, with:
>
>         cd dump
>         ../run extraction {PROPERTIES_FILE}
>
>     Using the following PROPERTIES_FILE:
>
>         base-dir=/home/dbpedia/data
>         require-download-complete=true
>         languages=wikidata
>         extractors=.WikidataLLExtractor
>         ontology=../ontology.xml
>         mappings=../mappings
>         uri-policy.uri=uri:en; generic:en; xml-safe-predicates:*; reject-long:*
>         uri-policy.iri=generic:en; xml-safe-predicates:*; reject-long:*
>         format.ttl.gz=turtle-triples;uri-policy.iri
>
>     The base-dir contains both the complete Wikidata dump, i.e.:
>
>         wikidatawiki/{DATE}/wikidatawiki-{DATE}-pages-articles.xml.bz2
>
>     and the wikidatawiki-{DATE}-download-complete file.
>
>     When running, I keep getting exceptions logged with the following
>     message, where WIKIDATA_ENTITY is for instance Q290:
>
>         feb 13, 2015 5:04:46 PM
>         org.dbpedia.extraction.dump.extract.ExtractionJob$$anonfun$1 apply
>         WARNING: error processing page
>         'title={WIKIDATA_ENTITY};ns=0/Main/;language:wiki=wikidata,locale=en':
>         org.json.JSONException: JSONObject["claims"] is not a JSONArray.
>
>     I'm using the freshly pulled master branch.
>     Any clue as to what I'm doing wrong?
>     Thanks!
>     Cheers,
>     --
>     Marco Fossati
>     http://about.me/marco.fossati
>     Twitter: @hjfocs
>     Skype: hell_j
>
>     ------------------------------------------------------------------------------
>     Dive into the World of Parallel Programming. The Go Parallel Website,
>     sponsored by Intel and developed in partnership with Slashdot Media, is your
>     hub for all things parallel software development, from weekly thought
>     leadership blogs to news, videos, case studies, tutorials and more. Take a
>     look and join the conversation now. http://goparallel.sourceforge.net/
>     _______________________________________________
>     Dbpedia-developers mailing list
>     [email protected]
>     https://lists.sourceforge.net/lists/listinfo/dbpedia-developers

--
Marco Fossati
http://about.me/marco.fossati
Twitter: @hjfocs
Skype: hell_j
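
A note on the warning quoted above: `JSONObject["claims"] is not a JSONArray` is the message org.json produces when `getJSONArray("claims")` is called on a key whose value is not an array. One possible reading, not confirmed anywhere in this thread, is that the dump serializes "claims" as an object keyed by property ID while the extractor expects an array. The Python sketch below is only an illustration of a reader that tolerates both shapes. It is not the extractor's actual Scala code, and the function name `iter_claims` and the sample entity are hypothetical.

```python
import json


def iter_claims(entity):
    """Yield (property_id, claim) pairs from a parsed entity.

    Assumption: "claims" is either an object mapping property IDs to
    lists of claims, or a flat array of claims. In the array case the
    property ID is not known at this level, so None is yielded for it.
    """
    claims = entity.get("claims", [])
    if isinstance(claims, dict):
        for prop, group in claims.items():
            for claim in group:
                yield prop, claim
    elif isinstance(claims, list):
        for claim in claims:
            yield None, claim
    else:
        raise TypeError(
            "unexpected type for 'claims': %s" % type(claims).__name__
        )


# Made-up sample, not taken from a real dump.
sample = json.loads('{"id": "Q290", "claims": {"P31": [{"id": "c1"}]}}')
for prop, claim in iter_claims(sample):
    print(prop, claim["id"])
```

Checking the type before iterating avoids failing on the whole page when the shape differs from what the reader expects; whether this matches the fix on Ali's branch is unknown.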
