Re: [Wikidata] Can mainsnak.datatype be included in the pages-articles.xml dump?

2016-11-28 Thread gnosygnu
> If you have problems accessing the datatype from Lua or elsewhere, let me
> know.
Honestly, I haven't tried. Just so you know, I'm the developer of XOWA, which is an offline wiki app in Java. As such, I'm accessing Wikidata data directly, not through the Wikibase code. (If you're curious, I

Re: [Wikidata] Can mainsnak.datatype be included in the pages-articles.xml dump?

2016-11-28 Thread Daniel Kinzler
On 28.11.2016 at 17:34, gnosygnu wrote:
>> The datatype is implicit, it can be derived from the property ID. You can
>> find it by looking at the Property page's JSON.
>> ...
>
> Thanks for all the info. I see my error. I didn't realize that
> mainsnak.datatype was inferred. I assumed it
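The point quoted above, that the datatype can be derived from the property ID by looking at the Property page's JSON, can be sketched roughly as follows. This is only an illustration under stated assumptions: the two JSON strings are trimmed, made-up samples, the field layout (`claims`, `mainsnak`, `property`, `datatype`) is assumed to match the entity JSON being read, and the helper names `build_datatype_index` and `fill_datatypes` are hypothetical, not part of Wikibase or XOWA.

```python
import json

# Hypothetical, heavily trimmed samples; real entities carry many more fields.
PROPERTY_JSON = '{"type": "property", "id": "P31", "datatype": "wikibase-item"}'
ITEM_JSON = """
{"type": "item", "id": "Q38",
 "claims": {"P31": [{"mainsnak": {"snaktype": "value", "property": "P31",
                                  "datavalue": {"type": "wikibase-entityid",
                                                "value": {"entity-type": "item",
                                                          "numeric-id": 1}}}}]}}
"""


def build_datatype_index(property_entities):
    """Map property ID -> datatype, read from each Property entity's JSON."""
    return {
        p["id"]: p["datatype"]
        for p in property_entities
        if p.get("type") == "property"
    }


def fill_datatypes(item, index):
    """Add mainsnak.datatype where it is missing, looked up by property ID."""
    for statements in item.get("claims", {}).values():
        for statement in statements:
            snak = statement["mainsnak"]
            if "datatype" not in snak and snak["property"] in index:
                snak["datatype"] = index[snak["property"]]
    return item


index = build_datatype_index([json.loads(PROPERTY_JSON)])
item = fill_datatypes(json.loads(ITEM_JSON), index)
print(item["claims"]["P31"][0]["mainsnak"]["datatype"])
```

Building the index once from the Property entities and reusing it for every item avoids re-reading a Property page for each snak.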

Re: [Wikidata] Can mainsnak.datatype be included in the pages-articles.xml dump?

2016-11-28 Thread Daniel Kinzler
On 28.11.2016 at 16:31, gnosygnu wrote:
>> If you are also using the same software (Wikibase on MediaWiki), the XML
>> dumps should Just Work (tm). The idea of the XML dumps is that the "text"
>> blobs are opaque to 3rd parties, but will continue to work with future
>> versions of MediaWiki &

Re: [Wikidata] Can mainsnak.datatype be included in the pages-articles.xml dump?

2016-11-28 Thread gnosygnu
> If you are also using the same software (Wikibase on MediaWiki), the XML dumps
> should Just Work (tm). The idea of the XML dumps is that the "text" blobs are
> opaque to 3rd parties, but will continue to work with future versions of
> MediaWiki & friends (with a compatible configuration - which

Re: [Wikidata] Can mainsnak.datatype be included in the pages-articles.xml dump?

2016-11-27 Thread Daniel Kinzler
On 27.11.2016 at 01:15, gnosygnu wrote:
> This is useful, but unfortunately it won't suffice. Wikidata also has
> pages which are wikitext (for example,
> https://www.wikidata.org/wiki/Wikidata:WikiProject_Names). These
> wikitext pages are in the XML dumps, but aren't in the stub dumps nor
> the

Re: [Wikidata] Can mainsnak.datatype be included in the pages-articles.xml dump?

2016-11-26 Thread gnosygnu
Hi Daniel,
Thanks for the quick and helpful reply. I was hoping that the XML dumps could be changed, but I understand now that the JSON dumps are the recommended format.
> To avoid downloading redundant information, you can use one of the
> wikidatawiki-20161120-stub-* dumps instead of the full

Re: [Wikidata] Can mainsnak.datatype be included in the pages-articles.xml dump?

2016-11-26 Thread Daniel Kinzler
Hi gnosygnu! The JSON in the XML dumps is the raw contents of the storage backend. It can't be changed retroactively, and re-encoding everything on the fly would be too expensive. Also, the JSON embedded in the XML files is not officially supported as a stable interface of Wikibase. The JSON

[Wikidata] Can mainsnak.datatype be included in the pages-articles.xml dump?

2016-11-25 Thread gnosygnu
Hi everyone. I have a question about the Wikidata XML dump, but I'm posting this question here, because it looks more related to Wikidata. In short, it seems that the "pages-articles.xml" does not include the datatype property for snaks. For example, the XML dump does not list a datatype for Q38
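For readers who want to check the observation above for themselves, here is a minimal sketch of how one might scan dump pages for snaks that lack a datatype. It rests on several assumptions: the sample XML is a made-up, simplified stand-in for the real export format, each entity page is assumed to carry its JSON in a `text` element, and `snaks_missing_datatype` is a hypothetical helper name. A real dump is far too large for `ET.fromstring` and would need streaming parsing.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical, simplified stand-in for a pages-articles.xml excerpt.
SAMPLE_XML = """<mediawiki>
  <page>
    <title>Q38</title>
    <revision>
      <text>{"type":"item","id":"Q38","claims":{"P31":[{"mainsnak":{"snaktype":"value","property":"P31"}}]}}</text>
    </revision>
  </page>
  <page>
    <title>Wikidata:WikiProject_Names</title>
    <revision>
      <text>Some wikitext, not JSON.</text>
    </revision>
  </page>
</mediawiki>"""


def local(tag):
    """Strip an XML namespace prefix such as '{uri}page' down to 'page'."""
    return tag.rsplit("}", 1)[-1]


def snaks_missing_datatype(xml_text):
    """Return (page title, property ID) pairs whose mainsnak has no datatype."""
    missing = []
    root = ET.fromstring(xml_text)
    for page in root.iter():
        if local(page.tag) != "page":
            continue
        title = text = None
        for el in page.iter():
            name = local(el.tag)
            if name == "title":
                title = el.text
            elif name == "text":
                text = el.text
        if not text:
            continue
        try:
            entity = json.loads(text)
        except ValueError:
            continue  # wikitext pages are not JSON; skip them
        if not isinstance(entity, dict):
            continue
        claims = entity.get("claims", {})
        if not isinstance(claims, dict):
            continue  # defensive: claims may not be an object
        for pid, statements in claims.items():
            for statement in statements:
                if "datatype" not in statement.get("mainsnak", {}):
                    missing.append((title, pid))
    return missing


missing = snaks_missing_datatype(SAMPLE_XML)
print(missing)
```

The wikitext page in the sample is skipped because its text does not parse as JSON, which mirrors the mix of entity pages and wikitext pages mentioned elsewhere in the thread.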