Re: [Wikidata] Announcing the release of the Wikidata Query Service

2015-09-07 Thread Stas Malyshev
Hi! > I am particularly looking forward to the tool that builds a query.. The > examples as provided proved really important for me to start using these > tools. I really hope for a similar service for the new service. That's where community input/contribution is very welcome :) > The documentat

Re: [Wikidata] Announcing the release of the Wikidata Query Service

2015-09-07 Thread Gerard Meijssen
Hoi, Wonderful to learn that we finally have progressed towards a live query system. Is it your intention that tools will use this service and, do you hope / anticipate that the tools by Magnus will move towards this new system ? I am particularly looking forward to the tool that builds a query..

Re: [Wikidata] (Almost) empty items

2015-09-07 Thread Stas Malyshev
Hi! > My recent tests produced lists of empty or almost empty items, meaning > that they have no sitelinks, no statements, no label (almost empty), and > sometimes no descriptions or aliases either (empty). Many of the empty > ones seem to be redirects now, but not all (e.g. Q18482644). > > Maybe

Re: [Wikidata] Item count

2015-09-07 Thread Daniel Kinzler
Thanks for investigating, Makrus! Am 07.09.2015 um 22:54 schrieb Markus Krötzsch: > On 07.09.2015 22:10, Markus Krötzsch wrote: >> On 07.09.2015 21:48, Markus Krötzsch wrote: >> ... >>> >>> I'll count how many of each we have. Back in 30min. >> >> This does not seem to be the explanation after all

[Wikidata] Announcing the release of the Wikidata Query Service

2015-09-07 Thread Dan Garry
The Discovery Department at the Wikimedia Foundation is pleased to announce the release of the Wikidata Query Service ! You can find the interface for the service at https://query.wikidata.org. The Wikidata Query Service is designed to let use

[Wikidata] (Almost) empty items

2015-09-07 Thread Markus Krötzsch
Hi all, My recent tests produced lists of empty or almost empty items, meaning that they have no sitelinks, no statements, no label (almost empty), and sometimes no descriptions or aliases either (empty). Many of the empty ones seem to be redirects now, but not all (e.g. Q18482644). Maybe so

Re: [Wikidata] Item count

2015-09-07 Thread Markus Krötzsch
On 07.09.2015 22:10, Markus Krötzsch wrote: On 07.09.2015 21:48, Markus Krötzsch wrote: ... I'll count how many of each we have. Back in 30min. This does not seem to be the explanation after all. I could only find 33 items in total that have no data at all. If I also count items that have not

Re: [Wikidata] Source statistics

2015-09-07 Thread Stas Malyshev
Hi! > A small fix though: I think you should better use count(?statement) > rather than count(?ref), right? Yes, of course, my mistake - I modified it from different query and forgot to change it. > I have tried a similar query on the public test endpoint on labs > earlier, but it timed out for

Re: [Wikidata] Source statistics

2015-09-07 Thread Markus Krötzsch
On 07.09.2015 21:45, Stas Malyshev wrote: Hi! I'm wondering if there is a way (SQL, api, tool or otherwise) for finding out how often a particular source is used on Wikidata. Something like this probably would work: http://tinyurl.com/plssk4j This runs the following query: prefix prov:

Re: [Wikidata] Item count

2015-09-07 Thread Andrew Gray
How many items have no sitelinks at all (regardless of labels, properties, etc)? That might be a more substantial number... Andrew. On 7 September 2015 at 21:10, Markus Krötzsch wrote: > On 07.09.2015 21:48, Markus Krötzsch wrote: > ... >> >> >> I'll count how many of each we have. Back in 30min

Re: [Wikidata] Item count

2015-09-07 Thread Markus Krötzsch
On 07.09.2015 21:48, Markus Krötzsch wrote: ... I'll count how many of each we have. Back in 30min. This does not seem to be the explanation after all. I could only find 33 items in total that have no data at all. If I also count items that have nothing but descriptions or aliases, I get 589

Re: [Wikidata] Item count

2015-09-07 Thread Markus Krötzsch
On 07.09.2015 19:37, Daniel Kinzler wrote: Am 07.09.2015 um 18:05 schrieb Emilio J. Rodríguez-Posada: Wow, that is a big difference. Almost 4 million. I think that MediaWiki doesn't count pages without any [[link]]. Is that the reason? No, that only applies to Wikitext. Here is the relevant

Re: [Wikidata] Source statistics

2015-09-07 Thread Stas Malyshev
Hi! > I'm wondering if there is a way (SQL, api, tool or otherwise) for > finding out how often a particular source is used on Wikidata. Something like this probably would work: http://tinyurl.com/plssk4j This runs the following query: prefix prov: prefix pr:

Re: [Wikidata] Item count

2015-09-07 Thread Stas Malyshev
Hi! > We have 725691 redirects per SPARQL engine. We do have some sizeable > number of entities which have no statements (alas!) but I have hard time > believing we have ~3 mln of those not having even a single label. Unless > there's some bot gone wild here. The problem is that if entity has no >

Re: [Wikidata] Item count

2015-09-07 Thread Stas Malyshev
Hi! > Is it possible that the difference of 3,694,285 is mainly redirects? Which > dump We have 725691 redirects per SPARQL engine. We do have some sizeable number of entities which have no statements (alas!) but I have hard time believing we have ~3 mln of those not having even a single label.

Re: [Wikidata] Item count

2015-09-07 Thread Addshore
I know that over the past 9 months I have created 500,000 redirects. Other than that I would guess that maybe 100,000 other redirect have been created, at most 500,000 more meaning 1,000,000 in total. Such a big difference does seem rather odd to me... On 7 September 2015 at 19:37, Daniel Kinzler

Re: [Wikidata] Item count

2015-09-07 Thread Daniel Kinzler
Am 07.09.2015 um 18:05 schrieb Emilio J. Rodríguez-Posada: > Wow, that is a big difference. Almost 4 million. > > I think that MediaWiki doesn't count pages without any [[link]]. Is that the > reason? No, that only applies to Wikitext. Here is the relevant code from ItemContent: public

[Wikidata] weekly summary #174

2015-09-07 Thread Lydia Pintscher
Hey folks :) Here's what's been happening around Wikidata over the last week: Events /Press/Blogs - Wikimedia Grafana graphs of Wikidata profiling information

Re: [Wikidata] Item count

2015-09-07 Thread Emilio J . Rodríguez-Posada
Wow, that is a big difference. Almost 4 million. I think that MediaWiki doesn't count pages without any [[link]]. Is that the reason? 2015-09-07 17:39 GMT+02:00 Markus Krötzsch : > Hi all, > > The main page of Wikidata shows an item count that is getting increasingly > out of synch with reality.

[Wikidata] Item count

2015-09-07 Thread Markus Krötzsch
Hi all, The main page of Wikidata shows an item count that is getting increasingly out of synch with reality. The 31 Aug dump contains 18,483,096 items, while the front page says that there are 14,788,811 now. I think this is caused by how MediaWiki counts "articles" (which is not what we are

[Wikidata] Wikidata's 3 birthday is coming up

2015-09-07 Thread Lydia Pintscher
Hey folks :) Wikidata's birthday is coming up in less than 2 months (29th of October). We're currently brainstorming some cool ideas for presents. Last year we had a few very cool ones from different corners of our community: https://www.wikidata.org/wiki/Wikidata:Second_Birthday So if you're int

Re: [Wikidata] [ANNOUNCEMENT] first StrepHit dataset for the primary sources tool

2015-09-07 Thread Markus Krötzsch
Dear Marco, Sounds interesting, but the project page still has a lot of gaps. Will you notify us again when you are done? It is a bit tricky to endorse a proposal that is not finished yet ;-) Markus On 04.09.2015 17:01, Marco Fossati wrote: [Begging pardon if you have already read this in t

Re: [Wikidata] Source statistics

2015-09-07 Thread Markus Krötzsch
On 07.09.2015 14:25, Edgard Marx wrote: Is not an updated version, but dbtrends.aksw.org I am getting an error there. Is the server down maybe? Markus best, Edgard On Mon, Sep 7, 2015 at 1:25 PM, André Costa mailto:andre.co...@wikimedia.se>> wrote: Hi all!

Re: [Wikidata] Source statistics

2015-09-07 Thread Markus Krötzsch
P.S. If you want to do this yourself to play with it, below is the relevant information on how I wrote this code (looks a bit clumsy in email, but I don't have time now to set up a tutorial page ;-). Markus (1) I modified the example program "EntityStatisticsProcessor" that is part of Wikida

Re: [Wikidata] Source statistics

2015-09-07 Thread Markus Krötzsch
Hi André, I just made a small counting program with Wikidata Toolkit to count unique references. Running it on the most recent dump took about 30min. I uploaded the results: http://tools.wmflabs.org/wikidata-exports/statistics/20150831/reference-counts-50.txt The file lists all references th

Re: [Wikidata] Source statistics

2015-09-07 Thread Edgard Marx
Is not an updated version, but dbtrends.aksw.org best, Edgard On Mon, Sep 7, 2015 at 1:25 PM, André Costa wrote: > Hi all! > > I'm wondering if there is a way (SQL, api, tool or otherwise) for finding > out how often a particular source is used on Wikidata. > > The background is a collaboratio

[Wikidata] Source statistics

2015-09-07 Thread André Costa
Hi all! I'm wondering if there is a way (SQL, api, tool or otherwise) for finding out how often a particular source is used on Wikidata. The background is a collaboration with two GLAMs where we have used ther open (and CC0) datasets to add and/or source statements on Wikidata for items on which