[Wikidata] Why do these two SPARQL queries take such different times to run?

James Heald Wed, 09 Sep 2015 06:06:59 -0700

Prompted by this thread at Project Chat,
  https://www.wikidata.org/wiki/Wikidata:Project_chat#Identical_data_sets

here's a query to find multiple humans with nationality:Greece that havethe same day of birth and day of death:

  http://tinyurl.com/ow6lpen
It produces one pair, and executes in about 0.6 seconds.

Here's a query to try to add item numbers and labels to the previous search:
  http://tinyurl.com/ovjwzc9

It *just* completes, taking just over 60 seconds to execute.

(Please don't merge the two items yet, because that will destroy theexample).

Analogous queries with lookups for France (71 apparent sets ofduplicates), UK (32), and Italy(14) fail to complete.



Two questions therefore:
(1)  Why are the two queries taking such different times to run ?
(2)  Is there a good way to rewrite the second to make it faster ?

Obviously the second query as written at the moment involves asub-query, which inevitably must make it a bit slower -- but given thesolution set of the sub-query only has two rows, and an exact date for agiven property ought to be a fairly quick key to look up, why is thesecond query taking 100 times longer than the first ?

And is there a better way I should be doing this, since the query doesappear to be producing useful real matches ?


Thanks,

   James.


_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata

[Wikidata] Why do these two SPARQL queries take such different times to run?

Reply via email to