MicrobeBot created this task.
MicrobeBot added a subscriber: MicrobeBot.
MicrobeBot added a project: Wikidata-Query-Service.
Herald added a subscriber: Aklapper.
Herald added projects: Wikidata, Discovery.

TASK DESCRIPTION
  SPARQL queries searching for items added months ago fail to locate the item.  
For example the disease item https://www.wikidata.org/wiki/Q18975220 has the 
statement P486 (MeSH ID) with string value "D004927".
  Executing the following SPARQL query searching for the item with statement 
P486 and value "D004927" returns nothing.  
  
  
https://query.wikidata.org/#PREFIX%20wd%3A%20%3Chttp%3A%2F%2Fwww.wikidata.org%2Fentity%2F%3E%20%0APREFIX%20wdt%3A%20%3Chttp%3A%2F%2Fwww.wikidata.org%2Fprop%2Fdirect%2F%3E%0A%0ASELECT%20*%20WHERE%20%7B%0A%20%20%3Fdisease%20wdt%3AP486%20%20%22D004927%22%20.%0A%20%20%0A%7D.
  
  If you execute the following query (searching for all items with MeSH IDs, 
4652 results come back and the above item is not in that list.
  
  
https://query.wikidata.org/#PREFIX%20wd%3A%20%3Chttp%3A%2F%2Fwww.wikidata.org%2Fentity%2F%3E%20%0APREFIX%20wdt%3A%20%3Chttp%3A%2F%2Fwww.wikidata.org%2Fprop%2Fdirect%2F%3E%0A%0ASELECT%20*%20WHERE%20%7B%0A%20%20%3Fdisease%20wdt%3AP486%20%20%3Fitem%20.%0A%20%20%0A%7D
  
  If you then take an MeSH ID value from that list and query for that, it is 
returned as follows:
  
  
https://query.wikidata.org/#PREFIX%20wd%3A%20%3Chttp%3A%2F%2Fwww.wikidata.org%2Fentity%2F%3E%20%0APREFIX%20wdt%3A%20%3Chttp%3A%2F%2Fwww.wikidata.org%2Fprop%2Fdirect%2F%3E%0A%0ASELECT%20*%20WHERE%20%7B%0A%20%20%3Fdisease%20wdt%3AP486%20%20%22D003093%22%20.%0A%20%20%0A%7D
  
  This would indicate that the above item 
https://www.wikidata.org/wiki/Q18975220 is not in the SPARQL endpoint, however 
it is possible to query for it when searching for a Wikidata Item datatype like 
so:
  
  
https://query.wikidata.org/#PREFIX%20wd%3A%20%3Chttp%3A%2F%2Fwww.wikidata.org%2Fentity%2F%3E%20%0APREFIX%20wdt%3A%20%3Chttp%3A%2F%2Fwww.wikidata.org%2Fprop%2Fdirect%2F%3E%0A%0ASELECT%20*%20WHERE%20%7B%0A%20%20%3Fdisease%20wdt%3AP828%20%20%20wd%3AQ21102933%20%20.%0A%20%20%0A%7D
  
  Summary:  The item is only found when querying for WD Item datatype claims.  
String datatype property claims are not queryable for this item, even though 
these claims have been on there since before August 2015. 
  
  Known scope of the issue:
  Using the backlink api feature I returned all items that link to page 
Property:P486 and after parsing out user pages there were 5006 items with MeSH 
IDs (P486).  As mentioned above the SPARQL query only returns 4652 results 
indicating almost 400 items are missing from the endpoint when searching by 
string datatype properties.
  
  This is a very high priority for our Project Molecular Biology Bots because 
they identify items based on external identifiers.

TASK DETAIL
  https://phabricator.wikimedia.org/T125558

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: MicrobeBot
Cc: MicrobeBot, Aklapper, debt, Izno, jkroll, Smalyshev, Wikidata-bugs, 
Jdouglas, aude, Deskana, Manybubbles, Mbch331



_______________________________________________
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to