dcausse created this task.
dcausse added a project: Wikidata-Query-Service.
Restricted Application added a subscriber: Aklapper.

TASK DESCRIPTION
  In T302189 <https://phabricator.wikimedia.org/T302189> it was reported:
  
  In T302189#8501314 <https://phabricator.wikimedia.org/T302189#8501314>, 
@Nikki wrote:
  
  > This report of grammatical features 
<https://www.wikidata.org/wiki/Wikidata:Lexicographical_data/Statistics/Count_of_forms_by_grammatical_feature>
 is wrong because it includes deleted data. Like with the previous queries I 
mentioned, I'm unable to fix it because that takes it from running in under a 
second to timing out.
  >
  > This query 
<https://query.wikidata.org/#select%20*%20%7B%20?f%20wikibase:grammaticalFeature%20wd:Q109459317%20%7D>
 returns a form which was deleted 11 months ago.
  >
  > (Here's 
<https://query.wikidata.org/#select%20%2a%20%7B%0A%20%20%3Ff%20ontolex%3Arepresentation%20%5B%5D.%0A%20%20minus%20%7B%20%3Fl%20ontolex%3AlexicalForm%20%3Ff%20%7D%0A%7D%20limit%20100>
 100 forms which need cleaning up)
  
  It suggests that forms of deleted lexemes are leaked in triple store.
  
  Triples whose subject have a form that is attached to a deleted Lexeme using 
`ontolex:lexicalForm` should be deleted as well.
  It might be possible that senses are leaked too (attached with 
`ontolex:sense`).
  
  AC:
  
  - when a Lexeme is deleted all its forms and senses should be removed from 
WDQS.

TASK DETAIL
  https://phabricator.wikimedia.org/T326311

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse
Cc: dcausse, Nikki, Aklapper, AWesterinen, MPhamWMF, CBogen, Namenlos314, Gq86, 
Lucas_Werkmeister_WMDE, EBjune, merbst, Jonas, Xmlizer, jkroll, Wikidata-bugs, 
Jdouglas, aude, Tobias1984, Manybubbles
_______________________________________________
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org

Reply via email to