dcausse created this task.
dcausse added a project: Wikidata-Query-Service.
Restricted Application added a subscriber: Aklapper.

TASK DESCRIPTION
  In T302189 <https://phabricator.wikimedia.org/T302189> it was reported:

  In T302189#8501314 <https://phabricator.wikimedia.org/T302189#8501314>, @Nikki wrote:

  > This report of grammatical features <https://www.wikidata.org/wiki/Wikidata:Lexicographical_data/Statistics/Count_of_forms_by_grammatical_feature> is wrong because it includes deleted data. Like with the previous queries I mentioned, I'm unable to fix it because that takes it from running in under a second to timing out.
  >
  > This query <https://query.wikidata.org/#select%20*%20%7B%20?f%20wikibase:grammaticalFeature%20wd:Q109459317%20%7D> returns a form which was deleted 11 months ago.
  >
  > (Here's <https://query.wikidata.org/#select%20%2a%20%7B%0A%20%20%3Ff%20ontolex%3Arepresentation%20%5B%5D.%0A%20%20minus%20%7B%20%3Fl%20ontolex%3AlexicalForm%20%3Ff%20%7D%0A%7D%20limit%20100> 100 forms which need cleaning up)

  This suggests that forms of deleted lexemes are leaked in the triple store. When a Lexeme is deleted, the triples whose subject is one of its forms (attached to the Lexeme with `ontolex:lexicalForm`) should be deleted as well. Senses (attached with `ontolex:sense`) may be leaked in the same way.

  AC:
  - when a Lexeme is deleted, all its forms and senses should be removed from WDQS.

TASK DETAIL
  https://phabricator.wikimedia.org/T326311

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse
Cc: dcausse, Nikki, Aklapper, AWesterinen, MPhamWMF, CBogen, Namenlos314, Gq86, Lucas_Werkmeister_WMDE, EBjune, merbst, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles
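
For illustration only, the cleanup asked for in the task description can be sketched over a small in-memory set of triples. This is not the WDQS updater's actual logic. The function names, the sample data, and the use of `skos:definition` as the marker for senses are assumptions made for this sketch; the `ontolex:representation` marker for forms mirrors the query quoted in the task.

```python
# Minimal sketch: triples are modelled as (subject, predicate, object) tuples.
# All names below are hypothetical and exist only for this illustration.
LEXICAL_FORM = "ontolex:lexicalForm"
SENSE = "ontolex:sense"
REPRESENTATION = "ontolex:representation"
DEFINITION = "skos:definition"  # assumed marker predicate for senses


def find_orphans(triples):
    """Return form/sense subjects that no lexeme links to any more."""
    # Everything still attached to a lexeme via lexicalForm or sense.
    attached = {o for (_, p, o) in triples if p in (LEXICAL_FORM, SENSE)}
    # Everything that looks like a form or a sense.
    candidates = {s for (s, p, _) in triples if p in (REPRESENTATION, DEFINITION)}
    return candidates - attached


def drop_orphans(triples):
    """Return the triples whose subject is not an orphaned form or sense."""
    orphans = find_orphans(triples)
    return [t for t in triples if t[0] not in orphans]


# Made-up sample data: L1 still exists, L2 was deleted but its form and
# sense were left behind in the store.
SAMPLE = [
    ("wd:L1", LEXICAL_FORM, "wd:L1-F1"),
    ("wd:L1", SENSE, "wd:L1-S1"),
    ("wd:L1-F1", REPRESENTATION, '"cat"@en'),
    ("wd:L1-S1", DEFINITION, '"feline animal"@en'),
    ("wd:L2-F1", REPRESENTATION, '"gone"@en'),
    ("wd:L2-F1", "wikibase:grammaticalFeature", "wd:Q109459317"),
    ("wd:L2-S1", DEFINITION, '"leaked sense"@en'),
]

if __name__ == "__main__":
    print(sorted(find_orphans(SAMPLE)))
    print(len(drop_orphans(SAMPLE)))
```

Applied to the sample data, the sketch flags the form and the sense of the deleted lexeme and keeps only the triples that belong to the lexeme that still exists.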

_______________________________________________
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org