Manuel created this task. Manuel added projects: Wikidata, Wikidata Analytics (Kanban). Restricted Application added a subscriber: Aklapper.
TASK DESCRIPTION WMDE Analytics Request ====================== This task was generated using the Wikidata Analytics Request form <https://phabricator.wikimedia.org/project/profile/5408/>. Please use this template to create issues for the team. Thank you! Purpose ------- Rationale of how the requested analysis will impact your decision *We’re currently building an endpoint to create an item from scratch. We’ve added a “rule” that does not allow users to create empty items i.e., they must add at least a label or a description in one language, which follows the logic of creating an item in the UI *But, I can go and create an item with one of these as a placeholder. Then go to the item and remove them, leaving an empty item. This “loophole” can also be done with a combination of REST API endpoints. *There is no check for this on the Wikidata UI and I want to keep the REST API & UI as aligned as possible. There seems to be qualitative feedback that historically developers at least need this check to not be there (hence why the Action API doesn’t have it). I’d like to look at the actual data, talk it through with Wikidata PMs and make a final decision that is aligned across the board. Rationale of how your decision will impact the Wikidata strategy *We want the REST API to encourage behavior that serves the strategy of having good data quality (no empty Items). *At the same time it should also become a full substitute to the main use cases of the Action API, so we need to understand if we are breaking anything important or not. Scope ----- Please include the specific questions that the analysis should answer. - How often are empty items created and either filled in later or never filled in at all? Desired output -------------- Population A: All Items *All Wikidata Items (including deleted Items, and independent of source, e.g. including UI and Action API edits) Outputs *Number of Items in population A that were created empty (see this <https://wikidata.beta.wmflabs.org/w/index.php?title=Q628257&oldid=1357018> example) Population B: Items that were created empty *Items in population A that were created empty (see this <https://wikidata.beta.wmflabs.org/w/index.php?title=Q628257&oldid=1357018> example) Outputs *Number of Items in population B that are currently deleted (Next step only if there is a significant number of Items that are currently not deleted) Population C: Current Items that were created empty *Items in population B that are currently not deleted Outputs *Number of Items in population C with **no further edits ever **at least one additional edit within 6 months after creation **at least one additional edit after 6 months or later (=the rest) Urgency ------- Not really time sensitive, ideally within the next 2-4 weeks (is that realistic?) --- **Information below this point is filled out by the Wikidata Analytics team.** General Planning ---------------- Internal Request by Ifrah https://docs.google.com/document/d/1SA59_upd2NryZnVnhj-B6aLeKpkYBO4CkfZMh5ePIOc/edit Assignee Planning ----------------- Information is filled out by the assignee of this task. Estimation ---------- Estimate: Actual: Sub Tasks --------- Full breakdown of the steps to complete this task: [ ] subtask Data to be used --------------- See Analytics/Data_Lake <https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake> for the breakdown of the data lake databases and tables. The following tables will be referenced in this task: - link_to_table Notes and Questions ------------------- Things that came up during the completion of this task, questions to be answered and follow up tasks: - Note TASK DETAIL https://phabricator.wikimedia.org/T360761 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Manuel Cc: Aklapper, Ifrahkhanyaree_WMDE, Manuel, Danny_Benjafield_WMDE, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, KimKelting, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
_______________________________________________ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org