Manuel created this task.
Manuel added projects: Wikidata, Wikidata Analytics (Kanban).
Restricted Application added a subscriber: Aklapper.

TASK DESCRIPTION
  WMDE Analytics Request
  ======================
  
  This task was generated using the Wikidata Analytics Request form 
<https://phabricator.wikimedia.org/project/profile/5408/>. Please use this 
template to create issues for the team. Thank you!
  
  Purpose
  -------
  
  Rationale of how the requested analysis will impact your decision
  *We’re currently building an endpoint to create an item from scratch. We’ve 
added a “rule” that does not allow users to create empty items, i.e., they must 
add at least a label or a description in one language, which follows the logic 
of creating an item in the UI.
  *However, I can create an item with one of these as a placeholder, then go 
to the item and remove it, leaving an empty item. This “loophole” can also be 
exploited with a combination of REST API endpoints.
  *There is no check for this on the Wikidata UI and I want to keep the REST 
API & UI as aligned as possible. There seems to be qualitative feedback that, 
historically, developers at least have needed this check to be absent (hence 
why the Action API doesn’t have it). I’d like to look at the actual data, talk 
it through with the Wikidata PMs, and make a final decision that is aligned 
across the board.
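  
  The loophole described above can be illustrated with a minimal in-memory 
sketch. It does not call the REST API or the Action API; the `Item` class and 
the `create_item` and `remove_label` helpers are hypothetical names introduced 
only for illustration, and "empty" is simplified here to "no labels and no 
descriptions".

```python
from dataclasses import dataclass, field


@dataclass
class Item:
    """Hypothetical in-memory stand-in for an Item (not the real data model)."""
    labels: dict = field(default_factory=dict)        # language code -> label
    descriptions: dict = field(default_factory=dict)  # language code -> description

    def is_empty(self) -> bool:
        # Simplifying assumption: only labels and descriptions are modelled.
        return not self.labels and not self.descriptions


def create_item(labels=None, descriptions=None) -> Item:
    """Creation-time rule: reject Items without any label or description."""
    item = Item(dict(labels or {}), dict(descriptions or {}))
    if item.is_empty():
        raise ValueError("an Item needs at least one label or description")
    return item


def remove_label(item: Item, language: str) -> None:
    """Later edit: no emptiness check is applied here, which is the loophole."""
    item.labels.pop(language, None)


item = create_item(labels={"en": "placeholder"})  # passes the creation rule
remove_label(item, "en")                          # follow-up edit removes it
print(item.is_empty())  # True
```

  The check only guards the creation step, so any later removal can still 
produce an empty item unless the same rule is enforced on every edit.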
  
  Rationale of how your decision will impact the Wikidata strategy
  *We want the REST API to encourage behavior that serves the strategy of 
having good data quality (no empty Items). 
  *At the same time, it should also become a full substitute for the main use 
cases of the Action API, so we need to understand whether we are breaking 
anything important.
  
  Scope
  -----
  
  Please include the specific questions that the analysis should answer.
  
  - How often are empty items created and either filled in later or never 
filled in at all?
  
  Desired output
  --------------
  
  Population A: All Items
  *All Wikidata Items (including deleted Items, and independent of source, e.g. 
including UI and Action API edits)
  
  Outputs
  *Number of Items in population A that were created empty (see this 
<https://wikidata.beta.wmflabs.org/w/index.php?title=Q628257&oldid=1357018> 
example)
  
  Population B: Items that were created empty
  *Items in population A that were created empty (see this 
<https://wikidata.beta.wmflabs.org/w/index.php?title=Q628257&oldid=1357018> 
example)
  
  Outputs
  *Number of Items in population B that are currently deleted
  
  (Next step only if there is a significant number of Items that are currently 
not deleted)
  
  Population C: Current Items that were created empty
  *Items in population B that are currently not deleted
  
  Outputs
  *Number of Items in population C with
  **no further edits ever
  **at least one additional edit within 6 months after creation
  **first additional edit only 6 months or later after creation (= the rest)
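  
  As a rough sketch of how the populations and outputs above could be counted, 
assuming a per-Item summary has already been derived from the revision 
history: the `ItemHistory` record, its field names, and the 183-day 
approximation of "6 months" are all assumptions made for illustration, not 
part of the request or of any existing table.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List, Optional

SIX_MONTHS = timedelta(days=183)  # assumption: "6 months" approximated as 183 days


@dataclass
class ItemHistory:
    """Hypothetical per-Item summary; field names are illustrative only."""
    item_id: str
    created_at: datetime
    created_empty: bool
    is_deleted: bool
    later_edits: List[datetime]  # timestamps of edits after the creating revision


def classify(item: ItemHistory) -> Optional[str]:
    """Return the output bucket for an Item, or None if it was not created empty."""
    if not item.created_empty:
        return None  # belongs to population A only
    if item.is_deleted:
        return "B: deleted"
    if not item.later_edits:
        return "C: no further edits"
    if min(item.later_edits) - item.created_at <= SIX_MONTHS:
        return "C: edited within 6 months"
    return "C: first edited after 6 months"


t0 = datetime(2020, 1, 1)
sample = [
    ItemHistory("Q1", t0, False, False, []),
    ItemHistory("Q2", t0, True, True, []),
    ItemHistory("Q3", t0, True, False, []),
    ItemHistory("Q4", t0, True, False, [t0 + timedelta(days=10)]),
    ItemHistory("Q5", t0, True, False, [t0 + timedelta(days=400)]),
]

# Count Items per bucket; the sum over all buckets is the size of population B.
counts = Counter(b for b in map(classify, sample) if b is not None)
for bucket, n in sorted(counts.items()):
    print(bucket, n)
```

  Using the earliest follow-up edit keeps the three population C buckets 
mutually exclusive, so "the rest" contains only Items whose first additional 
edit came later than 6 months after creation.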
  
  Urgency
  -------
  
  Not really time sensitive, ideally within the next 2-4 weeks (is that 
realistic?)
  
  ---
  
  **Information below this point is filled out by the Wikidata Analytics team.**
  
  General Planning
  ----------------
  
  Internal Request by Ifrah
  
https://docs.google.com/document/d/1SA59_upd2NryZnVnhj-B6aLeKpkYBO4CkfZMh5ePIOc/edit
  
  Assignee Planning
  -----------------
  
  Information is filled out by the assignee of this task.
  
  Estimation
  ----------
  
  Estimate: 
  Actual:
  
  Sub Tasks
  ---------
  
  Full breakdown of the steps to complete this task:
  
  [ ] subtask
  
  Data to be used
  ---------------
  
  See Analytics/Data_Lake 
<https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake> for the breakdown of 
the data lake databases and tables.
  
  The following tables will be referenced in this task:
  
  - link_to_table
  
  Notes and Questions
  -------------------
  
  Things that came up during the completion of this task, questions to be 
answered and follow up tasks:
  
  - Note

TASK DETAIL
  https://phabricator.wikimedia.org/T360761

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Manuel
Cc: Aklapper, Ifrahkhanyaree_WMDE, Manuel, Danny_Benjafield_WMDE, 
Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, 
Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, KimKelting, LawExplorer, 
_jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
_______________________________________________
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
