Lydia_Pintscher created this task.
Lydia_Pintscher added projects: Item Quality Scoring Improvement, Wikidata.
Restricted Application added a subscriber: Aklapper.

TASK DESCRIPTION
  **Problem:**
  We would like to see if the new model is better than the old model in 
predicting the quality of Items. To do this we want to check how the old and 
new model performs with the new training data we collected.
  
  **Acceptance criteria:**
  
  [ ] we have an overview of how many Items the old model judges to be 
A/B/C/D/E class compared to the human judgement
  [ ] we have an overview of how many Items the new model judges to be 
A/B/C/D/E class compared to the human judgement

TASK DETAIL
  https://phabricator.wikimedia.org/T261849

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Lydia_Pintscher
Cc: Aklapper, Lydia_Pintscher, guergana.tzatchkova, Akuckartz, darthmon_wmde, 
Michael, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, 
rosalieper, Scott_WUaS, Wikidata-bugs, aude, Ladsgroup, Mbch331
_______________________________________________
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to