Lydia_Pintscher created this task.
Lydia_Pintscher added projects: Item Quality Scoring Improvement, Wikidata.
Restricted Application added a subscriber: Aklapper.

TASK DESCRIPTION
  **Problem:**
  We would like to see whether the new model is better than the old model at predicting the quality of Items. To do this, we want to check how the old and new models perform on the new training data we collected.

  **Acceptance criteria:**
  [ ] we have an overview of how many Items the old model judges to be A/B/C/D/E class compared to the human judgement
  [ ] we have an overview of how many Items the new model judges to be A/B/C/D/E class compared to the human judgement

TASK DETAIL
  https://phabricator.wikimedia.org/T261849

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Lydia_Pintscher
Cc: Aklapper, Lydia_Pintscher, guergana.tzatchkova, Akuckartz, darthmon_wmde, Michael, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Ladsgroup, Mbch331
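
One possible way to produce the overview asked for in the acceptance criteria is a per-class count of model judgement against human judgement, built once for each model. The sketch below is only an illustration under assumptions: the function names `confusion_table` and `format_table` are hypothetical, the label lists are made-up placeholders rather than real training data, and nothing here reflects how the old or new model is actually invoked.

```python
from collections import Counter

CLASSES = ["A", "B", "C", "D", "E"]


def confusion_table(human_labels, model_labels, classes=CLASSES):
    """Count Items for every (human class, model class) pair."""
    if len(human_labels) != len(model_labels):
        raise ValueError("label lists must have the same length")
    pairs = Counter(zip(human_labels, model_labels))
    # Rows are the human judgement, columns are the model judgement.
    return {h: {m: pairs[(h, m)] for m in classes} for h in classes}


def format_table(table, classes=CLASSES):
    """Render the counts as a plain-text grid."""
    header = "human\\model " + " ".join(f"{c:>3}" for c in classes)
    rows = [
        f"{h:<11} " + " ".join(f"{table[h][m]:>3}" for m in classes)
        for h in classes
    ]
    return "\n".join([header] + rows)


# Made-up labels for illustration only (not real data).
human = ["A", "B", "C", "C", "E", "D"]
old_model = ["A", "C", "C", "D", "E", "D"]
new_model = ["A", "B", "C", "C", "E", "E"]

old_table = confusion_table(human, old_model)
new_table = confusion_table(human, new_model)

print("Old model vs. human judgement")
print(format_table(old_table))
print("New model vs. human judgement")
print(format_table(new_table))
```

Counts on the diagonal are Items where the model and the human judgement agree; comparing the two grids side by side would show where each model over- or under-rates Items.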

_______________________________________________
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs