Isaac added a comment.

  Updates:
  
  - Finally ported all the code from the API to work on the cluster. I don't 
know if it'll run to completeness yet but I ran it on a subset and the results 
largely matched the API: 
https://gitlab.wikimedia.org/isaacj/miscellaneous-wikimedia/-/blob/master/annotation-gap/wikidata-completeness.ipynb
    - Notably, I got rid of the statsmodel ordinal logistic regression 
dependency which was painful and just take the parameters/thresholds from the 
model and do the math myself.
  - Next step will be running this fully or on a sample of data and then 
choosing a sample of items to provide to raters to compare the scores and 
choose whether the quality or completeness models seems to best capture the 
concept of "this Wikidata item is in good shape".

TASK DETAIL
  https://phabricator.wikimedia.org/T321224

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Isaac
Cc: Michael, Lydia_Pintscher, diego, Miriam, Isaac, KinneretG, Astuthiodit_1, 
YLiou_WMF, karapayneWMDE, Invadibot, Ywats0ns, maantietaja, ItamarWMDE, 
Akuckartz, Nandana, Abdeaitali, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, Avner, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Capt_Swing, Mbch331
_______________________________________________
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org

Reply via email to