Pintoch added a comment.

@Lydia_Pintscher personally here is what I would concretely implement in the EditGroups tool. For each edit that is part of an edit group:

  • fetch the constraints violations before and after the edit (this fetching would happen as the edit is retrieved, so in near real-time)
  • compute the difference of constraints violations of each type (for instance, 1 new "value type constraint" violation and 2 less "statement required constraint" violation)
  • aggregate these statistics at a batch level and expose them in batch views (for instance, this batch added 342 new value type constraint" violations and solved 764 "statement required constraint" violations)

Together with the number of reverted edits in a batch (which the tool already aggregates), this could potentially make it easier to spot problematic batches.

Other ideas of applications (that I would not write myself):

  • I am not involved in ORES development but I believe the statistics computed above (at edit level) could be useful for vandalism detection - if the constraints violations are already computed and cheap to retrieve, it might be much easier for them to rely on that.
  • I believe @Tpt could be interested in this as he has been working on detecting edits which introduce / solve constraint violations.

TASK DETAIL
https://phabricator.wikimedia.org/T204024

EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Pintoch
Cc: Lydia_Pintscher, Pintoch, Tpt, Smalyshev, Eevans, daniel, mobrovac, Jonas, Lucas_Werkmeister_WMDE, Aklapper, Addshore, Lahi, Gq86, GoranSMilovanovic, QZanden, merbst, LawExplorer, Agabi10, Hardikj, Wikidata-bugs, aude, Mbch331, fgiunchedi
_______________________________________________
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to