Isaac added a comment.

  The results reported in T249654#6352573 
<https://phabricator.wikimedia.org/T249654#6352573> have some potential insight 
into how we think about supporting patrolling of Wikidata transclusion within 
Wikipedia articles so I wanted to record some of my initial thoughts here. We 
would want to talk with patrollers before actually thinking about implementing 
any of these and unfortunately I'm not actually working on this aspect of the 
project at the moment. However: the recent changes feed for a given article 
likely has many more Wikidata-related changes than are actually pertinent to an 
article from a patrolling standpoint. Some thoughts on reducing this noise:
  
  - Many entries to wbc_entity_usage are from transclusion that only generates 
tracking categories (e.g., Category:Coordinates on Wikidata 
<https://en.wikipedia.org/wiki/Category:Coordinates_on_Wikidata>) so arguably 
there should be a way to mark events on Recent Changes caused by these as 
tracking-only so patrollers could easily ignore them.
  - Many entries to wbc_entity_usage are from metadata templates like Authority 
Control and Taxonbar that are very valuable from a linked-data perspective but 
less from a reader's perspective and have a very low potential for harmful 
vandalism. Because the way both of these templates are written, they also 
trigger a general "statements" aspect usage, so any changes to statements on 
the Wikidata item would trigger an event on recent changes. This adds a bunch 
of noise to the Recent Changes feed from Wikidata where these templates are 
used. Additionally, in reality, changes to Wikidata identifiers that impact 
Authority Control and Taxonbar have a very low likelihood of being problematic 
from a reader's perspective because the external links that are generated via 
these templates go to well-curated repositories of information so the reader 
should quickly realize the link is incorrect and probably won't end up viewing 
offensive material. Ideally these templates would be rewritten to only trigger 
the specific properties they transclude, but in practice I could see that being 
difficult, inefficient, or causing the wbc_entity_usage table to become far too 
large to be practical (as each usage of Authority Control would trigger close 
to 100 rows, 1 for each property that can be transcluded 
<https://en.wikipedia.org/wiki/Template:Authority_control#Wikidata>). Instead, 
maybe wbc_entity_usage could be expanded to distinguish between general 
statements (C.S?) and identifiers (C.I?)? This would make filtering out changes 
to identifiers far easier and metadata templates then could still be recorded 
simply without causing every change to date of birth, occupation, etc. to also 
trigger a change. Unfortunately, I suspect this would require making 
non-trivial changes to the Lua modules and then convincing template coders to 
adapt the code.
  - Some entries to wbc_entity_usage go to generating external links that could 
more clearly generate harm if vandalized and probably do warrant focus from 
patrollers. For instance, Wikidata templates that generate links to Commons 
categories or external links to IMDb etc. could more clearly be abused to link 
to offensive material. Thankfully, given the specific nature of these 
templates, they generally are recorded with their specific property and so 
don't generate noise for patrollers. That said, a not insignificant amount of 
their usage (on enwiki) is only for tracking categories, so any changes that 
would distinguish between actual transclusion and tracking categories would 
serve to reduce noise for this.
  - Finally, infobox transclusion has probably the greatest potential for harm 
(e.g., falsifying someone's age or where they were born). This seems to be 
tracked pretty well for most infoboxes (the specific properties each get their 
own row and labels for each item that was actually transcluded) so I think it's 
more about reducing the noise from the above so that patrollers can more easily 
see these changes.

TASK DETAIL
  https://phabricator.wikimedia.org/T246709

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Isaac
Cc: hoo, Ladsgroup, Lydia_Pintscher, Addshore, Capt_Swing, Isaac, Akuckartz, 
darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, 
_jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
_______________________________________________
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to