Re: [Wikimedia-l] On toxic communities

Pharos Sun, 15 Nov 2015 10:37:52 -0800

The figure quoted is quite interesting, but do we have a comparable metric
for the Wikimedia projects? :


"... incidences of homophobia, sexism and racism ... have fallen to a combined
2 percent of all games"

2% sounds "low", but do we indeed know if this is better or worse than us?
Would our comparable metric be the % of bigoted comments per article, per
talk page discussion, per time that an editor spends at the project?  I
would think that encountering bigoted comments on 1 in 50 discussions would
still be pretty significant.

Thanks,
Pharos

On Sun, Nov 15, 2015 at 1:21 PM, Ziko van Dijk <zvand...@gmail.com> wrote:

> Hello,
>
> Just yesterday I had a long talk with a researcher about how to define
> and detect trolls on Wikipedia. E.g., whether "unintentional trolling"
> should be included or not.
>
> In my opinion, it is not possible to detect by machine trollism,
> unkindness, harassment, mobbing etc., maybe with the exception of
> swear words. A lot of turntaking, deviation from the topic and other
> phenomena can be experienced by the participants as positive or as
> negative. You might need to ask them, and even then they might not be
> aware of a problem that works through in subtlety. Also, third persons
> not involved in the conversation can be effected negatively (look at
> ... page X... and you know why you don't like to contribute there).
>
> Kind regards
> Ziko
>
>
> 2015-11-15 17:40 GMT+01:00 Katherine Casey <fluffernutter.w...@gmail.com>:
> > I'd be happy to offer my admin/oversighter experience and knowledge to
> help
> > you develop the labeling and such, Aaron! I just commented on Andreas's
> > proposal on the Community Wishlist, but to summarize here: I see a lot of
> > potential pitfalls in trying to handle/generalize this with machine
> > learning, but I also see a lot of potential value, and I think it's
> > something we should be investigating.
> >
> > -Fluffernutter
> >
> > On Sun, Nov 15, 2015 at 11:32 AM, Aaron Halfaker <
> ahalfa...@wikimedia.org>
> > wrote:
> >
> >> >
> >> > The League of Legends team collaborated with outside scientists to
> >> > analyse their dataset. I would love to see the Wikimedia Foundation
> >> engage
> >> > in a similar research project.
> >>
> >>
> >> Oh!  We are!  :) When we have time. :\ One of the projects that I'd
> like to
> >> see done, but I've struggled to find the time for is a common talk page
> >> parser[1] that could produce a dataset of talk page interactions.  I'd
> like
> >> this dataset to be easy to join to editor outcome measures.  E.g. there
> >> might be "aggressive" talk that we don't know is problematic until we
> see
> >> the kind of effect that it has on other conversation participants.
> >>
> >> Anyway, I want some powerful utilities and datasets out there to help
> >> academics look into this problem more easily.  For revscoring, I'd like
> to
> >> be able to take a set of talk page diffs, have them classified in Wiki
> >> labels[2] as "aggressive" and the build a model for ORES[3] to be used
> >> however people see fit.  You could then use ORES to do offline analysis
> of
> >> discussions for research.  You could use ORES to interrupt the a user
> >> before saving a change.  I'm sure there are other clever ideas that
> people
> >> have for what to do with such a model that I'm happy to enable it via
> the
> >> service.  The hard part is getting a good dataset labeled.
> >>
> >> If someone wants to invest some time and energy into this, I'm happy to
> >> work with you.  We'll need more than programming help.  We'll need a
> lot of
> >> help to figure out what dimensions we'll label talk page postings by
> and to
> >> do the actual labeling.
> >>
> >> 1. https://github.com/Ironholds/talk-parser
> >> 2. https://meta.wikimedia.org/wiki/Wiki_labels
> >> 3. https://meta.wikimedia.org/wiki/ORES
> >>
> >> On Sun, Nov 15, 2015 at 6:56 AM, Andreas Kolbe <jayen...@gmail.com>
> wrote:
> >>
> >> > On Sat, Nov 14, 2015 at 9:13 PM, Benjamin Lees <emufarm...@gmail.com>
> >> > wrote:
> >> >
> >> > > This article highlights the happier side of things, but it appears
> >> > > that Lin's approach also involved completely removing bad actors:
> >> > > "Some players have also asked why we've taken such an aggressive
> >> > > stance when we've been focused on reform; well, the key here is that
> >> > > for most players, reform approaches are quite effective. But, for a
> >> > > number of players, reform attempts have been very unsuccessful which
> >> > > forces us to remove some of these players from League entirely."[0]
> >> > >
> >> >
> >> >
> >> > Thanks for the added context, Benjamin. Of course, banning bad actors
> >> that
> >> > they consider unreformable is something Wikipedia admins have always
> done
> >> > as well.
> >> >
> >> > The League of Legends team began by building a dataset of interactions
> >> that
> >> > the community considered unacceptable, and then applied
> machine-learning
> >> to
> >> > that dataset.
> >> >
> >> > It occurs to me that the English Wikipedia has ready access to such a
> >> > dataset: it's the totality of revision-deleted and oversighted talk
> page
> >> > posts. The League of Legends team collaborated with outside
> scientists to
> >> > analyse their dataset. I would love to see the Wikimedia Foundation
> >> engage
> >> > in a similar research project.
> >> >
> >> > I've added this point to the community wishlist survey:
> >> >
> >> >
> >> >
> >>
> https://meta.wikimedia.org/wiki/2015_Community_Wishlist_Survey#Machine-learning_tool_to_reduce_toxic_talk_page_interactions
> >> >
> >> >
> >> >
> >> > > P.S. As Rupert noted, over 90% of LoL players are male (how much
> over
> >> > > 90%?).[1] It would be interesting to know whether this percentage
> has
> >> > > changed along with the improvements described in the article.
> >> > >
> >> >
> >> >
> >> > Indeed.
> >> > _______________________________________________
> >> > Wikimedia-l mailing list, guidelines at:
> >> > https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> >> > Wikimedia-l@lists.wikimedia.org
> >> > Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l
> ,
> >> > <mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe>
> >> >
> >> _______________________________________________
> >> Wikimedia-l mailing list, guidelines at:
> >> https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> >> Wikimedia-l@lists.wikimedia.org
> >> Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> >> <mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe>
> >>
> >
> >
> >
> > --
> > Karen Brown
> > user:Fluffernutter
> >
> > *Unless otherwise specified, any email sent from this address is in my
> > volunteer capacity and does not represent the views or wishes of the
> > Wikimedia Foundation*
> > _______________________________________________
> > Wikimedia-l mailing list, guidelines at:
> https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> > Wikimedia-l@lists.wikimedia.org
> > Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> <mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe>
>
> _______________________________________________
> Wikimedia-l mailing list, guidelines at:
> https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> Wikimedia-l@lists.wikimedia.org
> Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> <mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe>
>
_______________________________________________
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
<mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe>

Re: [Wikimedia-l] On toxic communities

Reply via email to