Re: [Wikimedia-l] Structured data ethical implications

2019-05-23 Thread Jonathan Morgan
Hi Mister Thrapostibongles,

This is a good point and a valid consideration. WMF is starting to think
about issues like this, and what tools we have available to mitigate
unintended consequences of AI tech (even in cases where we're not building
the AI tech itself, but rather providing training data). I wrote up a white
paper on this topic recently, in consultation with some other folks in
research, product, and legal. This isn't a policy (yet), just a proposal and a
conversation starter. Feedback and discussion welcome!

Best,
Jonathan



On Sun, May 12, 2019 at 1:50 AM Mister Thrapostibongles <
thrapostibong...@gmail.com> wrote:

> Dear all,
> There have been announcements about the Structured data project on Commons,
> which is intended to make it easier to view, search, edit, organize and
> re-use the metadata on media.  This is clearly of great value to
> researchers and developers in image recognition, who will have a large
> repository of tagged image files to train their AI implementations on.
>
> There is however an ethical issue here.  Readers will recall that Google
> discovered that its facial recognition software was prone to classifying
> African-American faces as "gorilla", because the training dataset had not
> contained enough non-white faces -- see for example The Verge
>
> https://www.theverge.com/2018/1/12/16882408/google-racist-gorillas-photo-recognition-algorithm-ai
>
>
> Is the Foundation confident that the Commons repository is sufficiently
> diverse that it can ethically offer it to others as a source of training
> data?
>
> Thrapostibongles



-- 
Jonathan T. Morgan
Senior Design Researcher
Wikimedia Foundation
User:Jmorgan (WMF) 
(Uses He/Him)
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines and 
https://meta.wikimedia.org/wiki/Wikimedia-l
New messages to: Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 


[Wikimedia-l] Structured data ethical implications

2019-05-12 Thread Mister Thrapostibongles
Dear all,
There have been announcements about the Structured data project on Commons,
which is intended to make it easier to view, search, edit, organize and
re-use the metadata on media.  This is clearly of great value to
researchers and developers in image recognition, who will have a large
repository of tagged image files to train their AI implementations on.

There is however an ethical issue here.  Readers will recall that Google
discovered that its facial recognition software was prone to classifying
African-American faces as "gorilla", because the training dataset had not
contained enough non-white faces -- see for example The Verge
https://www.theverge.com/2018/1/12/16882408/google-racist-gorillas-photo-recognition-algorithm-ai


Is the Foundation confident that the Commons repository is sufficiently
diverse that it can ethically offer it to others as a source of training
data?

Thrapostibongles


Re: [Wikimedia-l] Structured Data on Commons feedback - What gets stored where (Ontology)

2018-02-22 Thread Keegan Peterzell
Hello,

On Thu, Feb 15, 2018 at 4:34 PM, Keegan Peterzell wrote:

> Greetings,
>
> There is a new feedback request up on Wikimedia Commons regarding
> Structured Data on Commons. The topic is a very important discussion:
> between wikitext-in-Mediawiki, Wikibase on Commons, and Wikibase on
> Wikidata, what file metadata gets stored where?
>
> The discussion is here:
> https://commons.wikimedia.org/wiki/Commons:Structured_data/Get_involved/Feedback_requests/Ontology [0]
>
> It will formally run for two weeks, closing on 1 March. No decisions will be
> made at that time; this is part of the information-gathering process needed
> to make informed decisions.
>
> Thank you for your time, see you on the wiki.
>
> 0. Plaintext link:
> < https://commons.wikimedia.org/wiki/Commons:Structured_data/Get_involved/Feedback_requests/Ontology >
>
> --
> Keegan Peterzell
> Technical Collaboration Specialist
> Wikimedia Foundation
>

A friendly reminder that this discussion about an important aspect of
Structured Data on Commons runs for one more week.

The link one more time, for those whose email clients may have collapsed
the quoted text:
https://commons.wikimedia.org/wiki/Commons:Structured_data/Get_involved/Feedback_requests/Ontology

The ongoing discussion:
https://commons.wikimedia.org/wiki/Commons_talk:Structured_data/Get_involved/Feedback_requests/Ontology

Thank you.

-- 
Keegan Peterzell
Technical Collaboration Specialist
Wikimedia Foundation


[Wikimedia-l] Structured Data on Commons feedback - What gets stored where (Ontology)

2018-02-15 Thread Keegan Peterzell
Greetings,

There is a new feedback request up on Wikimedia Commons regarding
Structured Data on Commons. The topic is a very important discussion:
between wikitext-in-Mediawiki, Wikibase on Commons, and Wikibase on
Wikidata, what file metadata gets stored where?

The discussion is here:
https://commons.wikimedia.org/wiki/Commons:Structured_data/Get_involved/Feedback_requests/Ontology
[0]

It will formally run for two weeks, closing on 1 March. No decisions will be
made at that time; this is part of the information-gathering process needed
to make informed decisions.

Thank you for your time, see you on the wiki.

0. Plaintext link: <
https://commons.wikimedia.org/wiki/Commons:Structured_data/Get_involved/Feedback_requests/Ontology
>

-- 
Keegan Peterzell
Technical Collaboration Specialist
Wikimedia Foundation


Re: [Wikimedia-l] Structured Data

2013-06-02 Thread Federico Leva (Nemo)

Adam Baso, 31/05/2013 20:39:

http://googlewebmastercentral.blogspot.com/2013/05/getting-started-with-structured-data.html


I've tried the helper but I didn't find many tags of use (they already 
know that stuff about Wikipedia). However, some of those schema.org tags 
(like "Article") could be added to MediaWiki maybe?

We've seen some requests:
https://bugzilla.wikimedia.org/29968
https://bugzilla.wikimedia.org/28776
Also: 



Nemo

___
Wikimedia-l mailing list
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l


[Wikimedia-l] Structured Data

2013-05-31 Thread Adam Baso
http://googlewebmastercentral.blogspot.com/2013/05/getting-started-with-structured-data.html
