Re: [Wiki-research-l] Looking for help finding tools to measure UNESCO project

2015-10-07 Thread john cummings
Hi all

The Mix n' Match lists are amazing, very surprising that 1/4 of World
Heritage sites don't have an en.wiki article, I assume some of this is
where the WHS status is for an area combining more than one structure.

It appears Wikidata is the way to go, I will work some more on "is of
interest to UNESCO", the main list of topics initially would be:

   - World Heritage Sites
   - Man and the Biosphere reserves
   - Global Geoparks Network
   - Memory of the World Programme
   - Masterpieces of the Oral and Intangible Heritage of Humanity
   - International days observed by UNESCO
   - UNESCO prizes
   - The UNESCO art collection

Would Mix n' Match be the best way to add Wikidata items for each of the
things covered by these programmes? I can provide spreadsheets of items for
each with properties.

Thanks

John

On 6 October 2015 at 18:45, Magnus Manske 
wrote:

> I happen to work on a tool (initially for Liam Wyatt) that might do some
> of what you want on Wikidata. Given a Wikidata Query (separate topic ;-) or
> a simple list of Wikidata items, it can record changes made to these items
> over time. It records the JSON for the Wikidata items, max of one
> revision/day.
>
> A front-end (to be written) can then extract things like number of
> sitelinks (Wikipedia articles) for these items over time; Wikidata labels
> in different languages; number/type of statements added; etc. Ideally, this
> can be exported as a table, to make pretty stats in R (or the like).
>
> As I said, it's work in progress, but if you have a (initial) list of
> items, I can start "recording".
>
> On Tue, Oct 6, 2015 at 4:54 PM Andrew Gray 
> wrote:
>
>> On 6 October 2015 at 14:12, Amir E. Aharoni
>>  wrote:
>> > Thanks for this email.
>> >
>> > This raises a wider question: What is the comfortable way to compare the
>> > coverage of a topic in different languages?
>> >
>> > For example, I'd love to see a report that says:
>> >
>> > Number of articles about UNESCO cultural heritage:
>> > English Wikipedia: 1000
>> > French Wikipedia: 1200
>> > Hebrew Wikipedia: 742
>> > etc.
>> >
>> > And also to track this over time, so if somebody would work hard on
>> creating
>> > articles about UNESCO cultural heritage in Hebrew, I'd see a trend
>> graph.
>>
>> There's two general approaches to this:
>>
>> a) On Wikidata
>> b) On the individual wikis
>>
>> Approach (a) would rely on having a defined set of things in Wikidata
>> that we can identify. For example, "is a World Heritage Site" would be
>> easy enough, since we have a property explicitly dealing with WHS
>> identifiers (and we have 100% coverage in Wikidata). "Is of interest
>> to UNESCO" is a trickier one - but if you can construct a suitable
>> Wikidata query...
>>
>> As Federico notes, for WHS records, we can generate a report like
>> https://tools.wmflabs.org/mix-n-match/?mode=sitestats=93
>> (57.4% coverage on hewiki!). No graphs but if you were interested then
>> you could probably set one up without much work.
>>
>> b) is more useful for fuzzy groups like "of relevance to UNESCO",
>> since this is more or less perfect for a category system. However, it
>> would require examining the category tree for each WP you're
>> interested in to figure out exactly which categories are relevant, and
>> then running a script to count those daily.
>>
>> A.
>> --
>> - Andrew Gray
>>   andrew.g...@dunelm.org.uk
>>
>> ___
>> Wiki-research-l mailing list
>> Wiki-research-l@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>>
>
> ___
> Wiki-research-l mailing list
> Wiki-research-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>
>
___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l


Re: [Wiki-research-l] Looking for help finding tools to measure UNESCO project

2015-10-06 Thread Andrew Gray
On 6 October 2015 at 14:12, Amir E. Aharoni
 wrote:
> Thanks for this email.
>
> This raises a wider question: What is the comfortable way to compare the
> coverage of a topic in different languages?
>
> For example, I'd love to see a report that says:
>
> Number of articles about UNESCO cultural heritage:
> English Wikipedia: 1000
> French Wikipedia: 1200
> Hebrew Wikipedia: 742
> etc.
>
> And also to track this over time, so if somebody would work hard on creating
> articles about UNESCO cultural heritage in Hebrew, I'd see a trend graph.

There's two general approaches to this:

a) On Wikidata
b) On the individual wikis

Approach (a) would rely on having a defined set of things in Wikidata
that we can identify. For example, "is a World Heritage Site" would be
easy enough, since we have a property explicitly dealing with WHS
identifiers (and we have 100% coverage in Wikidata). "Is of interest
to UNESCO" is a trickier one - but if you can construct a suitable
Wikidata query...

As Federico notes, for WHS records, we can generate a report like
https://tools.wmflabs.org/mix-n-match/?mode=sitestats=93
(57.4% coverage on hewiki!). No graphs but if you were interested then
you could probably set one up without much work.

b) is more useful for fuzzy groups like "of relevance to UNESCO",
since this is more or less perfect for a category system. However, it
would require examining the category tree for each WP you're
interested in to figure out exactly which categories are relevant, and
then running a script to count those daily.

A.
-- 
- Andrew Gray
  andrew.g...@dunelm.org.uk

___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l


Re: [Wiki-research-l] Looking for help finding tools to measure UNESCO project

2015-10-06 Thread Federico Leva (Nemo)

Amir E. Aharoni, 06/10/2015 15:12:

This raises a wider question: What is the comfortable way to compare the
coverage of a topic in different languages?


https://tools.wmflabs.org/mix-n-match/ . Example: 
https://tools.wmflabs.org/mix-n-match/?mode=sitestats=17


Nemo

___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l


Re: [Wiki-research-l] Looking for help finding tools to measure UNESCO project

2015-10-06 Thread Amir E. Aharoni
Thanks for this email.

This raises a wider question: What is the comfortable way to compare the
coverage of a topic in different languages?

For example, I'd love to see a report that says:

Number of articles about UNESCO cultural heritage:
English Wikipedia: 1000
French Wikipedia: 1200
Hebrew Wikipedia: 742
etc.

And also to track this over time, so if somebody would work hard on
creating articles about UNESCO cultural heritage in Hebrew, I'd see a trend
graph.

Of course, this can be relevant to any topic.


--
Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
http://aharoni.wordpress.com
‪“We're living in pieces,
I want to live in peace.” – T. Moore‬

2015-10-02 13:43 GMT+03:00 john cummings :

> Hi All
>
> This is my first time posing on this list, I'm sorry if it is perhaps a
> little off topic. I'm currently Wikimedian in Residence at UNESCO and plan
> to run an online collaboration, a little bit like a short term Wikiproject
> with two main goals:
>
>
>- Help organise reuse of UNESCO content on Wikimedia projects (UNESCO
>has released content under an open license and will do more shortly).
>
>
>- Help improve content on Wikimedia of the subjects of UNESCO
>programmes e.g the World Heritage Sites.
>
>
> I have been planning ways that I can use tools to:
>
>
>- Organise work for contributors across all languages
>- Provide contributors feedback on their contributions (e.g page views
>for all contributions combined)
>- Measure success of the project.
>
>
> I've been doing this on wiki here
> https://en.wikipedia.org/wiki/User:John_Cummings/Planning_UNESCO_metrics
>
> In short I'm finding it very hard to find the tools needed, I have found
> less than a third of what I think would be helpful but found others that
> may be tangentially useful which I've added in.
>
> Any help would be appreciate, please feel free to comment here, on the
> talk page or just add tools to the fields
>
> Thanks
>
> John
>
> ___
> Wiki-research-l mailing list
> Wiki-research-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>
>
___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l


Re: [Wiki-research-l] Looking for help finding tools to measure UNESCO project

2015-10-06 Thread Magnus Manske
I happen to work on a tool (initially for Liam Wyatt) that might do some of
what you want on Wikidata. Given a Wikidata Query (separate topic ;-) or a
simple list of Wikidata items, it can record changes made to these items
over time. It records the JSON for the Wikidata items, max of one
revision/day.

A front-end (to be written) can then extract things like number of
sitelinks (Wikipedia articles) for these items over time; Wikidata labels
in different languages; number/type of statements added; etc. Ideally, this
can be exported as a table, to make pretty stats in R (or the like).

As I said, it's work in progress, but if you have a (initial) list of
items, I can start "recording".

On Tue, Oct 6, 2015 at 4:54 PM Andrew Gray 
wrote:

> On 6 October 2015 at 14:12, Amir E. Aharoni
>  wrote:
> > Thanks for this email.
> >
> > This raises a wider question: What is the comfortable way to compare the
> > coverage of a topic in different languages?
> >
> > For example, I'd love to see a report that says:
> >
> > Number of articles about UNESCO cultural heritage:
> > English Wikipedia: 1000
> > French Wikipedia: 1200
> > Hebrew Wikipedia: 742
> > etc.
> >
> > And also to track this over time, so if somebody would work hard on
> creating
> > articles about UNESCO cultural heritage in Hebrew, I'd see a trend graph.
>
> There's two general approaches to this:
>
> a) On Wikidata
> b) On the individual wikis
>
> Approach (a) would rely on having a defined set of things in Wikidata
> that we can identify. For example, "is a World Heritage Site" would be
> easy enough, since we have a property explicitly dealing with WHS
> identifiers (and we have 100% coverage in Wikidata). "Is of interest
> to UNESCO" is a trickier one - but if you can construct a suitable
> Wikidata query...
>
> As Federico notes, for WHS records, we can generate a report like
> https://tools.wmflabs.org/mix-n-match/?mode=sitestats=93
> (57.4% coverage on hewiki!). No graphs but if you were interested then
> you could probably set one up without much work.
>
> b) is more useful for fuzzy groups like "of relevance to UNESCO",
> since this is more or less perfect for a category system. However, it
> would require examining the category tree for each WP you're
> interested in to figure out exactly which categories are relevant, and
> then running a script to count those daily.
>
> A.
> --
> - Andrew Gray
>   andrew.g...@dunelm.org.uk
>
> ___
> Wiki-research-l mailing list
> Wiki-research-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>
___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l


Re: [Wiki-research-l] Looking for help finding tools to measure UNESCO project

2015-10-05 Thread Jonathan Morgan
Hi John,

I added one suggestion re: pages-in-a-category to your work page. Not sure
it was a particularly helpful one, though. Looking through your other
possible metrics, I see ways of gathering most of them via database and/or
API queries, but I don't know of many tools (other than the ones you've
already listed) that provide a GUI interface that allows you to get those
data without writing some queries or scripts yourself.

For queries related to Wikidata, you might try the new Wikidata Query
Service (web interface[1], manual[2]). And if anyone you're working with
knows any SQL or a scripting language like Python, you can gather most of
your participation metrics (pages edited, etc) via Quarry[3].

If you need a particular query run on a one-off basis, like "tell me how
many of the users in this list have created an article in this category",
post a request to this list and someone here may be able to run it for you.
If you want to learn how to run queries yourself, I can take an hour to get
you started. If you're able to take some extra time now to learn the ropes
of SQL, it will pay off in the future. We are blessed with a bunch of
awesome research tools, but as you've discovered there will always be
questions that you want to ask, that there isn't a tool for.

Hope that helps,
J

1. https://query.wikidata.org/
2. https://www.mediawiki.org/wiki/Wikidata_query_service/User_Manual
3. http://quarry.wmflabs.org/

On Fri, Oct 2, 2015 at 3:43 AM, john cummings 
wrote:

> Hi All
>
> This is my first time posing on this list, I'm sorry if it is perhaps a
> little off topic. I'm currently Wikimedian in Residence at UNESCO and plan
> to run an online collaboration, a little bit like a short term Wikiproject
> with two main goals:
>
>
>- Help organise reuse of UNESCO content on Wikimedia projects (UNESCO
>has released content under an open license and will do more shortly).
>
>
>- Help improve content on Wikimedia of the subjects of UNESCO
>programmes e.g the World Heritage Sites.
>
>
> I have been planning ways that I can use tools to:
>
>
>- Organise work for contributors across all languages
>- Provide contributors feedback on their contributions (e.g page views
>for all contributions combined)
>- Measure success of the project.
>
>
> I've been doing this on wiki here
> https://en.wikipedia.org/wiki/User:John_Cummings/Planning_UNESCO_metrics
>
> In short I'm finding it very hard to find the tools needed, I have found
> less than a third of what I think would be helpful but found others that
> may be tangentially useful which I've added in.
>
> Any help would be appreciate, please feel free to comment here, on the
> talk page or just add tools to the fields
>
> Thanks
>
> John
>
> ___
> Wiki-research-l mailing list
> Wiki-research-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>
>


-- 
Jonathan T. Morgan
Senior Design Researcher
Wikimedia Foundation
User:Jmorgan (WMF) 
___
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l