Re: [Wikidata-l] Visualisations of The Most Unique Wikipedias According to Wikidata

2013-06-19 Thread Denny Vrandečić
Can I have a statement about how much easier it would have been with
Wikidata? :)


2013/6/13 Brent Hecht 

> Hi all,
>
> In my (recently finished) thesis, I looked at a lot of different
> properties (e.g. topic, centrality, popularity via pageviews) of "common"
> and "unique" concepts across multilingual Wikipedia.
>
> It's all in Chapter 3:
> http://www-users.cs.umn.edu/~bhecht/publications/bhecht_thesis_final.pdf.
>
> A lot of these questions were addressed in the pre-Wikidata era :-)
>
> - Brent
>
> Brent Hecht, Ph.D.
> Assistant Professor
> Department of Computer Science and Engineering
> University of Minnesota
> e: bhe...@cs.umn.edu
> t: @bhecht
> w: http://www-users.cs.umn.edu/~bhecht/
>
> On Jun 13, 2013, at 12:33 PM, "Klein,Max"  wrote:
>
> > That's an excellent recommendation. I will attempt to research the
> common properties of the least unique Wikidata items.
> >
> > Maximilian Klein
> > Wikipedian in Residence, OCLC
> > +17074787023
> >
> > 
> > From: wikidata-l-boun...@lists.wikimedia.org on behalf of Paul A. Houle
> > Sent: Thursday, June 13, 2013 6:57 AM
> > To: Discussion list for the Wikidata project.
> > Subject: Re: [Wikidata-l] Visualisations of The Most Unique
> Wikipedias  According to Wikidata
> >
> >I think Poland may do better than average because Polish people,  out
> of
> > national pride,  have made a special effort to be well documented in
> English
> > Wikipedia and represent a Polish point-of-view on topics like the city of
> > Gdansk.
> >
> >   One fascinating thing about Wikidata is that it provides access to all
> of
> > the wonderful concepts shared in the Wikiverse,  so now sites like
> Ookaboo
> > can collect pictures of many beautiful places that don't exist in en
> > Wikipedia.
> >
> >  On the other hand I'm also interested in the other end of the curve,
> > those elite concepts which are represented widely across the Wikipedias.
> > Surely this is connected with subjective importance,  with some flavor
> > towards "global" appeal,  whatever that would turn out to mean.  Any
> chance
> > you could run a report on those?
> >
> >
> > -Original Message-
> > From: Mathieu Stumpf
> > Sent: Thursday, June 13, 2013 4:51 AM
> > To: wikidata-l@lists.wikimedia.org
> > Subject: Re: [Wikidata-l] Visualisations of The Most Unique Wikipedias
> > According to Wikidata
> >
> > Le 2013-06-12 22:22, Klein,Max a écrit :
> >> Hello Wikidatians,
> >>
> >> I made a few visualizations of the distributions of language links
> >> in Wikidata Items. You can also use these stats to see which Items
> >> represent wikipedia articles which are unique to a language and
> >> compare the uniquenesses of all languages. Also I investigate all the
> >> items with just two language links, to look at Wikipedia "pairs"
> >>
> >> See the full analysis:
> >>
> http://notconfusing.com/the-most-unique-wikipedias-according-to-wikidata/
> >> [1]
> >
> > Interesting! Could you also create that kind of visualisations by
> > topics : how much uniqueness come from biographies of local football
> > people, compared with history events or abstract concepts ?
> >
> > Also, in a completly unrelated topic, you may explain me in private
> > what you mean with "Create a communal house to live in" which is in your
> > public todo list, it sounds interesting. :P
> >
> >
> > --
> > Association Culture-Libre
> > http://www.culture-libre.org/
> >
> > ___
> > Wikidata-l mailing list
> > Wikidata-l@lists.wikimedia.org
> > https://lists.wikimedia.org/mailman/listinfo/wikidata-l
> >
> >
> > ___
> > Wikidata-l mailing list
> > Wikidata-l@lists.wikimedia.org
> > https://lists.wikimedia.org/mailman/listinfo/wikidata-l
> >
> >
> > ___
> > Wikidata-l mailing list
> > Wikidata-l@lists.wikimedia.org
> > https://lists.wikimedia.org/mailman/listinfo/wikidata-l
>
>
> ___
> Wikidata-l mailing list
> Wikidata-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata-l
>



-- 
Project director Wikidata
Wikimedia Deutschland e.V. | Obentrautstr. 72 | 10963 Berlin
Tel. +49-30-219 158 26-0 | http://wikimedia.de

Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e.V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
der Nummer 23855 B. Als gemeinnützig anerkannt durch das Finanzamt für
Körperschaften I Berlin, Steuernummer 27/681/51985.
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] Visualisations of The Most Unique Wikipedias According to Wikidata

2013-06-14 Thread Brent Hecht
Hi all,

In my (recently finished) thesis, I looked at a lot of different properties 
(e.g. topic, centrality, popularity via pageviews) of "common" and "unique" 
concepts across multilingual Wikipedia. 

It's all in Chapter 3: 
http://www-users.cs.umn.edu/~bhecht/publications/bhecht_thesis_final.pdf. 

A lot of these questions were addressed in the pre-Wikidata era :-)

- Brent

Brent Hecht, Ph.D.
Assistant Professor
Department of Computer Science and Engineering
University of Minnesota
e: bhe...@cs.umn.edu
t: @bhecht
w: http://www-users.cs.umn.edu/~bhecht/

On Jun 13, 2013, at 12:33 PM, "Klein,Max"  wrote:

> That's an excellent recommendation. I will attempt to research the common 
> properties of the least unique Wikidata items.
> 
> Maximilian Klein
> Wikipedian in Residence, OCLC
> +17074787023
> 
> 
> From: wikidata-l-boun...@lists.wikimedia.org on behalf of Paul A. Houle
> Sent: Thursday, June 13, 2013 6:57 AM
> To: Discussion list for the Wikidata project.
> Subject: Re: [Wikidata-l] Visualisations of The Most Unique Wikipedias
>   According to Wikidata
> 
>I think Poland may do better than average because Polish people,  out of
> national pride,  have made a special effort to be well documented in English
> Wikipedia and represent a Polish point-of-view on topics like the city of
> Gdansk.
> 
>   One fascinating thing about Wikidata is that it provides access to all of
> the wonderful concepts shared in the Wikiverse,  so now sites like Ookaboo
> can collect pictures of many beautiful places that don't exist in en
> Wikipedia.
> 
>  On the other hand I'm also interested in the other end of the curve,
> those elite concepts which are represented widely across the Wikipedias.
> Surely this is connected with subjective importance,  with some flavor
> towards "global" appeal,  whatever that would turn out to mean.  Any chance
> you could run a report on those?
> 
> 
> -Original Message-
> From: Mathieu Stumpf
> Sent: Thursday, June 13, 2013 4:51 AM
> To: wikidata-l@lists.wikimedia.org
> Subject: Re: [Wikidata-l] Visualisations of The Most Unique Wikipedias
> According to Wikidata
> 
> Le 2013-06-12 22:22, Klein,Max a écrit :
>> Hello Wikidatians,
>> 
>> I made a few visualizations of the distributions of language links
>> in Wikidata Items. You can also use these stats to see which Items
>> represent wikipedia articles which are unique to a language and
>> compare the uniquenesses of all languages. Also I investigate all the
>> items with just two language links, to look at Wikipedia "pairs"
>> 
>> See the full analysis:
>> http://notconfusing.com/the-most-unique-wikipedias-according-to-wikidata/
>> [1]
> 
> Interesting! Could you also create that kind of visualisations by
> topics : how much uniqueness come from biographies of local football
> people, compared with history events or abstract concepts ?
> 
> Also, in a completly unrelated topic, you may explain me in private
> what you mean with "Create a communal house to live in" which is in your
> public todo list, it sounds interesting. :P
> 
> 
> --
> Association Culture-Libre
> http://www.culture-libre.org/
> 
> ___
> Wikidata-l mailing list
> Wikidata-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata-l
> 
> 
> ___
> Wikidata-l mailing list
> Wikidata-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata-l
> 
> 
> ___
> Wikidata-l mailing list
> Wikidata-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata-l


___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] Visualisations of The Most Unique Wikipedias According to Wikidata

2013-06-13 Thread Klein,Max
That's an excellent recommendation. I will attempt to research the common 
properties of the least unique Wikidata items.

Maximilian Klein
Wikipedian in Residence, OCLC
+17074787023


From: wikidata-l-boun...@lists.wikimedia.org on behalf of Paul A. Houle
Sent: Thursday, June 13, 2013 6:57 AM
To: Discussion list for the Wikidata project.
Subject: Re: [Wikidata-l] Visualisations of The Most Unique Wikipedias  
According to Wikidata

I think Poland may do better than average because Polish people,  out of
national pride,  have made a special effort to be well documented in English
Wikipedia and represent a Polish point-of-view on topics like the city of
Gdansk.

   One fascinating thing about Wikidata is that it provides access to all of
the wonderful concepts shared in the Wikiverse,  so now sites like Ookaboo
can collect pictures of many beautiful places that don't exist in en
Wikipedia.

  On the other hand I'm also interested in the other end of the curve,
those elite concepts which are represented widely across the Wikipedias.
Surely this is connected with subjective importance,  with some flavor
towards "global" appeal,  whatever that would turn out to mean.  Any chance
you could run a report on those?


-Original Message-
From: Mathieu Stumpf
Sent: Thursday, June 13, 2013 4:51 AM
To: wikidata-l@lists.wikimedia.org
Subject: Re: [Wikidata-l] Visualisations of The Most Unique Wikipedias
According to Wikidata

Le 2013-06-12 22:22, Klein,Max a écrit :
> Hello Wikidatians,
>
>  I made a few visualizations of the distributions of language links
> in Wikidata Items. You can also use these stats to see which Items
> represent wikipedia articles which are unique to a language and
> compare the uniquenesses of all languages. Also I investigate all the
> items with just two language links, to look at Wikipedia "pairs"
>
>  See the full analysis:
>  http://notconfusing.com/the-most-unique-wikipedias-according-to-wikidata/
> [1]

Interesting! Could you also create that kind of visualisations by
topics : how much uniqueness come from biographies of local football
people, compared with history events or abstract concepts ?

Also, in a completly unrelated topic, you may explain me in private
what you mean with "Create a communal house to live in" which is in your
public todo list, it sounds interesting. :P


--
Association Culture-Libre
http://www.culture-libre.org/

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] Visualisations of The Most Unique Wikipedias According to Wikidata

2013-06-13 Thread Federico Leva (Nemo)
Cebuano and Waray are definitely outliers because they're 
bot-Wikipedias, 70 and 95 % articles bot created respectively. 

sv should soon reach about 75 % bot creations and nl is rather stable 
around 50-60 %, so that explains most weird clusters.
For your left to right ordering "by size", you should use "Usage" rather 
than number of articles, because when they differ too much there's 
something wrong. http://stats.wikimedia.org/EN/Sitemap.htm
For instance, of the top 11-20 Wikipedias by number of articles only 2 
are in the official www.wikipedia.org top 20 (which is by usage).


Nemo

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] Visualisations of The Most Unique Wikipedias According to Wikidata

2013-06-13 Thread Paul A. Houle
   I think Poland may do better than average because Polish people,  out of 
national pride,  have made a special effort to be well documented in English 
Wikipedia and represent a Polish point-of-view on topics like the city of 
Gdansk.


  One fascinating thing about Wikidata is that it provides access to all of 
the wonderful concepts shared in the Wikiverse,  so now sites like Ookaboo 
can collect pictures of many beautiful places that don't exist in en 
Wikipedia.


 On the other hand I'm also interested in the other end of the curve, 
those elite concepts which are represented widely across the Wikipedias. 
Surely this is connected with subjective importance,  with some flavor 
towards "global" appeal,  whatever that would turn out to mean.  Any chance 
you could run a report on those?



-Original Message- 
From: Mathieu Stumpf

Sent: Thursday, June 13, 2013 4:51 AM
To: wikidata-l@lists.wikimedia.org
Subject: Re: [Wikidata-l] Visualisations of The Most Unique Wikipedias 
According to Wikidata


Le 2013-06-12 22:22, Klein,Max a écrit :

Hello Wikidatians,

 I made a few visualizations of the distributions of language links
in Wikidata Items. You can also use these stats to see which Items
represent wikipedia articles which are unique to a language and
compare the uniquenesses of all languages. Also I investigate all the
items with just two language links, to look at Wikipedia "pairs"

 See the full analysis:
 http://notconfusing.com/the-most-unique-wikipedias-according-to-wikidata/ 
[1]


Interesting! Could you also create that kind of visualisations by
topics : how much uniqueness come from biographies of local football
people, compared with history events or abstract concepts ?

Also, in a completly unrelated topic, you may explain me in private
what you mean with "Create a communal house to live in" which is in your
public todo list, it sounds interesting. :P


--
Association Culture-Libre
http://www.culture-libre.org/

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l 



___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] Visualisations of The Most Unique Wikipedias According to Wikidata

2013-06-13 Thread Jane Darnell
Max,
Nice work! It takes some study to read your graphs, but they are
indeed fascinating. I am sure though that given time these will
fluctuate a great deal. My gut feeling on the small wiki side of
things is that there are lots of broken interwiki links because there
are not enough people fixing those for small wikis. Thus, the amount
of uniqueness on the small wiki side might be off by quite a bit.
Something else I noticed just from working on Wiki Loves Monuments, is
that most of the large wikipedia projects have a fairly dense coverage
of topics by disambiguation pages, whereas the smaller projects have
only got the local version, so for example, the article for "monument"
may be split into many articles on the English wikipedia (Monument,
National monument, Monument historique, etc), but in Slovenian there
is just one, and so forth. It would be nice to see "Article trees"
somehow, using disambiguation pages as startoff points.
Jane

2013/6/13, Mathieu Stumpf :
> Le 2013-06-12 22:22, Klein,Max a écrit :
>> Hello Wikidatians,
>>
>>  I made a few visualizations of the distributions of language links
>> in Wikidata Items. You can also use these stats to see which Items
>> represent wikipedia articles which are unique to a language and
>> compare the uniquenesses of all languages. Also I investigate all the
>> items with just two language links, to look at Wikipedia "pairs"
>>
>>  See the full analysis:
>>
>> http://notconfusing.com/the-most-unique-wikipedias-according-to-wikidata/
>> [1]
>
> Interesting! Could you also create that kind of visualisations by
> topics : how much uniqueness come from biographies of local football
> people, compared with history events or abstract concepts ?
>
> Also, in a completly unrelated topic, you may explain me in private
> what you mean with "Create a communal house to live in" which is in your
> public todo list, it sounds interesting. :P
>
>
> --
> Association Culture-Libre
> http://www.culture-libre.org/
>
> ___
> Wikidata-l mailing list
> Wikidata-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata-l
>

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] Visualisations of The Most Unique Wikipedias According to Wikidata

2013-06-13 Thread Mathieu Stumpf

Le 2013-06-12 22:22, Klein,Max a écrit :

Hello Wikidatians,

 I made a few visualizations of the distributions of language links
in Wikidata Items. You can also use these stats to see which Items
represent wikipedia articles which are unique to a language and
compare the uniquenesses of all languages. Also I investigate all the
items with just two language links, to look at Wikipedia "pairs"

 See the full analysis:
 
http://notconfusing.com/the-most-unique-wikipedias-according-to-wikidata/ 
[1]


Interesting! Could you also create that kind of visualisations by 
topics : how much uniqueness come from biographies of local football 
people, compared with history events or abstract concepts ?


Also, in a completly unrelated topic, you may explain me in private 
what you mean with "Create a communal house to live in" which is in your 
public todo list, it sounds interesting. :P



--
Association Culture-Libre
http://www.culture-libre.org/

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] Visualisations of The Most Unique Wikipedias According to Wikidata

2013-06-12 Thread Johan Jönsson
2013/6/12 Klein,Max 

>  Hello Wikidatians,
>
> I made a few visualizations of the distributions of language links in
> Wikidata Items. You can also use these stats to see which Items represent
> wikipedia articles which are unique to a language and compare the
> uniquenesses of all languages. Also I investigate all the items with just
> two language links, to look at Wikipedia "pairs"
>
>
The Cebuano-Waray-Swedish cluster is due to the fact that Lsjbot (
http://sv.wikipedia.org/wiki/Anv%C3%A4ndare:Lsjbot) has created a fair
share of the articles on all three of them (and, yes, mainly about taxons).

//Johan Jönsson
--
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


[Wikidata-l] Visualisations of The Most Unique Wikipedias According to Wikidata

2013-06-12 Thread Klein,Max
Hello Wikidatians,

I made a few visualizations of the distributions of language links in Wikidata 
Items. You can also use these stats to see which Items represent wikipedia 
articles which are unique to a language and compare the uniquenesses of all 
languages. Also I investigate all the items with just two language links, to 
look at Wikipedia "pairs"

See the full analysis:
http://notconfusing.com/the-most-unique-wikipedias-according-to-wikidata/

Some sample visualisations:
[http://notconfusing.com/wp-content/uploads/2013/06/Composition_zoom-effect-1024x577.png]
[http://notconfusing.com/wp-content/uploads/2013/06/LangLinks_log-1024x577.png]
[http://notconfusing.com/wp-content/uploads/2013/06/Uniquenesses-1024x577.png]



Maximilian Klein
Wikipedian in Residence, OCLC
+17074787023
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l