Re: [Xmldatadumps-l] [Wikitech-l] XML dumps/Media mirrors update

2012-05-17 Thread Platonides
On 17/05/12 14:23, Ariel T. Glenn wrote: > There's a few other reasons articles get deleted: copyright issues, > personal identifying data, etc. This makes maintaning the sort of > mirror you propose problematic, although a similar mirror is here: > http://deletionpedia.dbatley.com/w/index.php?tit

Re: [Xmldatadumps-l] [Wikitech-l] XML dumps/Media mirrors update

2012-05-17 Thread Ariel T. Glenn
There's a few other reasons articles get deleted: copyright issues, personal identifying data, etc. This makes maintaning the sort of mirror you propose problematic, although a similar mirror is here: http://deletionpedia.dbatley.com/w/index.php?title=Main_Page The dumps contain only data publica

Re: [Xmldatadumps-l] [Wikitech-l] XML dumps/Media mirrors update

2012-05-17 Thread Mike Dupont
Hi, I am thinking about how to collect articles deleted based on the "not notable" criteria, is there any way we can extract them from the mysql binlogs? how are these mirrors working? I would be interested in setting up a mirror of deleted data, at least that which is not spam/vandalism based on t

Re: [Xmldatadumps-l] [Wikitech-l] XML dumps/Media mirrors update

2012-05-17 Thread emijrp
Good work. We are approaching finally to an indestructible corpus of knowledge. 2012/5/17 Ariel T. Glenn > We now have three mirror sites, yay! The full list is linked to from > http://dumps.wikimedia.org/ and is also available at > > http://meta.wikimedia.org/wiki/Mirroring_Wikimedia_project_X

[Xmldatadumps-l] XML dumps/Media mirrors update

2012-05-17 Thread Ariel T. Glenn
We now have three mirror sites, yay! The full list is linked to from http://dumps.wikimedia.org/ and is also available at http://meta.wikimedia.org/wiki/Mirroring_Wikimedia_project_XML_dumps#Current_Mirrors Summarizing, we have: C3L (Brazil) with the last 5 good known dumps, Masaryk University