Re: [Wikitech-l] Set $wgUseCombinedLoginLink = false on WMF cluster?

2012-05-28 Thread K. Peachey
On Mon, May 28, 2012 at 12:43 PM, Rob Lanphier ro...@wikimedia.org wrote:
 or goes by with no comment.  In fact, we should probably just enable
 it on mediawiki.org if there's no objection on list.

Lets not?
A. Stuff like this should be the same no matter where you are on the cluster
B. The MW Wiki community is more the developer side of things so I
doubt there is as many people having issues understanding it
C. This whole lets test every little thing on MW Wiki seems
misplaced, since we are a smaller more aimed project wiki (Development
of MediaWiki and subsequent support)

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Re: [Wikitech-l] Desktop upload tool

2012-05-28 Thread Petr Bena
I would name it Snuggle, so that it's in harmony with Huggle :)

On Fri, May 25, 2012 at 5:55 PM, John phoenixoverr...@gmail.com wrote:
 One thing I have done quite a bit of was Image modification. Everything
 from watermark removal to coping to rotating. When Im on a spur of
 watermark removal, it would be awesome if I could just point the program to
 a directory of fixed files, set a upload summary and have the bot upload
 those files keeping the existing file information. (that is the biggest
 drawback of  http://commons.wikimedia.org/wiki/Commons:Tools/Commonist )

 John

 On Fri, May 25, 2012 at 11:47 AM, Platonides platoni...@gmail.com wrote:

 As some of you are already aware, I'm doing for this GSoC a Desktop tool
 for conveniently uploading images to commons.
  http://thread.gmane.org/gmane.org.wikimedia.wikilovesmonuments/2641

 How can you help with this?

 a) Propose a cool name for this tool


 b) Suggest new functionalities/requisites
 Either on http://www.mediawiki.org/wiki/User:Platonides/GSOC_proposal or
 in this thread.


 c) Volunteer for testing the program and giving early feedback
 (please reply directly to avoid spamming the mailing list with I can
 test it, too)


 ___
 Wikitech-l mailing list
 Wikitech-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikitech-l

 ___
 Wikitech-l mailing list
 Wikitech-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikitech-l

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] Desktop upload tool

2012-05-28 Thread Mike Dupont
How will it be different than the commonist? geotagging would be one
big feature and the ability to group photos with common tags would be
useful!
mike

On Mon, May 28, 2012 at 7:16 AM, Petr Bena benap...@gmail.com wrote:
 I would name it Snuggle, so that it's in harmony with Huggle :)

 On Fri, May 25, 2012 at 5:55 PM, John phoenixoverr...@gmail.com wrote:
 One thing I have done quite a bit of was Image modification. Everything
 from watermark removal to coping to rotating. When Im on a spur ofH 
 watermark removal, it would be awesome if I could just point the program to
 a directory of fixed files, set a upload summary and have the bot upload
 those files keeping the existing file information. (that is the biggest
 drawback of  http://commons.wikimedia.org/wiki/Commons:Tools/Commonist )

 John

 On Fri, May 25, 2012 at 11:47 AM, Platonides platoni...@gmail.com wrote:

 As some of you are already aware, I'm doing for this GSoC a Desktop tool
 for conveniently uploading images to commons.
  http://thread.gmane.org/gmane.org.wikimedia.wikilovesmonuments/2641

 How can you help with this?

 a) Propose a cool name for this tool


 b) Suggest new functionalities/requisites
 Either on http://www.mediawiki.org/wiki/User:Platonides/GSOC_proposal or
 in this thread.


 c) Volunteer for testing the program and giving early feedback
 (please reply directly to avoid spamming the mailing list with I can
 test it, too)


 ___
 Wikitech-l mailing list
 Wikitech-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikitech-l

 ___
 Wikitech-l mailing list
 Wikitech-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikitech-l

 ___
 Wikitech-l mailing list
 Wikitech-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikitech-l



-- 
James Michael DuPont
Member of Free Libre Open Source Software Kosova http://flossk.org
Contributor FOSM, the CC-BY-SA map of the world http://fosm.org
Mozilla Rep https://reps.mozilla.org/u/h4ck3rm1k3

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] Set $wgUseCombinedLoginLink = false on WMF cluster?

2012-05-28 Thread MZMcBride
Tim Starling wrote:
 On 28/05/12 03:50, Raimond Spekking wrote:
 Since MediaWiki 1.18 we have the variable $wgUseCombinedLoginLink
 [1] which is set to true per default.
 
 During edit workshops with students and seniors I registered that
 new editors are confused about the combined login page. They
 tried to register new accounts on the login page.
 
 Surely, these observations are not representative but I think
 that the usability could be improved by setting
 $wgUseCombinedLoginLink=false
 
 If I missed a prior discussion about this issue I apologize and
 would be happy if someone could point me to the discussion.
 
 Otherwise I suggest to set $wgUseCombinedLoginLink to false for
 all WMF wikis.
 
 Can't it be set to false by default?
 
 They were combined to start with because the forms were combined.
 Then the developer who split the forms was too lazy to fix the
 skins, so for years, everyone who wanted to create an account was
 forced to click through the login page. I always figured it was a
 bug. Why would anyone want them combined?

Someone might want the links combined if they present both the create
account and login forms on the same page. Wikia used to do this. Twitter and
Facebook do this currently at https://twitter.com/ and
https://www.facebook.com when you're not logged in. This is not a reason
to not flip the default on Wikimedia wikis, but it is perhaps a reason to
keep the configuration variable around.

I'll also note that creating an account is still considered a sub-type of
logging in (Special:UserLogin?type=signup), though Special:CreateAccount
properly redirects. I think it's a bit strange that they are not two
separate forms completely (Special:UserLogin and Special:CreateAccount).

MZMcBride



___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] Set $wgUseCombinedLoginLink = false on WMF cluster?

2012-05-28 Thread Raimond Spekking
Am 28.05.2012 18:02, schrieb MZMcBride:

 Someone might want the links combined if they present both the create
 account and login forms on the same page. Wikia used to do this. Twitter and
 Facebook do this currently at https://twitter.com/ and
 https://www.facebook.com when you're not logged in.

Yes, but both forms are better than our 2 forms because they have the
input fields for login and registration on _one_ page. IMO more clearly
for newbies.

 This is not a reason
 to not flip the default on Wikimedia wikis, but it is perhaps a reason to
 keep the configuration variable around.

Sure, keep the config var for b/c.

 
 I'll also note that creating an account is still considered a sub-type of
 logging in (Special:UserLogin?type=signup), though Special:CreateAccount
 properly redirects. I think it's a bit strange that they are not two
 separate forms completely (Special:UserLogin and Special:CreateAccount).

That's bug 15700 https://bugzilla.wikimedia.org/show_bug.cgi?id=15700

 
 MZMcBride

Raimond.



signature.asc
Description: OpenPGP digital signature
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Re: [Wikitech-l] Set $wgUseCombinedLoginLink = false on WMF cluster?

2012-05-28 Thread Daniel Friesen
On Sun, 27 May 2012 21:18:26 -0700, Tim Starling tstarl...@wikimedia.org  
wrote:



On 28/05/12 03:50, Raimond Spekking wrote:

Since MediaWiki 1.18 we have the variable $wgUseCombinedLoginLink
[1] which is set to true per default.

During edit workshops with students and seniors I registered that
new editors are confused about the combined login page. They
tried to register new accounts on the login page.

Surely, these observations are not representative but I think
that the usability could be improved by setting
$wgUseCombinedLoginLink=false

If I missed a prior discussion about this issue I apologize and
would be happy if someone could point me to the discussion.

Otherwise I suggest to set $wgUseCombinedLoginLink to false for
all WMF wikis.


Can't it be set to false by default?

They were combined to start with because the forms were combined.
Then the developer who split the forms was too lazy to fix the
skins, so for years, everyone who wanted to create an account was
forced to click through the login page. I always figured it was a
bug. Why would anyone want them combined?

-- Tim Starling


+1

I created $wgUseCombinedLoginLink partially due to skinning where some  
skins want to have separate links to style in different ways (Skin has a  
method that lets a skin override the setting for itself) and also looking  
at a number of other websites and finding that we are practically the only  
website that does something like that.


I set it to true by default because that was the current expected behavior  
and didn't want to bother with the usual backlash from the en.wp community  
whenever you touch anything to do with how MW looks or the markup it  
outputs.


But if we have found a usability reason to have separate links then I see  
absolutely no reason to set it false by default on all wiki.
In fact I'd almost say we could go ahead and delete the setting and make  
MW always have two separate links.


--
~Daniel Friesen (Dantman, Nadir-Seen-Fire) [http://daniel.friesen.name]

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] Set $wgUseCombinedLoginLink = false on WMF cluster?

2012-05-28 Thread Daniel Friesen

On Mon, 28 May 2012 09:02:35 -0700, MZMcBride z...@mzmcbride.com wrote:


Tim Starling wrote:

On 28/05/12 03:50, Raimond Spekking wrote:

Since MediaWiki 1.18 we have the variable $wgUseCombinedLoginLink
[1] which is set to true per default.

During edit workshops with students and seniors I registered that
new editors are confused about the combined login page. They
tried to register new accounts on the login page.

Surely, these observations are not representative but I think
that the usability could be improved by setting
$wgUseCombinedLoginLink=false

If I missed a prior discussion about this issue I apologize and
would be happy if someone could point me to the discussion.

Otherwise I suggest to set $wgUseCombinedLoginLink to false for
all WMF wikis.


Can't it be set to false by default?

They were combined to start with because the forms were combined.
Then the developer who split the forms was too lazy to fix the
skins, so for years, everyone who wanted to create an account was
forced to click through the login page. I always figured it was a
bug. Why would anyone want them combined?


Someone might want the links combined if they present both the create
account and login forms on the same page. Wikia used to do this. Twitter  
and

Facebook do this currently at https://twitter.com/ and
https://www.facebook.com when you're not logged in. This is not a  
reason

to not flip the default on Wikimedia wikis, but it is perhaps a reason to
keep the configuration variable around.


Facebook and Twitter do that on their homepage and don't even bother with  
login links. The design of those doesn't really fit discussion of login  
page links.


- To have a combined form like that you have to create a brand new special  
page; you can't do that within our current Special:UserLogin
- We don't provide a way to simply override the special page used for  
login links
Hence with these two facts that means it's impossible to make an alternate  
login form without replacing stuff inside the personal_urls by using hooks  
and constructing your own arrays to put in the list.
Wikia doesn't even bother with hooks, they just make core modifications  
for this stuff.


So I don't believe this or anything Wikia does on the topic is a valid  
reason to keep the configuration setting around. It's not needed for those  
kind of modifications.
Heck, it's because of one of Wikia's old skins that I split the links in  
the first place.



I'll also note that creating an account is still considered a sub-type of
logging in (Special:UserLogin?type=signup), though Special:CreateAccount
properly redirects. I think it's a bit strange that they are not two
separate forms completely (Special:UserLogin and Special:CreateAccount).

MZMcBride


I believe there are some backend bits that still link the two together.  
Though perhaps someone can still manage to make the create account form  
served from Special:CreateAccount while the backend is still in that state.
Though, this area of MW has probably been overdue for an overhaul for  
awhile.


--
~Daniel Friesen (Dantman, Nadir-Seen-Fire) [http://daniel.friesen.name]

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] [Xmldatadumps-l] XML dumps/Media mirrors update

2012-05-28 Thread Mike Dupont
first version of the Script is ready , it gets the versions, puts them
in a zip and puts that on archive.org
https://github.com/h4ck3rm1k3/pywikipediabot/blob/master/export_deleted.py

here is an example output :
http://archive.org/details/wikipedia-delete-2012-05
http://ia601203.us.archive.org/24/items/wikipedia-delete-2012-05/archive2012-05-28T21:34:02.302183.zip

I will cron this, and it should give a start of saving deleted data.
Articles will be exported once a day, even if they they were exported
yesterday as long as they are in one of the categories.

mike

On Mon, May 21, 2012 at 7:21 PM, Mike  Dupont
jamesmikedup...@googlemail.com wrote:
 Thanks! and run that 1 time per day, they dont get deleted that quickly.
 mike

 On Mon, May 21, 2012 at 9:11 PM, emijrp emi...@gmail.com wrote:
 Create a script that makes a request to Special:Export using this category
 as feed
 https://en.wikipedia.org/wiki/Category:Candidates_for_speedy_deletion

 More info https://www.mediawiki.org/wiki/Manual:Parameters_to_Special:Export


 2012/5/21 Mike Dupont jamesmikedup...@googlemail.com

 Well I whould be happy for items like this :
 http://en.wikipedia.org/wiki/Template:Db-a7
 would it be possible to extract them easily?
 mike

 On Thu, May 17, 2012 at 2:23 PM, Ariel T. Glenn ar...@wikimedia.org
 wrote:
  There's a few other reasons articles get deleted: copyright issues,
  personal identifying data, etc.  This makes maintaning the sort of
  mirror you propose problematic, although a similar mirror is here:
  http://deletionpedia.dbatley.com/w/index.php?title=Main_Page
 
  The dumps contain only data publically available at the time of the run,
  without deleted data.
 
  The articles aren't permanently deleted of course.  The revisions texts
  live on in the database, so a query on toolserver, for example, could be
  used to get at them, but that would need to be for research purposes.
 
  Ariel
 
  Στις 17-05-2012, ημέρα Πεμ, και ώρα 13:30 +0200, ο/η Mike Dupont έγραψε:
  Hi,
  I am thinking about how to collect articles deleted based on the not
  notable criteria,
  is there any way we can extract them from the mysql binlogs? how are
  these mirrors working? I would be interested in setting up a mirror of
  deleted data, at least that which is not spam/vandalism based on tags.
  mike
 
  On Thu, May 17, 2012 at 1:09 PM, Ariel T. Glenn ar...@wikimedia.org
  wrote:
   We now have three mirror sites, yay!  The full list is linked to from
   http://dumps.wikimedia.org/ and is also available at
  
   http://meta.wikimedia.org/wiki/Mirroring_Wikimedia_project_XML_dumps#Current_Mirrors
  
   Summarizing, we have:
  
   C3L (Brazil) with the last 5 good known dumps,
   Masaryk University (Czech Republic) with the last 5 known good dumps,
   Your.org (USA) with the complete archive of dumps, and
  
   for the latest version of uploaded media, Your.org with
   http/ftp/rsync
   access.
  
   Thanks to Carlos, Kevin and Yenya respectively at the above sites for
   volunteering space, time and effort to make this happen.
  
   As people noticed earlier, a series of media tarballs per-project
   (excluding commons) is being generated.  As soon as the first run of
   these is complete we'll announce its location and start generating
   them
   on a semi-regular basis.
  
   As we've been getting the bugs out of the mirroring setup, it is
   getting
   easier to add new locations.  Know anyone interested?  Please let us
   know; we would love to have them.
  
   Ariel
  
  
   ___
   Wikitech-l mailing list
   Wikitech-l@lists.wikimedia.org
   https://lists.wikimedia.org/mailman/listinfo/wikitech-l
 
 
 
 
 
 
  ___
  Wikitech-l mailing list
  Wikitech-l@lists.wikimedia.org
  https://lists.wikimedia.org/mailman/listinfo/wikitech-l



 --
 James Michael DuPont
 Member of Free Libre Open Source Software Kosova http://flossk.org
 Contributor FOSM, the CC-BY-SA map of the world http://fosm.org
 Mozilla Rep https://reps.mozilla.org/u/h4ck3rm1k3

 ___
 Wikitech-l mailing list
 Wikitech-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikitech-l




 --
 Emilio J. Rodríguez-Posada. E-mail: emijrp AT gmail DOT com
 Pre-doctoral student at the University of Cádiz (Spain)
 Projects: AVBOT | StatMediaWiki | WikiEvidens | WikiPapers | WikiTeam
 Personal website: https://sites.google.com/site/emijrp/


 ___
 Xmldatadumps-l mailing list
 xmldatadump...@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l




 --
 James Michael DuPont
 Member of Free Libre Open Source Software Kosova http://flossk.org
 Contributor FOSM, the CC-BY-SA map of the world http://fosm.org
 Mozilla Rep https://reps.mozilla.org/u/h4ck3rm1k3



-- 
James Michael DuPont
Member of Free Libre Open Source Software Kosova http://flossk.org
Contributor 

Re: [Wikitech-l] Set $wgUseCombinedLoginLink = false on WMF cluster?

2012-05-28 Thread Steven Walling
On Mon, May 28, 2012 at 12:39 PM, Daniel Friesen
li...@nadir-seen-fire.comwrote:

 - To have a combined form like that you have to create a brand new special
 page; you can't do that within our current Special:UserLogin
 - We don't provide a way to simply override the special page used for
 login links
 Hence with these two facts that means it's impossible to make an alternate
 login form without replacing stuff inside the personal_urls by using hooks
 and constructing your own arrays to put in the list.
 Wikia doesn't even bother with hooks, they just make core modifications
 for this stuff.


Just FYI, work is being done to decouple account creation from
Special:UserLogin. The initial SignupAPI extension was a GSOC project, and
it's currently undergoing code review etc. This should allow us to
implement some long overdue improvements, such as:
https://bugzilla.wikimedia.org/show_bug.cgi?id=34447

The relevant bug: https://bugzilla.wikimedia.org/show_bug.cgi?id=36225

Steven
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] Set $wgUseCombinedLoginLink = false on WMF cluster?

2012-05-28 Thread Happy Melon
On 28 May 2012 22:14, Steven Walling steven.wall...@gmail.com wrote:

 On Mon, May 28, 2012 at 12:39 PM, Daniel Friesen
 li...@nadir-seen-fire.comwrote:

  - To have a combined form like that you have to create a brand new
 special
  page; you can't do that within our current Special:UserLogin
  - We don't provide a way to simply override the special page used for
  login links
  Hence with these two facts that means it's impossible to make an
 alternate
  login form without replacing stuff inside the personal_urls by using
 hooks
  and constructing your own arrays to put in the list.
  Wikia doesn't even bother with hooks, they just make core modifications
  for this stuff.
 

 Just FYI, work is being done to decouple account creation from
 Special:UserLogin. The initial SignupAPI extension was a GSOC project, and
 it's currently undergoing code review etc. This should allow us to
 implement some long overdue improvements, such as:
 https://bugzilla.wikimedia.org/show_bug.cgi?id=34447

 The relevant bug: https://bugzilla.wikimedia.org/show_bug.cgi?id=36225


This corner of MediaWiki is horrifically old and rusty; I tried a big
refactoring a while ago but it broke CentralAuth and so got pulled a couple
of times.  I might have another go now we have labs up and running and we
can easily set up a SUL domain to match the cluster's.  SignupAPI basically
copied mountains of code from LoginForm, IIRC; wasn't a real refactoring
(could be mis-remembering though).

--HM
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] [Xmldatadumps-l] XML dumps/Media mirrors update

2012-05-28 Thread Hydriz Wikipedia
This is quite nice, though the item's metadata is too little :)

On Tue, May 29, 2012 at 3:40 AM, Mike Dupont jamesmikedup...@googlemail.com
 wrote:

 first version of the Script is ready , it gets the versions, puts them
 in a zip and puts that on archive.org
 https://github.com/h4ck3rm1k3/pywikipediabot/blob/master/export_deleted.py

 here is an example output :
 http://archive.org/details/wikipedia-delete-2012-05

 http://ia601203.us.archive.org/24/items/wikipedia-delete-2012-05/archive2012-05-28T21:34:02.302183.zip

 I will cron this, and it should give a start of saving deleted data.
 Articles will be exported once a day, even if they they were exported
 yesterday as long as they are in one of the categories.

 mike

 On Mon, May 21, 2012 at 7:21 PM, Mike  Dupont
 jamesmikedup...@googlemail.com wrote:
  Thanks! and run that 1 time per day, they dont get deleted that quickly.
  mike
 
  On Mon, May 21, 2012 at 9:11 PM, emijrp emi...@gmail.com wrote:
  Create a script that makes a request to Special:Export using this
 category
  as feed
  https://en.wikipedia.org/wiki/Category:Candidates_for_speedy_deletion
 
  More info
 https://www.mediawiki.org/wiki/Manual:Parameters_to_Special:Export
 
 
  2012/5/21 Mike Dupont jamesmikedup...@googlemail.com
 
  Well I whould be happy for items like this :
  http://en.wikipedia.org/wiki/Template:Db-a7
  would it be possible to extract them easily?
  mike
 
  On Thu, May 17, 2012 at 2:23 PM, Ariel T. Glenn ar...@wikimedia.org
  wrote:
   There's a few other reasons articles get deleted: copyright issues,
   personal identifying data, etc.  This makes maintaning the sort of
   mirror you propose problematic, although a similar mirror is here:
   http://deletionpedia.dbatley.com/w/index.php?title=Main_Page
  
   The dumps contain only data publically available at the time of the
 run,
   without deleted data.
  
   The articles aren't permanently deleted of course.  The revisions
 texts
   live on in the database, so a query on toolserver, for example,
 could be
   used to get at them, but that would need to be for research purposes.
  
   Ariel
  
   Στις 17-05-2012, ημέρα Πεμ, και ώρα 13:30 +0200, ο/η Mike Dupont
 έγραψε:
   Hi,
   I am thinking about how to collect articles deleted based on the
 not
   notable criteria,
   is there any way we can extract them from the mysql binlogs? how are
   these mirrors working? I would be interested in setting up a mirror
 of
   deleted data, at least that which is not spam/vandalism based on
 tags.
   mike
  
   On Thu, May 17, 2012 at 1:09 PM, Ariel T. Glenn 
 ar...@wikimedia.org
   wrote:
We now have three mirror sites, yay!  The full list is linked to
 from
http://dumps.wikimedia.org/ and is also available at
   
   
 http://meta.wikimedia.org/wiki/Mirroring_Wikimedia_project_XML_dumps#Current_Mirrors
   
Summarizing, we have:
   
C3L (Brazil) with the last 5 good known dumps,
Masaryk University (Czech Republic) with the last 5 known good
 dumps,
Your.org (USA) with the complete archive of dumps, and
   
for the latest version of uploaded media, Your.org with
http/ftp/rsync
access.
   
Thanks to Carlos, Kevin and Yenya respectively at the above sites
 for
volunteering space, time and effort to make this happen.
   
As people noticed earlier, a series of media tarballs per-project
(excluding commons) is being generated.  As soon as the first run
 of
these is complete we'll announce its location and start generating
them
on a semi-regular basis.
   
As we've been getting the bugs out of the mirroring setup, it is
getting
easier to add new locations.  Know anyone interested?  Please let
 us
know; we would love to have them.
   
Ariel
   
   
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l
  
  
  
  
  
  
   ___
   Wikitech-l mailing list
   Wikitech-l@lists.wikimedia.org
   https://lists.wikimedia.org/mailman/listinfo/wikitech-l
 
 
 
  --
  James Michael DuPont
  Member of Free Libre Open Source Software Kosova http://flossk.org
  Contributor FOSM, the CC-BY-SA map of the world http://fosm.org
  Mozilla Rep https://reps.mozilla.org/u/h4ck3rm1k3
 
  ___
  Wikitech-l mailing list
  Wikitech-l@lists.wikimedia.org
  https://lists.wikimedia.org/mailman/listinfo/wikitech-l
 
 
 
 
  --
  Emilio J. Rodríguez-Posada. E-mail: emijrp AT gmail DOT com
  Pre-doctoral student at the University of Cádiz (Spain)
  Projects: AVBOT | StatMediaWiki | WikiEvidens | WikiPapers | WikiTeam
  Personal website: https://sites.google.com/site/emijrp/
 
 
  ___
  Xmldatadumps-l mailing list
  xmldatadump...@lists.wikimedia.org
  https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
 
 
 
 
  --
  James 

Re: [Wikitech-l] [Xmldatadumps-l] XML dumps/Media mirrors update

2012-05-28 Thread Mike Dupont
Well, I have now updated the script to include  the xml dump in raw
format. I will have to add more information the achive.org item, at
least a basic readme.
other thing is that the wikipybot does not support the full history it
seems, so that I will have to move over to the wikiteam version and
rework it,
I just spent 2 hours on this so i am pretty happy for the first version.

mike

On Tue, May 29, 2012 at 1:52 AM, Hydriz Wikipedia ad...@alphacorp.tk wrote:
 This is quite nice, though the item's metadata is too little :)

 On Tue, May 29, 2012 at 3:40 AM, Mike Dupont jamesmikedup...@googlemail.com
 wrote:

 first version of the Script is ready , it gets the versions, puts them
 in a zip and puts that on archive.org
 https://github.com/h4ck3rm1k3/pywikipediabot/blob/master/export_deleted.py

 here is an example output :
 http://archive.org/details/wikipedia-delete-2012-05

 http://ia601203.us.archive.org/24/items/wikipedia-delete-2012-05/archive2012-05-28T21:34:02.302183.zip

 I will cron this, and it should give a start of saving deleted data.
 Articles will be exported once a day, even if they they were exported
 yesterday as long as they are in one of the categories.

 mike

 On Mon, May 21, 2012 at 7:21 PM, Mike  Dupont
 jamesmikedup...@googlemail.com wrote:
  Thanks! and run that 1 time per day, they dont get deleted that quickly.
  mike
 
  On Mon, May 21, 2012 at 9:11 PM, emijrp emi...@gmail.com wrote:
  Create a script that makes a request to Special:Export using this
 category
  as feed
  https://en.wikipedia.org/wiki/Category:Candidates_for_speedy_deletion
 
  More info
 https://www.mediawiki.org/wiki/Manual:Parameters_to_Special:Export
 
 
  2012/5/21 Mike Dupont jamesmikedup...@googlemail.com
 
  Well I whould be happy for items like this :
  http://en.wikipedia.org/wiki/Template:Db-a7
  would it be possible to extract them easily?
  mike
 
  On Thu, May 17, 2012 at 2:23 PM, Ariel T. Glenn ar...@wikimedia.org
  wrote:
   There's a few other reasons articles get deleted: copyright issues,
   personal identifying data, etc.  This makes maintaning the sort of
   mirror you propose problematic, although a similar mirror is here:
   http://deletionpedia.dbatley.com/w/index.php?title=Main_Page
  
   The dumps contain only data publically available at the time of the
 run,
   without deleted data.
  
   The articles aren't permanently deleted of course.  The revisions
 texts
   live on in the database, so a query on toolserver, for example,
 could be
   used to get at them, but that would need to be for research purposes.
  
   Ariel
  
   Στις 17-05-2012, ημέρα Πεμ, και ώρα 13:30 +0200, ο/η Mike Dupont
 έγραψε:
   Hi,
   I am thinking about how to collect articles deleted based on the
 not
   notable criteria,
   is there any way we can extract them from the mysql binlogs? how are
   these mirrors working? I would be interested in setting up a mirror
 of
   deleted data, at least that which is not spam/vandalism based on
 tags.
   mike
  
   On Thu, May 17, 2012 at 1:09 PM, Ariel T. Glenn 
 ar...@wikimedia.org
   wrote:
We now have three mirror sites, yay!  The full list is linked to
 from
http://dumps.wikimedia.org/ and is also available at
   
   
 http://meta.wikimedia.org/wiki/Mirroring_Wikimedia_project_XML_dumps#Current_Mirrors
   
Summarizing, we have:
   
C3L (Brazil) with the last 5 good known dumps,
Masaryk University (Czech Republic) with the last 5 known good
 dumps,
Your.org (USA) with the complete archive of dumps, and
   
for the latest version of uploaded media, Your.org with
http/ftp/rsync
access.
   
Thanks to Carlos, Kevin and Yenya respectively at the above sites
 for
volunteering space, time and effort to make this happen.
   
As people noticed earlier, a series of media tarballs per-project
(excluding commons) is being generated.  As soon as the first run
 of
these is complete we'll announce its location and start generating
them
on a semi-regular basis.
   
As we've been getting the bugs out of the mirroring setup, it is
getting
easier to add new locations.  Know anyone interested?  Please let
 us
know; we would love to have them.
   
Ariel
   
   
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l
  
  
  
  
  
  
   ___
   Wikitech-l mailing list
   Wikitech-l@lists.wikimedia.org
   https://lists.wikimedia.org/mailman/listinfo/wikitech-l
 
 
 
  --
  James Michael DuPont
  Member of Free Libre Open Source Software Kosova http://flossk.org
  Contributor FOSM, the CC-BY-SA map of the world http://fosm.org
  Mozilla Rep https://reps.mozilla.org/u/h4ck3rm1k3
 
  ___
  Wikitech-l mailing list
  Wikitech-l@lists.wikimedia.org
  https://lists.wikimedia.org/mailman/listinfo/wikitech-l