suggestions.
Mihai
-Original Message-
From: wikitech-l-boun...@lists.wikimedia.org
[mailto:wikitech-l-boun...@lists.wikimedia.org] On Behalf Of Jeremy Baron
Sent: 23 September 2013 17:12
To: Wikimedia developers; Wikipedia Xmldatadumps-l
Subject: Re: [Wikitech-l] Bulk download
On Sep 23
...@lists.wikimedia.org
[mailto:wikitech-l-boun...@lists.wikimedia.org] On Behalf Of Jeremy Baron
Sent: 23 September 2013 17:12
To: Wikimedia developers; Wikipedia Xmldatadumps-l
Subject: Re: [Wikitech-l] Bulk download
On Sep 23, 2013 9:25 AM, Mihai Chintoanu mihai.chinto...@skobbler.com
wrote:
I have a list of about
Hi everyone,
I have a list of about 1.8 million images which I have to download from
commons.wikimedia.org. Is there any simple way to do this which doesn't involve
an individual HTTP hit for each image?
Many thanks in advance.
Mihai
___
Wikitech-l
On Mon, Sep 23, 2013 at 11:22 PM, Mihai Chintoanu
mihai.chinto...@skobbler.com wrote:
Hi everyone,
I have a list of about 1.8 million images which I have to download from
commons.wikimedia.org.
Why?
___
Wikitech-l mailing list
We have a somewhat out of date off site mirror of images (I'm working on
the out of date part). This includes commons. It's accessible by
rsync, http, ftp:
http://meta.wikimedia.org/wiki/Mirroring_Wikimedia_project_XML_dumps#Media
Thanks again to your.org for hosting that.
Are these images
On Sep 23, 2013 9:25 AM, Mihai Chintoanu mihai.chinto...@skobbler.com
wrote:
I have a list of about 1.8 million images which I have to download from
commons.wikimedia.org. Is there any simple way to do this which doesn't
involve an individual HTTP hit for each image?
You mean full size
I added Jeremy's helpful tips to
https://en.wikipedia.org/wiki/Wikipedia:Database_download#Where_are_images_and_uploaded_files--
feel free to improve these/reference them from other appropriate
places,
etc.
--scott
___
Wikitech-l mailing list
Ariel and others have already touched upon this, but just in case you
want more details (I'm trying to do something similar):
If your images are centered around one wiki (for example, the 1.8
million images are for articles in English Wikipedia), you can use the
tarballs at your.org: