Hi Hydriz,

thanks for your answer. These are quite unfortunate news. I look forward to the updated service.

Best,

Daniel

On 11/25/2020 5:29 AM, Hydriz Scholz wrote:
Hi Daniel,

I am the one managing the archival process and indeed, it was around end-2018 when the archival process just died (you can see the status here: https://dumps.wmflabs.org/status.php <https://dumps.wmflabs.org/status.php>).

The current status is that the software behind the archival process is being reworked and will come with features that I will be announcing once it is ready. The Wikidata JSON dumps will resume archival starting next week, so unfortunately all information between end-2018 till around October 2020 will be lost (unless someone has a copy somewhere). As for the dumps in 2017, there were other issues that caused the archival process to stall as well (you can see the list of available and archived dumps here: https://dumps.wmflabs.org/wikidata.txt <https://dumps.wmflabs.org/wikidata.txt>).

I sincerely apologize for the lost information. The new version that I'm currently working on right now will definitely be much better and more robust to handle failures.


Warmest regards,
Hydriz


On Wed, 25 Nov 2020 at 20:22, Daniel Garijo <dgar...@isi.edu <mailto:dgar...@isi.edu>> wrote:

    Hello,

    I am writing this message because I am analyzing the Wikidata JSON
    dumps
    available in the Internet Archive and I have found there are no dumps
    available after Feb 8th, 2019 (see
    
https://archive.org/details/wikimediadownloads?and%5B%5D=%22Wikidata%20entity%20dumps%22
    
<https://archive.org/details/wikimediadownloads?and%5B%5D=%22Wikidata%20entity%20dumps%22>).

    I know the latest dumps are available at
    https://dumps.wikimedia.org/wikidatawiki/entities/
    <https://dumps.wikimedia.org/wikidatawiki/entities/>, but
    unfortunately
    they only cover the last few months.

    I also noticed some gaps in the years where there are JSON dumps
    available. For example, there are no JSON dumps available between
    end of
    Feb, 2017 and Aug 21st, 2017; or between August 21st, 2017 and Nov
    16, 2017.

    Another strange finding is that while there are some entries for the
    dumps in the Internet Archive between March 19th, 2018 and Nov 26th,
    2018 (e.g.,
    https://archive.org/details/wikibase-wikidatawiki-20181104
    <https://archive.org/details/wikibase-wikidatawiki-20181104>),
    none of them contain a JSON dump. That's another gap of more than
    8 months.

    Does anyone on this list know where some of these missing Wikidata
    dumps
    may be found? If anyone has pointers to a server where they can be
    downloaded, I would highly appreciate it.

    Thanks in advance,
    Daniel


    _______________________________________________
    Xmldatadumps-l mailing list
    Xmldatadumps-l@lists.wikimedia.org
    <mailto:Xmldatadumps-l@lists.wikimedia.org>
    https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
    <https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l>



--
Hydriz Scholz
_______________________________________________
Xmldatadumps-l mailing list
Xmldatadumps-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l

Reply via email to