Re: [Wikidata] Significant change of Wikidata dump size

2019-06-26 Thread Ariel Glenn WMF
Let's take this to the filed task, https://phabricator.wikimedia.org/T226601

On Wed, Jun 26, 2019 at 9:22 AM Stas Malyshev wrote:
> Hi!
>
> On 6/25/19 11:17 PM, Ariel Glenn WMF wrote:
> > I think the issue is with the 0624 json dumps, which do seem a lot
> > smaller than previous weeks' runs.
>

Re: [Wikidata] Significant change of Wikidata dump size

2019-06-26 Thread Stas Malyshev
Hi!

On 6/25/19 11:17 PM, Ariel Glenn WMF wrote:
> I think the issue is with the 0624 json dumps, which do seem a lot
> smaller than previous weeks' runs.

Ah, true, I didn't realize that. I think this may be because of that dumpJson.php issue, which is now fixed. Maybe rerun the dump?

--
Stas

Re: [Wikidata] Significant change of Wikidata dump size

2019-06-26 Thread Ariel Glenn WMF
I think the issue is with the 0624 json dumps, which do seem a lot smaller than previous weeks' runs.

On Wed, Jun 26, 2019 at 8:22 AM Stas Malyshev wrote:
> Hi!
>
> > Which script, please, and which dump? (The conversation was not
> > forwarded so I don't have the context.)
>
> Sorry, the

Re: [Wikidata] Significant change of Wikidata dump size

2019-06-25 Thread Stas Malyshev
Hi!

> Which script, please, and which dump? (The conversation was not
> forwarded so I don't have the context.)

Sorry, the original complaint was:

> I apologize if I missed something, but why is the current JSON dump size ~25GB while a week ago it was ~58GB? (see

Re: [Wikidata] Significant change of Wikidata dump size

2019-06-25 Thread Ariel Glenn WMF
Which script, please, and which dump? (The conversation was not forwarded so I don't have the context.)

On Wed, Jun 26, 2019 at 3:39 AM Stas Malyshev wrote:
> Hi!
>
> > Follow-up: according to my processing script, this dump contains
> > only 30280591 entries, while the main page is still

Re: [Wikidata] Significant change of Wikidata dump size

2019-06-25 Thread Stas Malyshev
Hi!

> Follow-up: according to my processing script, this dump contains
> only 30280591 entries, while the main page is still advertising 57M+
> data items.
> Isn't it a bug in the dump process?

There was a problem with the dump script (since fixed), so the dump may indeed be broken. CCing Ariel to

Re: [Wikidata] Significant change of Wikidata dump size

2019-06-25 Thread Vladimir Ryabtsev
Follow-up: according to my processing script, this dump contains only 30280591 entries, while the main page is still advertising 57M+ data items. Isn't it a bug in the dump process?

Regards,
Vladimir

On Mon, Jun 24, 2019 at 19:37, Vladimir Ryabtsev wrote:
> Hello,
>
> I apologize if I missed
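
For readers who want to repeat the entry count described in this message, here is a minimal sketch. It is not Vladimir's processing script, which was not posted to the list. It assumes the usual layout of the Wikidata JSON entity dump: a single JSON array with "[" on the first line, "]" on the last line, and one entity per line in between, each followed by a comma except the last. The function names and the suffix-based choice of decompressor are illustrative assumptions.

```python
import bz2
import gzip
import json
from typing import IO, Iterator


def iter_entities(stream: IO[str]) -> Iterator[dict]:
    """Yield one entity dict per line of a Wikidata-style JSON dump.

    Assumes one entity per line inside a top-level JSON array, so the
    file can be read line by line without loading it all into memory.
    """
    for raw in stream:
        line = raw.strip().rstrip(",")  # drop the trailing array comma
        if not line or line in ("[", "]"):
            continue  # skip blank lines and the array brackets
        yield json.loads(line)


def count_entities(stream: IO[str]) -> int:
    """Count the entities in an already opened text stream."""
    return sum(1 for _ in iter_entities(stream))


def count_entities_in_file(path: str) -> int:
    """Open a .gz or .bz2 dump (hypothetical path) and count its entities."""
    opener = bz2.open if path.endswith(".bz2") else gzip.open
    with opener(path, "rt", encoding="utf-8") as stream:
        return count_entities(stream)
```

Calling count_entities_in_file on a downloaded dump and comparing the result with the item count advertised on the main page is one way to spot a truncated dump like the one discussed here. Parsing every line is slow on a full dump; counting non-bracket lines without json.loads would be faster, but it would not catch a line that was cut off mid-entity.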

[Wikidata] Significant change of Wikidata dump size

2019-06-24 Thread Vladimir Ryabtsev
Hello,

I apologize if I missed something, but why is the current JSON dump size ~25GB while a week ago it was ~58GB? (see https://dumps.wikimedia.org/wikidatawiki/entities/20190617/)

Regards,
Vladimir