[Xmldatadumps-l] Re: Missing pages/stale data in HTML dumps

2022-04-05 Thread Jan Berkel
> Thanks for noticing and sharing. Another known issue with HTML dumps
> is that it seems that categories and templates are not always
> extracted: https://phabricator.wikimedia.org/T300124

How are these dumps actually produced? Where is code hosted? If the process is easily reproducible "at

[Xmldatadumps-l] Re: Missing pages/stale data in HTML dumps

2022-04-05 Thread Mitar
Hi!

Thanks for noticing and sharing. Another known issue with HTML dumps is that it seems that categories and templates are not always extracted: https://phabricator.wikimedia.org/T300124

Mitar

On Tue, Apr 5, 2022 at 12:59 PM Jan Berkel wrote:
>
> Hello,
>
> just a heads-up for anyone using