Re: [Xmldatadumps-l] Housekeeping categories?

2013-02-13 Thread Robert Crowe
So I guess ideally we would create a table that lists all the subcategories of Category:Wikipedia administration. Are there any subcategories that are not housekeeping? I could try to write some code for that, but I don't have any idea what the check-in process is like to include it in the bui

Re: [Xmldatadumps-l] Housekeeping categories?

2013-02-13 Thread Petr Onderka
> Copy_to_Wikimedia_Commons_(bot-assessed)
> Creative_Commons_Attribution-ShareAlike_3.0_files
These contain only files.
> Stub-Class_biography_articles
> Automatically_assessed_biography_articles
> WikiProject_Disambiguation_pages
These contain only talk pages. Also, all of them are indirect s

Re: [Xmldatadumps-l] Housekeeping categories?

2013-02-13 Thread Federico Leva (Nemo)
I don't think there's any simple or reliable way: your only option is probably traversing the whole category tree and finding out whether a category is not a (sub-){1,100}category of https://en.wikipedia.org/wiki/Category:Articles or equivalent... and hoping there are not too many loops!
Nemo
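
The traversal described above can be sketched roughly as follows. This is only an illustration under stated assumptions: `descendant_categories` and the `subcats` mapping are hypothetical names, the mapping is assumed to have been built beforehand (for example from the subcategory rows of the categorylinks dump), and the sample data is a made-up toy graph, not the real category tree. A visited set guards against loops, and `max_depth` mirrors the {1,100} bound.

```python
from collections import deque


def descendant_categories(root, subcats, max_depth=100):
    """Return every category reachable from `root` via subcategory links.

    `subcats` maps a category name to a list of its direct subcategories.
    The `seen` set makes the walk terminate even if the graph has loops.
    """
    seen = {root}
    queue = deque([(root, 0)])
    while queue:
        cat, depth = queue.popleft()
        if depth >= max_depth:
            continue  # do not descend further than max_depth levels
        for child in subcats.get(cat, ()):
            if child not in seen:
                seen.add(child)
                queue.append((child, depth + 1))
    seen.discard(root)  # report descendants only, not the root itself
    return seen


# Toy data for illustration only (hypothetical, includes a deliberate loop).
subcats = {
    "Articles": ["History", "Literature"],
    "History": ["Ancient_history"],
    "Ancient_history": ["History"],
    "Wikipedia_administration": ["Unprintworthy_redirects"],
}

content = descendant_categories("Articles", subcats)
is_housekeeping = "Unprintworthy_redirects" not in content
```

Any category that does not appear in `content` would then be a candidate housekeeping category, subject to how clean the real tree turns out to be.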

Re: [Xmldatadumps-l] Housekeeping categories?

2013-02-13 Thread Robert Crowe
Sorry, those were poorly chosen examples. Here are some better ones:
Copy_to_Wikimedia_Commons_(bot-assessed)
Stub-Class_biography_articles
Automatically_assessed_biography_articles
WikiProject_Disambiguation_pages
Creative_Commons_Attribution-ShareAlike_3.0_files
I don't think that any of these

Re: [Xmldatadumps-l] Housekeeping categories?

2013-02-13 Thread Petr Onderka
Both of the categories you mentioned *are* hidden, so I think you can use that.
Petr Onderka [[en:User:Svick]]
On Wed, Feb 13, 2013 at 6:24 PM, Robert Crowe wrote:
> Is there any way to distinguish between categories like History, or
> Literature for example, and what I would think of as categor

[Xmldatadumps-l] Housekeeping categories?

2013-02-13 Thread Robert Crowe
Is there any way to distinguish between categories like History or Literature, for example, and what I would think of as categories that are used for internal housekeeping, like "Unprintworthy_redirects" or "Nonindexed_pages"? They're not hidden categories, but conceptually there is a clear differe

Re: [Xmldatadumps-l] mw errors, dumps off-line til it's fixed

2013-02-13 Thread Ariel T. Glenn
The bug was found and squashed by MaxSem and we are back in business. I will be rerunning the broken jobs throughout the day.
Ariel
On Wed, 13-02-2013, at 09:11 +0200, Ariel T. Glenn wrote:
> Good morning folks,
>
> People monitoring the dumps progress page will have noticed tha