Thank you, guys, I've already got what I needed. Namespaces are easily determined from the page title prefix; I'm not bothered by any anomalies out there (e.g. a page whose title starts with "User talk:" being in NS 0), and the query is lighter when ns isn't pulled from the DB. Overall, it took about 17 seconds to write out the data for 1M revisions.
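For anyone wanting to do the same, here is a rough sketch of the prefix lookup I mean. It is not the exact code I ran: the function name `namespace_from_title` and the table `NAMESPACE_PREFIXES` are made up for illustration, and the table is only a partial list of the standard MediaWiki namespace numbers, without aliases or localized names.

```python
# Partial, illustrative table of standard MediaWiki namespace numbers.
# Assumption: English prefixes only; real wikis have more namespaces and aliases.
NAMESPACE_PREFIXES = {
    "Talk": 1,
    "User": 2,
    "User talk": 3,
    "Wikipedia": 4,
    "File": 6,
    "MediaWiki": 8,
    "Template": 10,
    "Help": 12,
    "Category": 14,
}


def namespace_from_title(title):
    """Guess the namespace number from the prefix of a full page title.

    Titles without a known prefix are treated as main namespace (0).
    """
    prefix, sep, _rest = title.partition(":")
    if not sep:
        return 0  # no colon at all, so no prefix
    # Titles may use underscores instead of spaces.
    normalized = prefix.replace("_", " ").strip()
    return NAMESPACE_PREFIXES.get(normalized, 0)
```

A title with a colon but no known prefix (say "2001: A Space Odyssey") falls back to 0. The anomaly mentioned above stays unhandled on purpose: a main-namespace page whose title really starts with "User talk:" would be reported as NS 3.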
Setting a span of rev_ids was only a way to fetch the data in chunks; I wouldn't have had to specify one if I had wanted to take everything at once. But hey, such chunks are even easier to sort by rev_id.

M
_______________________________________________
Toolserver-l mailing list (Toolserver-l@lists.wikimedia.org)
https://lists.wikimedia.org/mailman/listinfo/toolserver-l
Posting guidelines for this list: https://wiki.toolserver.org/view/Mailing_list_etiquette