Thank you, guys. I've already got what I needed.

Namespaces are easily determined from the page prefix. I am not
bothered if there are any anomalies out there (e.g. a page starting with
"User talk:" being in NS 0), and the query is lighter when ns isn't
being pulled out of the DB. Overall, it took about 17
seconds to write out the data about 1M revisions.
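
For anyone who wants to do the same, here is a rough sketch of the prefix
lookup in Python. The function name and the table are only illustrative (not
the script I ran), and the numbers assume the default English namespace
names; a localized or customized wiki would need its own table.

```python
# Hypothetical, partial prefix table; assumes default English namespace names.
NAMESPACE_BY_PREFIX = {
    "Talk": 1,
    "User": 2,
    "User talk": 3,
    "File": 6,
    "File talk": 7,
    "Template": 10,
    "Template talk": 11,
    "Category": 14,
    "Category talk": 15,
}


def namespace_from_title(title):
    """Guess the namespace number from the title prefix alone.

    Titles without a known prefix are treated as NS 0, so anomalies
    (a page named "User talk:..." that really lives in NS 0) are
    misclassified on purpose; that is the trade-off accepted above.
    """
    prefix, sep, _rest = title.partition(":")
    if not sep:
        # No colon at all, so this can only be a main-namespace title.
        return 0
    # Titles are often stored with underscores instead of spaces.
    prefix = prefix.replace("_", " ").strip()
    return NAMESPACE_BY_PREFIX.get(prefix, 0)
```

This keeps ns out of the query entirely, at the cost of trusting the title.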

Setting a span for rev_ids was only to take the data in chunks; I wouldn't
have had to specify one if I wanted to take them all at once. But hey,
such chunks are even easier to sort by rev_id.
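
The chunking itself is nothing special. A minimal Python sketch, where the
function name, the bounds and the chunk size are all made up for
illustration:

```python
def rev_id_spans(first_id, last_id, chunk_size):
    """Yield inclusive (start, end) rev_id spans covering first_id..last_id."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    start = first_id
    while start <= last_id:
        # The last span may be shorter than chunk_size.
        end = min(start + chunk_size - 1, last_id)
        yield start, end
        start = end + 1


# Each span would then bound one query, for example (assumed query shape):
#   ... WHERE rev_id BETWEEN <start> AND <end> ORDER BY rev_id
for start, end in rev_id_spans(1, 10, 4):
    print(start, end)
```

Since the spans come out in ascending order and never overlap, the chunks
written one after another are already ordered by rev_id overall, as long as
each chunk is sorted internally.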

M

_______________________________________________
Toolserver-l mailing list (Toolserver-l@lists.wikimedia.org)
https://lists.wikimedia.org/mailman/listinfo/toolserver-l
Posting guidelines for this list: 
https://wiki.toolserver.org/view/Mailing_list_etiquette
