Chengbin Zheng wrote:

Thank you for dropping by and sharing this information with us, Tomasz!

It is good just knowing that it is in the queue. Have you considered making
a version of static HTML Wikipedia where there are no user talk and
discussion pages that eat up half the space (like the 5GB XML dump for
English Wikipedia)? As in the previous E-Mail, it is impossible to delete
millions of pages through Windows Vista's search function (I left it
overnight, and it ended up eating 1.3GB of RAM and maxing out one of my
cores. Even deleting a single file took minutes).

The Windows (and others?) GUI wasn't really designed for the number of items you are dealing with. You are asking it to search for all the files that match your pattern, keep the millions (?) of results in memory, and then show you a window containing those millions of items and let you do all the magic GUI operations (selecting, dragging, ...), all the while keeping track of which ones you've selected, moved about, etc.

I know you want to avoid using the command line, but in this case it's really the much simpler, and probably the only feasible, choice: search the internet or ask around for the right commands and issue them on the command line. It's only going to be one line of typing once you've got it, and you can write it down on a piece of paper or something for future reference. It's not like you have to learn the ins and outs of every command and all its options. (Of course, you would want to test it on a small sample to make sure the command is correct before you let it loose on the whole dump.)
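
If a small script is more comfortable than a one-line command, the same idea can be sketched in Python. This is only a rough sketch under assumptions: the patterns "Talk~*" and "User_talk~*" are my guess at how talk pages might be named in the static dump (check your own files first), and the names iter_matches and delete_matches are made up for illustration. It walks the directory tree and handles one file at a time, so it never holds millions of results in memory, and by default it only counts matches without deleting anything.

```python
import fnmatch
import os

# Hypothetical patterns: adjust to however talk pages are named in your dump.
PATTERNS = ["Talk~*", "User_talk~*"]


def iter_matches(root, patterns=PATTERNS):
    """Yield matching file paths one at a time instead of building a big list."""
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if any(fnmatch.fnmatch(name, p) for p in patterns):
                yield os.path.join(dirpath, name)


def delete_matches(root, patterns=PATTERNS, dry_run=True):
    """Delete matching files under root; with dry_run=True only count them."""
    count = 0
    for path in iter_matches(root, patterns):
        if not dry_run:
            os.remove(path)
        count += 1
    return count
```

Run delete_matches on a small sample directory with the default dry_run=True first and check that the count looks right; only then call it with dry_run=False on the whole dump.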

KTC

--
Experience is a good school but the fees are high.
    - Heinrich Heine

_______________________________________________
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l
