At 6:25 PM +0300 2/28/00, Andrey Novikov wrote:
>How can I smoothly index 30Gb of HTML text not disturbing
>existing index? Can I do it incremently in several steps?
>What hints can you share with me for that great job?

Well, if you mean you have an existing index that you want to keep 
querying, you'll need to use -a at the least. :-)

As for hints, I'd say you'd want a lot of disk space, RAM, and swap. 
You'll probably want to do it incrementally. You don't say much about 
how your data is organized, but I'd try splitting it into several 
pieces and then use htmerge to merge them together. Of course the 
resulting database is going to be huge, so if you never need to 
search the whole database together, I wouldn't even bother merging it 
together.

Beyond that, you're in somewhat uncharted territory. I can think of 
one or two people who have that much indexed, but they have their 
databases split over many categories.

-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.

Reply via email to