Hi folks!!! My htdig index all the servers in a database: I run the indexing script via cron the night, with the options -i -a for the htdig. So it wipes the database and create a whole new database with the new links and words. But every morning, a service, located at a precise URL with pattern /tlm/concorsi/ , is updated. And so I have 2 possibilities: - Reindex the whole database - Index only the URLs containing the pattern /tlm/concorsi/ Well, I think the 1st chance, could not be very bad, because it takes 15 minutes to do the work. But I think it isn't the most elegant. I tried the second, but I'm not sure about its real goodness. I ran htdig with the option -a (to create .work files) and set the start URL at the home of the service and the limits_urls_to directive to /tlm/concorsi/. It indexes the right documents, but then it keeps in the database the old files too. Is there a way to erase from the db all the documents with pattern specified in the limits_urls_to or similar, by making possibile the real updating? I think it could be very useful. Thanks and Ciao Gabriele ---------------------------------------------------------- U.O. Rete Civica - Comune di Prato Via Ricasoli, 4 - 59100 Prato PO Italia Tel. +39 0574616342 Fax +39 0574616003 http://www.comune.prato.it E-Mail: [EMAIL PROTECTED] ---------------------------------------------------------- ------------------------------------ To unsubscribe from the htdig3-dev mailing list, send a message to [EMAIL PROTECTED] containing the single word "unsubscribe" in the SUBJECT of the message.
[htdig3-dev] Updating only a part of the database
U.O. Telematica Municipale - Comune di Prato Fri, 29 Jan 1999 07:16:06 -0500
- Re: [htdig3-dev] Updating onl... U.O. Telematica Municipale - Comune di Prato
- Re: [htdig3-dev] Updatin... Geoff Hutchison
- Re: [htdig3-dev] Upd... U.O. Telematica Municipale - Comune di Prato
- Re: [htdig3-dev]... Geoff Hutchison
- Re: [htdig3-... U.O. Telematica Municipale - Comune di Prato
