Hi folks!!!

My htdig index all the servers in a database: I run the indexing script via
cron the night, with the options -i -a for the htdig. So it wipes the
database and create a whole new database with the new links and words.

But every morning, a service, located at a precise URL with pattern
/tlm/concorsi/ , is updated. And so I have 2 possibilities:

- Reindex the whole database
- Index only the URLs containing the pattern /tlm/concorsi/

Well, I think the 1st chance, could not be very bad, because it takes 15
minutes to do the work. But I think it isn't the most elegant. I tried the
second, but I'm not sure about its real goodness. I ran htdig with the
option -a (to create .work files) and set the start URL at the home of the
service and the limits_urls_to directive to /tlm/concorsi/.

It indexes the right documents, but then it keeps in the database the old
files too. Is there a way to erase from the db all the documents with
pattern specified in the limits_urls_to or similar, by making possibile the
real updating?

I think it could be very useful.

Thanks and Ciao
Gabriele

----------------------------------------------------------

 U.O. Rete Civica - Comune di Prato
 Via Ricasoli, 4 - 59100 Prato PO Italia
 Tel. +39 0574616342    Fax +39 0574616003

 http://www.comune.prato.it
 E-Mail: [EMAIL PROTECTED]

----------------------------------------------------------
------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.

Reply via email to