Ciao friends, I'm bothering you again ...
I've noticed that when I index my site, htdig doesn't realize that
http://www.pippo.it is the same document as http://www.pippo.it/home.htm
(because I've set home.htm as the default page in my Apache server). A
configuration option ("remove_default_doc") was introduced to address
this, but it doesn't seem to work reliably for me.
Tell me if I am wrong. Is it possible to use the ETag response header to
"identify" a document on the Web? I've made some tests. If I request the URL
"http://www.comune.prato.it" and then
"http://www.comune.prato.it/home.htm", I get the same ETag value in both
responses. Does this mean anything to you?
Why don't we store it and use it to compare two docs? This would let us
store the same document only once.
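
To make the idea concrete, here is a rough sketch in Python (not htdig's
C++, and not a patch). It assumes the server derives the ETag from the
underlying file, as Apache appears to do in my tests; the HTTP spec only
defines ETags for comparing versions of a single resource, so treating equal
ETags on two URLs as "same document" is a heuristic. The function names, the
(url, headers) input shape and the ETag values are all made up for
illustration.

```python
from urllib.parse import urlsplit


def group_by_etag(responses):
    """Group URLs by (host, strong ETag).

    `responses` is an iterable of (url, headers) pairs, where headers is a
    plain dict of response headers (a hypothetical shape for this sketch).
    """
    groups = {}
    for url, headers in responses:
        # Header names are case-insensitive, so normalise them first.
        lowered = {name.lower(): value for name, value in headers.items()}
        etag = lowered.get("etag")
        if not etag or etag.startswith("W/"):
            # No ETag, or a weak one: do not use it to merge documents.
            continue
        # Only compare ETags that came from the same host.
        host = urlsplit(url).netloc.lower()
        groups.setdefault((host, etag), []).append(url)
    return groups


def duplicate_groups(groups):
    """Keep only the groups where more than one URL shares the ETag."""
    return {key: urls for key, urls in groups.items() if len(urls) > 1}


# Made-up ETag values, only for illustration.
responses = [
    ("http://www.comune.prato.it", {"ETag": '"2a1f-11ab-38f6c2d0"'}),
    ("http://www.comune.prato.it/home.htm", {"etag": '"2a1f-11ab-38f6c2d0"'}),
    ("http://www.comune.prato.it/news.htm", {"ETag": '"2a20-0c10-38f6c2d1"'}),
]
for (host, etag), urls in duplicate_groups(group_by_etag(responses)).items():
    print(host, etag, urls)
```

Keying on the host as well as the ETag is deliberate: two different servers
could produce the same tag for unrelated files.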
Probably, if it were possible, you would have already adopted this
solution!!! But, who knows ...
Another way to avoid storing the same document more than once could be to
compare the size and the modification date of the docs.
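
Again only as a sketch, in Python, of what such a comparison might look like.
It assumes the size and date come from the Content-Length and Last-Modified
response headers; the helper names are hypothetical. Two different pages can
share both values by coincidence, so a match means "probably the same", not
"certainly the same".

```python
from email.utils import parsedate_to_datetime


def size_mtime_key(headers):
    """Return (size, modification time) or None if either is unusable."""
    # Header names are case-insensitive, so normalise them first.
    lowered = {name.lower(): value for name, value in headers.items()}
    try:
        size = int(lowered["content-length"])
        modified = parsedate_to_datetime(lowered["last-modified"])
    except (KeyError, TypeError, ValueError):
        # Missing or malformed header: no fingerprint for this document.
        return None
    return (size, modified)


def probably_same(headers_a, headers_b):
    """True when both responses carry the same size and modification date."""
    key_a = size_mtime_key(headers_a)
    key_b = size_mtime_key(headers_b)
    return key_a is not None and key_a == key_b


# Made-up header values, only for illustration.
root = {"Content-Length": "4523",
        "Last-Modified": "Tue, 04 Jul 2000 10:00:00 GMT"}
home = {"content-length": "4523",
        "last-modified": "Tue, 04 Jul 2000 10:00:00 GMT"}
print(probably_same(root, home))
```

Documents without both headers (dynamic pages, for example) get no
fingerprint and are never merged, which seems the safer default.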
Well, that's all
Let me know
-Gabriele
>>> Zinedine "Zizou" Zidane, don't leave us !!! <<<
-------------------------------------------------
Gabriele Bartolini
U.O. Rete Civica - Comune di Prato
Prato - Italia - Europa
e-mail: [EMAIL PROTECTED]
http://www.po-net.prato.it
-------------------------------------------------