On Fri, 09 Apr 1999, you wrote:
>hello,
>i have got a problem. 
>
>i am archiving my high-volume lists with mhonarc, automagically
>gzipping the html documents.
>my apache server is extended with mod_gzip, so i can reach e.g.
>
>http://nowhere.te/dupadupa.html.gz
>
>and my server decompresses it on the fly and shows the client text/html
>content.
>but htdig ignores index.html.gz files (and does not include them in the
>search database).
>
>how to tell him to be a good child ;-) ??

By default, ht://Dig assumes that a compressed file is not of a text/* type,
i.e. it concludes from the URL that your files contain nothing indexable.
You can make ht://Dig index compressed files as well by removing their
extension from the "bad_extensions" list of file types not to index (for
more information, please have a look at the configuration file documentation
at the home of ht://Dig: http://www.htdig.org).
However, you will run into trouble if you also have other compressed
items on your server, e.g. a compressed tar(1) archive, because ht://Dig
would then try to download and index _any_ *.gz file. A better way of
handling this would IMHO be to store the compressed HTML files under a
new file extension (e.g. .zhtml) and declare a proper Apache handler for
that extension. This avoids any trouble that could emerge from other,
non-HTML, compressed files on your server. Furthermore, it leaves such
files compressed when they are accessed, which noticeably reduces the
bandwidth used for downloading tar(1) archives or other compressed stuff
that should not be decompressed (and imagine the confusion of a poor user
who tries to decompress a downloaded *.tar.gz which isn't compressed any
more!).
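
To illustrate the difference, here is a rough Python sketch of an
extension-based exclusion filter. It is not ht://Dig's actual code; the
function names, the exclusion list and the archive.tar.gz and .zhtml URLs
are made up for the example, and the list is not ht://Dig's real default.

```python
from urllib.parse import urlparse


def is_skipped(url, bad_extensions):
    """Return True if the URL path ends with one of the excluded extensions."""
    path = urlparse(url).path.lower()
    return any(path.endswith(ext) for ext in bad_extensions)


def crawlable(urls, bad_extensions):
    """Return the URLs an extension filter of this kind would let through."""
    return [u for u in urls if not is_skipped(u, bad_extensions)]


# Hypothetical exclusion list for illustration only.
with_gz = [".gz", ".zip", ".tar", ".gif"]
# The same list after removing the ".gz" entry.
without_gz = [ext for ext in with_gz if ext != ".gz"]

urls = [
    "http://nowhere.te/dupadupa.html.gz",  # gzipped HTML from the question
    "http://nowhere.te/archive.tar.gz",    # hypothetical compressed tar archive
    "http://nowhere.te/dupadupa.zhtml",    # hypothetical dedicated extension
]

print(crawlable(urls, with_gz))     # only the .zhtml URL passes
print(crawlable(urls, without_gz))  # all three pass, including the tar archive
```

With ".gz" in the list, only the .zhtml page gets through; with ".gz"
removed, the tar archive gets through as well, which is the problem
described above. A dedicated extension keeps the two cases apart.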


hth,
  Torsten

--
InWise - Wirtschaftlich-Wissenschaftlicher Internet Service GmbH
Waldhofstraße 14                            Tel: +49-4101-403605
D-25474 Ellerbek                            Fax: +49-4101-403606
E-Mail: [EMAIL PROTECTED]            Internet: http://www.inwise.de
