I had two answers to my question - thanks Mick and Jim. I think I'll go with the Apache <Directory> solution for now because writing an external parser looks like it will take me more than 2 minutes :-)

I did wonder if maybe HtDig might have an option for "I know you don't know what this stuff is, but I know what it is and I want you to index it". Perhaps a new feature? Or maybe HtDig could treat "message/rfc822" the same way it handles "text/plain" for now.

Bill

On Sat, 24 Apr 2004, Bill Roberts wrote:


I am trying to build an index of a problem reporting system archive (we use req1.2 - old, but still going strong!). The files that I'm trying to index are plain text and are basically rfc822 messages concatenated together with some req-specific status commands in between. The problem is that HtDig is not indexing any of them because it reports them as "not HTML". I've used -vvv to get more info and I think the problem is that they are being reported by Apache2 as "Content-Type: message/rfc822".

I can force Apache to report them as something else (text/plain) and they get indexed. I have to do this with a .htaccess file and this seems an ugly solution.

Is there any way to tell HtDig how to handle the message/rfc822 MIME type?
The files I'm indexing have no filename extension.


Ninti Systems wrote:
Normally, anything that can be put in an .htaccess file can be added
directly to httpd.conf in an appropriate <Directory> container.

This isn't exactly what you're after, but it is less ugly than using
.htaccess files. It is also more efficient, as Apache reads the
directives in httpd.conf once at startup, whereas .htaccess files are
checked every time the directory is accessed.

Mick
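
A minimal sketch of the <Directory> approach Mick describes, for anyone finding this thread later. The directory path is hypothetical and stands in for wherever the req archive lives; ForceType is the standard Apache directive for overriding the reported media type, and I'm assuming it is the same directive that was used in the .htaccess file:

```apache
# Hypothetical path: replace with the real location of the req archive.
<Directory "/var/www/req-archive">
    # Serve every file in this directory as text/plain so that
    # HtDig no longer sees message/rfc822 and skips it.
    ForceType text/plain
</Directory>
```

Unlike an .htaccess change, this only takes effect after Apache is restarted or reloaded.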


Jim wrote:
You might consider setting up an external converter to pass the contents
through to htdig as text/plain. More information regarding external parsers
and converters is available with the external_parsers documentation.

http://www.htdig.org/attrs.html#external_parsers


Jim
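
A rough sketch of the kind of converter Jim means, in Python. It assumes, going by the external_parsers documentation linked above, that htdig calls the program with the name of a temporary file holding the document as its first argument (followed by the content type, the URL and the config file name) and reads the converted text from standard output. The script name and the convert() helper are made up for illustration, and this version simply passes the text through unchanged:

```python
#!/usr/bin/env python3
"""Hypothetical pass-through converter: message/rfc822 -> text/plain."""
import sys


def convert(path):
    """Return the contents of the file at `path` as text, otherwise unchanged."""
    # The req archives are plain text already; undecodable bytes are
    # replaced rather than allowed to abort the indexing run.
    with open(path, "rb") as handle:
        return handle.read().decode("utf-8", errors="replace")


# Assumption: the first argument is the temporary input file; the
# remaining arguments (content type, URL, config file) are ignored here.
if __name__ == "__main__" and len(sys.argv) > 1:
    sys.stdout.write(convert(sys.argv[1]))
```

If that assumption holds, the matching htdig.conf entry would look something like `external_parsers: message/rfc822->text/plain /usr/local/bin/rfc822-to-text.py`, with the path again being a placeholder.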




_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general


--
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Bill Roberts
Computer Section
Research School of Astronomy and Astrophysics
Australian National University
Canberra,  Australia

P:  +61 2 6125 0200
F:  +61 2 6125 0233
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~


