I did wonder if maybe HtDig might have an option for "I know you don't know what this stuff is, but I know what it is and I want you to index it". Perhaps a new feature? Or maybe HtDig could treat "message/rfc822" the same way it handles "text/plain" for now.
Bill
On Sat, 24 Apr 2004, Bill Roberts wrote:
I am trying to build an index of a problem reporting system archive (we use req1.2 - old, but still going strong!). The files that I'm trying to index are plain text and are basically rfc822 messages concatenated together with some req-specific status commands in between. The problem is that HtDig is not indexing any of them because it reports them as "not HTML". I've used -vvv to get more info and I think the problem is that they are being reported by Apache2 as "Content-Type: message/rfc822".
I can force Apache to report them as something else (text/plain) and they get indexed. I have to do this with a .htaccess file and this seems an ugly solution.
Is there any way to tell HtDig how to handle message/rfc822 mime type? The files I'm indexing do not have any extension as part of the filename.
Ninti Systems wrote:
Normally, anything that can be put in an .htaccess file can be added directly to httpd.conf in an appropriate <Directory> container.
This isn't exactly what you're after, but is less ugly than using .htaccess files. It is also more efficient, as Apache only deals with the directives on start up as opposed to every time the directory is accessed.
Mick
Jim wrote:
You might consider setting up an external converter to pass the contents through to htdig as text/plain. More information regarding external parsers and converters is available with the external_parsers documentation.
http://www.htdig.org/attrs.html#external_parsers
Jim
_______________________________________________ ht://Dig general mailing list: <[EMAIL PROTECTED]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general
-- ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Bill Roberts Computer Section Research School of Astronomy and Astrophysics Australian National University Canberra, Australia
P: +61 2 6125 0200 F: +61 2 6125 0233 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
------------------------------------------------------- This SF.net email is sponsored by: The Robotic Monkeys at ThinkGeek For a limited time only, get FREE Ground shipping on all orders of $35 or more. Hurry up and shop folks, this offer expires April 30th! http://www.thinkgeek.com/freeshipping/?cpg=12297 _______________________________________________ ht://Dig general mailing list: <[EMAIL PROTECTED]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

