At 12:55 PM -0500 10/22/98, S.Pietsch wrote:
>I've got a question about the newest version of ht://Dig. Is it possible
>to get an index of Word documents (Word 6/7 and Word 8)? What kind of
>parser do I need for this job and where do I get it from?
>As far as I understand, an index of Word documents is only possible with
>an external parser. But I know little more. :(

What you'll need is a program that will read Word documents and (hopefully)
translate them to text or some other reasonable format. Then you'll need a
script that will do some minimal parsing on the text (e.g. pick out the
words in the document) and report that to ht://Dig. The next release
(3.1.0b2) will have a contributed external parser example
(contrib/htparsedoc) which will parse some Word documents.
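
To give a rough idea of what such a script looks like, here is a minimal
sketch in Python. It is not the contrib/htparsedoc script. It rests on several
assumptions you should check against the external_parsers documentation for
your version: that ht://Dig calls the parser with the input file, content
type, URL and config file as arguments, and that it reads tab-separated
records on standard output, with "t" for the title and "w" for a word followed
by a location (0-1000) and a heading level. The converter command (catdoc
here) is only a placeholder for whatever Word-to-text program you have, and
word_records and main are names made up for this example.

```python
#!/usr/bin/env python3
import re
import subprocess
import sys

# Placeholder converter: substitute any program that prints a Word
# document as plain text on standard output.
CONVERTER = ["catdoc"]

WORD_RE = re.compile(r"[A-Za-z0-9]+")


def word_records(text, min_length=3):
    """Build one 'w' record per word: w <TAB> word <TAB> location <TAB> heading.

    Record layout is an assumption; verify it against your ht://Dig docs.
    """
    records = []
    for match in WORD_RE.finditer(text):
        word = match.group().lower()
        if len(word) < min_length:
            continue  # skip very short words
        # Relative position of the word in the text, scaled to 0-1000.
        location = match.start() * 1000 // max(len(text), 1)
        records.append("w\t%s\t%d\t0" % (word, location))
    return records


def main(argv):
    # Assumed argument order: input file, content type, URL, config file.
    path = argv[1]
    result = subprocess.run(CONVERTER + [path], capture_output=True,
                            text=True, check=True)
    text = result.stdout
    # Use the first non-empty line of the converted text as the title.
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    if lines:
        print("t\t%s" % lines[0].replace("\t", " "))
    for record in word_records(text):
        print(record)


if __name__ == "__main__" and len(sys.argv) > 1:
    main(sys.argv)
```

You would then point the external_parsers attribute in your config file at
this script for the Word content type (application/msword).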


-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/


