Hi Digger,

It's external parser for HTDig 3.1.2 and very useful if you want to parsing
MS-OFFICE document file.

* What can handle file format.
  - MS-WORD 8.0(above MS-WORD 5.0 version)
  - MS-EXCEL 95 & 97
  - MS-POWERPOINT 97(can't handle PPT 95 format. If you know this format,
Please tell me.)
  - English & Korean version of MS-OFFICE is supported.

* How to use external parser for HTdig.
  1. Please add following MIME type in /etc/mime.types file, if you use
apache web sever.
        application/msword doc
        application/msexcel xls
        application/mspowerpoint ppt
  2. Please add external parser in htdig.conf file.
        external_parsers:       "application/msword"
"/opt/www/htdig/parsers/htparsedoc"\
                                "application/msexcel"   
"/opt/www/htdig/parsers/htparsedoc"\
                                "application/mspowerpoint"      
"/opt/www/htdig/parsers/htparsedoc"
  3. Please check max_doc_size value in htdig.conf file. If this value is
lower than real document file size, external parser should go
unlimited-execution state.

  4. Please add Document.cc, 532 line.
        <----
                    mystrncasecmp("application/msword", token, 18) != 0 &&
                    mystrncasecmp("application/msexcel", token, 19) != 0 &&
                    mystrncasecmp("application/mspowerpoint", token, 24) != 0 &&
        --->


* License
  - This tsparser's not following GNU license and commercial software. But
if you want to use uncommercial web site, you can use it free.
  - This tsparser is only allow to use for HTDig, If you want to use it
another program, you have to contact to techsoft.

Powered by GNU & FSF.

HomePage : www.techsoft.co.kr

------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.

Reply via email to