Hi Digger,
This is an external parser for HTDig 3.1.2. It is very useful if you want
to parse MS-OFFICE document files.
* Supported file formats.
- MS-WORD 8.0 (and other versions above MS-WORD 5.0)
- MS-EXCEL 95 & 97
- MS-POWERPOINT 97 (the PPT 95 format is not handled. If you know this
format, please tell me.)
- The English and Korean versions of MS-OFFICE are supported.
* How to use the external parser with HTDig.
1. Add the following MIME types to the /etc/mime.types file if you use
the Apache web server.
application/msword doc
application/msexcel xls
application/mspowerpoint ppt
2. Add the external parser to the htdig.conf file.
external_parsers: "application/msword" \
"/opt/www/htdig/parsers/htparsedoc" \
"application/msexcel" \
"/opt/www/htdig/parsers/htparsedoc" \
"application/mspowerpoint" \
"/opt/www/htdig/parsers/htparsedoc"
3. Check the max_doc_size value in the htdig.conf file. If this value is
smaller than the actual document file size, the external parser may never
finish running.
4. Add the following lines at line 532 of Document.cc.
<----
mystrncasecmp("application/msword", token, 18) != 0 &&
mystrncasecmp("application/msexcel", token, 19) != 0 &&
mystrncasecmp("application/mspowerpoint", token, 24) != 0 &&
--->
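
Steps 2 and 3 can be sanity-checked before running a dig. The script below
is only a rough sketch and not part of the parser: it reads htdig.conf-style
text, joins lines that end in a backslash, lists the content-type/program
pairs, and reports files larger than max_doc_size (taken to be in bytes).
The function names and the 5000000 value are made up for illustration.

```python
import os
import shlex

# Sample text in the style of htdig.conf; the max_doc_size value is an assumption.
SAMPLE = r'''
max_doc_size: 5000000
external_parsers: "application/msword" \
"/opt/www/htdig/parsers/htparsedoc" \
"application/msexcel" \
"/opt/www/htdig/parsers/htparsedoc" \
"application/mspowerpoint" \
"/opt/www/htdig/parsers/htparsedoc"
'''


def read_attributes(conf_text):
    """Collect 'name: value' attributes, joining backslash-continued lines."""
    attrs = {}
    pending = ""
    for raw in conf_text.splitlines():
        line = raw.rstrip()
        if line.endswith("\\"):
            pending += line[:-1] + " "  # continuation: keep accumulating
            continue
        full = (pending + line).strip()
        pending = ""
        if not full or full.startswith("#") or ":" not in full:
            continue  # blank line, comment, or a stray line without a name
        name, value = full.split(":", 1)
        attrs[name.strip()] = value.strip()
    return attrs


def parser_pairs(attrs):
    """Return (content_type, program) pairs from the external_parsers value."""
    tokens = shlex.split(attrs.get("external_parsers", ""))
    if len(tokens) % 2:
        raise ValueError("external_parsers needs content-type/program pairs")
    return list(zip(tokens[0::2], tokens[1::2]))


def oversized(paths, attrs):
    """Return the files whose size in bytes exceeds max_doc_size."""
    limit = int(attrs["max_doc_size"])
    return [p for p in paths if os.path.getsize(p) > limit]


if __name__ == "__main__":
    attrs = read_attributes(SAMPLE)
    for content_type, program in parser_pairs(attrs):
        print(content_type, "->", program)
```

If a trailing backslash is missing, the value stops at that line and
parser_pairs() raises an error because the tokens no longer pair up.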
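
For readers who do not want to open Document.cc, the three lines in step 4
amount to a case-insensitive prefix test on the Content-Type token. Here is
a small Python illustration of that test only. The mystrncasecmp() below is
a stand-in that I assume behaves like C strncasecmp, and is_office_type() is
a made-up name. The surrounding condition in Document.cc is not shown here.

```python
def mystrncasecmp(a, b, n):
    """Stand-in for the C helper: 0 when the first n characters match, ignoring case."""
    x, y = a[:n].lower(), b[:n].lower()
    return (x > y) - (x < y)


# The three content types from step 4; their lengths are 18, 19 and 24.
OFFICE_TYPES = (
    "application/msword",
    "application/msexcel",
    "application/mspowerpoint",
)


def is_office_type(token):
    """True when token starts with one of the three MS-OFFICE content types."""
    return any(mystrncasecmp(t, token, len(t)) == 0 for t in OFFICE_TYPES)


if __name__ == "__main__":
    for token in ("Application/MSWord", "application/msexcel", "application/pdf"):
        print(token, is_office_type(token))
```

Because only the first 18, 19 or 24 characters are compared, a token with
something after the type name, such as a trailing parameter, still matches.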
* License
- tsparser is not released under the GNU license; it is commercial
software. However, you can use it free of charge on a non-commercial web
site.
- tsparser may only be used with HTDig. If you want to use it with
another program, you have to contact techsoft.
Powered by GNU & FSF.
HomePage : www.techsoft.co.kr