Thanks for your fast answer!
http://www.textmining.org has a Word text extractor that uses POIFS
----- Original Message ----- From: "Janick Bernet" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Tuesday, October 07, 2003 5:08 PM
Subject: HWPF
I wanted to start implementing a .DOC-parser for the Nutch-Project (www.nutch.org) and wanted to use POI for this purpose, but the Word-format-implementation seems not to be ready yet. Now we only need to be able to extract the text without formating or anything. Is this already possible? If so, could you provide an example how to do this using POI?
If not Ill have to do a parser on my own ) and I would gladly provide my work to POI afterwards.
Regards
Janick <[EMAIL PROTECTED]> ------------------------ http://zap.to/jabernet http://www.swissasp.ch ICQ# 32896520
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
--
Janick Bernet SwissASP AG
~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ home: www.swissasp.ch tel.: +41 (0)52 364 19 43 fax.: +41 (0)52 364 19 93
