On Sun, 11 Dec 2011, babug wrote:
I need to support microsoft outlook format template (.oft).How do we parse this type of format using POI library.
It's basically the same as a regular Outlook MSG file, so just process it as you would one of those
Alternately, if you don't need much control over what you get back, and just want the basics, use Apache Tika to do it - Tika will extract out the text and metadata for you.
This template file contains HTML format content,which has image.. so and so. I have attached a sample file, need to parse the content. Can some one help me on this?
One thing to be aware of is that many outlook "html" emails are actually compressed RTF ones. If you have one of those, you'll need to get the compressed rtf chunk rather than the html one, use POI to turn that back into regular RTF, and process that. (See Tika for an example of doing that)
Nick --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
