On Fri, 31 Dec 2010, jetcat33 wrote:
I've created a program to read .doc and .docx text. I now want to search and replace all newline characters (the ones created with shift+enter in Word) with the following: "<br>" For some reason, however, newline characters aren't being read properly in HWPF and XWPF.
If you want an HTML version, you'll probably want to use Apache Tika. It uses POI internally, but returns an HTML version of the Word files rather than just plain text.
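If plain text with "<br>" markers really is all that's needed, the replacement step could be sketched roughly as below. This is only a sketch under assumptions: it assumes the extracted text represents a Word manual line break as a vertical tab (0x0B) or as "\n", which should be checked against what your extractor actually returns. The class and method names (LineBreaks, toHtmlBreaks) are made up for illustration and are not part of POI or Tika.

```java
// Hypothetical helper, not part of POI or Tika.
final class LineBreaks {
    // Assumption: a manual line break (shift+enter) may show up in the
    // extracted text as a vertical tab character (0x0B).
    private static final String VERTICAL_TAB = String.valueOf((char) 0x0B);

    // Replaces vertical tabs and "\n" characters with the literal "<br>".
    // Other characters (including "\r") are left untouched.
    static String toHtmlBreaks(String text) {
        return text.replace(VERTICAL_TAB, "<br>")
                   .replace("\n", "<br>");
    }

    public static void main(String[] args) {
        // Stand-in for text returned by a Word text extractor.
        String sample = "first line" + VERTICAL_TAB + "second line\nthird line";
        System.out.println(toHtmlBreaks(sample));
    }
}
```

Note that this does no HTML escaping of the rest of the text, so it is not a substitute for a proper HTML conversion.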
Nick
