Office produces some badly tortured HTML, but the HTML-Tidy utility can
clean it up and convert it into standards-conformant HTML or XHTML.
C version (highly recommend it)
http://www.w3.org/People/Raggett/tidy/
Java version (haven't used it)
http://sourceforge.net/projects/jtidy
Tidy is covered by the MIT License rather than the GPL, so you can
include it in commercial projects.
--
http://cms-list.org/
trim your replies for good karma.
- [cms-list] Third-party products that convert MS offi... Guy Chammas
- RE: [cms-list] Third-party products that conver... Andy Harrison
- RE: [cms-list] Third-party products that co... Raju Mathur
- RE: [cms-list] Third-party products that conver... Steve Drucker
- Re: [cms-list] Third-party products that co... Steve Yelvington
- Re: [cms-list] Third-party products tha... Charles Reitzel
- [cms-list] Query: CMS Success Stori... Tony Byrne - CMSWatch