On May 14, 2008, at 01:14, [EMAIL PROTECTED] wrote:
FWIW: Small correction
In the example you provided the following will break xml:
Na zolotom kryl'ce sideli
This is not true. The entity ' is also defined by XML. The
intention is correct, though. There are many entities predefi
Ok... I had a worse problem with the project i was involved...
I had to "clean-up" from MSOffice html to produce nice xml that would be
"eaten" up by fop.
The issue was simply sorted with the use of tons of regexp, coupled with
the use of tidy to convert the html to xml compliant html and the