Stephen Collyer <[EMAIL PROTECTED]> writes: > I've run into a pretty nasty problem with XML::Xerces > when parsing in a CGI script using a SAX2 parser: in brief, > the parser seems to ignore totally any character data in a > document that contains UTF-8 characters.
Hi Stephen, Yes, I think I know exactly what this is. While preparing for the 2.6 release I stumbled upon an obvious bug in the callback handlers - i.e. SAX2 character parsers. They are transcoding into ASCII by default, and I have not provided a way to override that, so everything will get tossed. There is a reasonably simple fix for this, but it is in the C++ code, not the Perl code, and it involves re-running SWIG. It is a *serious* problem, as serious as the memory leaks, and must be fixed. I will *not* have time to devote over the next 5 days, but after that I'll be on a long plane ride back to India, and so I will make sure it is fixed then. Stephen, could you send an example file and a short program that demonstrates the problem? That would make it even simpler for me to test that things are working as they should. Thanks, jas. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
