Stephen Collyer <[EMAIL PROTECTED]> writes:

> I've run into a pretty nasty problem with XML::Xerces
> when parsing in a CGI script using a SAX2 parser: in brief,
> the parser seems to ignore totally any character data in a
> document that contains UTF-8 characters.

Hi Stephen,

Yes, I think I know exactly what this is.

While preparing for the 2.6 release I stumbled upon an obvious bug in
the callback handlers - i.e. SAX2 character parsers. They are
transcoding into ASCII by default, and I have not provided a way to
override that, so everything will get tossed.

There is a reasonably simple fix for this, but it is in the C++ code,
not the Perl code, and it involves re-running SWIG. 

It is a *serious* problem, as serious as the memory leaks, and must be
fixed. I will *not* have time to devote over the next 5 days, but
after that I'll be on a long plane ride back to India, and so I will
make sure it is fixed then.

Stephen, could you send an example file and a short program that
demonstrates the problem? That would make it even simpler for me to
test that things are working as they should.

Thanks,
jas.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to