On Tue, Dec 13, 2005 at 03:55:11PM +0530, Arun S K (RBIN/EDM3) * wrote:
> <?xml version="1.0" encoding="UTF8"?>
>
> The document has the character ß (Beeta) in it. The parser aborts with the
> following message
> --------------------------------------------------------------------
> :13: parser error : Input is not proper UTF-8, indicate encoding !
> Bytes: 0x80 0x20 0x3C 0x2F
> <NAME>test_1ß</NAME>
> --------------------------------------------------------------------
>
> Is ß not a valid UTF8 character?
The character is part of Unicode, but the sequence of bytes used to
express it is not valid UTF-8. That is a fatal XML error.
> How can this be corrected.
By replacing the wrong bytes in the instance with a sequence that is
valid UTF-8.
Read the material pointed to at the beginning of
http://xmlsoft.org/encoding.html
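
As a rough illustration of the fix described above (not part of libxml2),
here is a minimal Python sketch that finds the first byte breaking UTF-8
decoding and re-encodes data from a known legacy encoding. The names
first_invalid_utf8 and transcode are hypothetical, the sample bytes only
mirror the error report, and ISO-8859-1 as the source encoding is an
assumption: the real encoding of the file is not stated in this thread.

```python
def first_invalid_utf8(data: bytes):
    """Return (offset, byte) of the first byte that breaks UTF-8 decoding, or None."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        return err.start, data[err.start]
    return None


def transcode(data: bytes, source_encoding: str) -> bytes:
    """Re-encode bytes from a known source encoding into UTF-8."""
    return data.decode(source_encoding).encode("utf-8")


# A correctly encoded element: U+00DF (ß) is the two bytes 0xC3 0x9F in UTF-8.
good = "<NAME>test_1ß</NAME>".encode("utf-8")

# Sample input mirroring the reported bytes 0x80 0x20 0x3C 0x2F:
# a lone 0x80 can never start a UTF-8 sequence.
bad = b"<NAME>test_1\x80 </NAME>"

print(first_invalid_utf8(good))
print(first_invalid_utf8(bad))

# Assumption for illustration only: the data was really ISO-8859-1,
# where ß is the single byte 0xDF.
print(transcode(b"<NAME>test_1\xdf</NAME>", "iso-8859-1"))
```

Once the offending offset is known, the bytes there can be replaced by
hand, or the whole file can be converted if its actual encoding is known.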
Daniel
--
Daniel Veillard | Red Hat http://redhat.com/
[EMAIL PROTECTED] | libxml GNOME XML XSLT toolkit http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/