* Aron Stansvik wrote:
>On 5/25/06, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
>> <?xml version="1.0" encoding="ISO-8859-1"?>
>> <rss version="2.0">
>>    <channel>
>>       <title>Aftonbladet &#246;jesliv</title>
>>    </channel>
>> </rss>
>>
>> I try to extract the title element from the above. But the encoding is not
>> recognised. What i get is this:
>> Aftonbladet öjesliv
>
>What do you mean the encoding is not recognized? That looks like a
>perfectly valid result. &#246; is U+00F6 LATIN SMALL LETTER O WITH
>DIAERESIS.

This appears to be a defect in your mail user agent. The message you
responded to was ISO-8859-1 encoded and had the o-umlaut encoded as two
octets (C3 B6, which is the proper UTF-8 sequence). The original problem
appears to be the usual "API gives UTF-8 but I expect something else".
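
To make the byte-level picture concrete, here is a minimal sketch in Python.
It uses the standard library's xml.etree parser as a stand-in, which is an
assumption on my part (the original poster's code was not shown and
presumably used libxml2 directly). It only illustrates how U+00F6 comes out
as C3 B6 in UTF-8, and what those two octets look like when read as
ISO-8859-1.

```python
import xml.etree.ElementTree as ET

# The feed from the original message, declared as ISO-8859-1.
feed = (
    b'<?xml version="1.0" encoding="ISO-8859-1"?>\n'
    b'<rss version="2.0"><channel>'
    b'<title>Aftonbladet &#246;jesliv</title>'
    b'</channel></rss>'
)

# The parser resolves &#246; to the character U+00F6, independent of
# the encoding named in the XML declaration.
title = ET.fromstring(feed).findtext("channel/title")

# Serialised as UTF-8, U+00F6 becomes the two octets C3 B6.
utf8_bytes = title.encode("utf-8")

# Reading those UTF-8 octets as if they were ISO-8859-1 yields two
# characters (U+00C3, U+00B6) instead of one.
misread = utf8_bytes.decode("iso-8859-1")

# Converting explicitly to ISO-8859-1 gives the single octet F6.
latin1_bytes = title.encode("iso-8859-1")

print(utf8_bytes[-8:].hex())    # hex of the UTF-8 octets for the last word
print(latin1_bytes[-7:].hex())  # hex of the ISO-8859-1 octets for the last word
```

If the consuming code expects ISO-8859-1, the usual remedy is an explicit
conversion from UTF-8 at the boundary, rather than treating the parser
output as already being in the document's declared encoding.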
-- 
Björn Höhrmann · mailto:[EMAIL PROTECTED] · http://bjoern.hoehrmann.de
Weinh. Str. 22 · Telefon: +49(0)621/4309674 · http://www.bjoernsworld.de
68309 Mannheim · PGP Pub. KeyID: 0xA4357E78 · http://www.websitedev.de/ 
_______________________________________________
xml mailing list, project page  http://xmlsoft.org/
[email protected]
http://mail.gnome.org/mailman/listinfo/xml
