Re: OT Re: [WSG] UTF-8

2005-04-19 Thread Jan Brasna
> Yes, but the bytes used in ISO 8859-1 (Latin1) or Windows code page and
> those used for UTF-8 are different.
Sure, but the mail came in Latin1 (see the headers), so I just wanted
to point out that it won't show the difference.
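
For anyone who wants to check the headers without reading them by eye, here is a minimal sketch using Python's standard email module. The raw message below is a made-up stand-in, not the actual mail from this thread, and the variable names are only illustrative.

```python
from email import message_from_string

# Hypothetical raw message; the real mail's headers are not reproduced here.
raw = (
    "Subject: OT Re: [WSG] UTF-8\n"
    "MIME-Version: 1.0\n"
    "Content-Type: text/plain; charset=ISO-8859-1\n"
    "Content-Transfer-Encoding: 8bit\n"
    "\n"
    "body text\n"
)

msg = message_from_string(raw)

# get_content_charset() returns the charset parameter of the
# Content-Type header, lowercased, or None if it is absent.
charset = msg.get_content_charset()
print(charset)
```

If this reports a Latin1 charset, the message body was sent as single bytes per character, so it says nothing about whether the list software copes with UTF-8.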

--
Jan Brasna aka JohnyB :: www.alphanumeric.cz | www.janbrasna.com
**
The discussion list for  http://webstandardsgroup.org/
See http://webstandardsgroup.org/mail/guidelines.cfm
for some hints on posting to the list & getting help
**


RE: OT Re: [WSG] UTF-8

2005-04-19 Thread Richard Ishida
> -----Original Message-----
> From: [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] On Behalf Of Jan Brasna
> Sent: 19 April 2005 17:29
> To: wsg@webstandardsgroup.org
> Subject: OT Re: [WSG] UTF-8
> 
> > PPS. This is a good test to see if the WSG mail system can handle 
> > UTF-8
> 
> AFAIK "å" is a Latin1 character (Scandinavian), so no need for UTF here.
> 

Yes, but the bytes used in ISO 8859-1 (Latin1) or Windows code page and
those used for UTF-8 are different.  In Latin1 encoding å is a single byte:
E5; whereas UTF-8 represents this as two bytes: C3 A5.  So the fact that you
are seeing it indicates that the system recognised the Unicode encoding :-)
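
The byte values above can be checked with a few lines of Python. This is only a rough sketch: hex_bytes is a helper name made up for the illustration, and the last step shows what a reader would get if the two UTF-8 bytes were wrongly treated as Latin1.

```python
def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated upper-case hex (illustrative helper)."""
    return " ".join(f"{b:02X}" for b in data)

ch = "\u00e5"  # the character å, U+00E5

latin1_bytes = ch.encode("latin-1")  # one byte in ISO 8859-1
utf8_bytes = ch.encode("utf-8")      # two bytes in UTF-8

print(hex_bytes(latin1_bytes))  # E5
print(hex_bytes(utf8_bytes))    # C3 A5

# Misreading the UTF-8 bytes as Latin1 yields two characters instead of one:
# U+00C3 (A with tilde) followed by U+00A5 (yen sign).
misread = utf8_bytes.decode("latin-1")
print([f"U+{ord(c):04X}" for c in misread])
```

So if the mail had been sent as UTF-8 but displayed as Latin1, the å would show up as two odd characters rather than one.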

RI


PS: You may find my Unicode converter a useful play tool for this kind of
thing.  It's a bit rough and ready, but it's useful.
http://people.w3.org/rishida/scripts/uniview/conversion.en.html


Richard Ishida
W3C

contact info:
http://www.w3.org/People/Ishida/ 

W3C Internationalization:
http://www.w3.org/International/ 

Publication blog:
http://people.w3.org/rishida/blog/
 
 
 
