Someone correct me if I'm wrong, but I believe that UTF-8 is a
variable-length ("multibyte") encoding of Unicode that represents each
ASCII character as a single byte and every other character as a
sequence of two to four bytes. This is a simplification; see the UTF-8
article on Wikipedia:
  http://en.wikipedia.org/wiki/UTF-8

The bottom line is that a UTF-8 character encoded in more than one byte
has no ASCII representation, whereas UTF-8 text that contains only ASCII
characters is byte-for-byte the same as its ASCII representation.
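
To make that concrete, here is a rough sketch in Python (not Unicon; it
is only meant to show the bytes). The helper name utf8_bytes and the
sample characters are my own choices for illustration.

```python
# Sketch: how many bytes UTF-8 uses for different characters.
def utf8_bytes(ch):
    """Return the UTF-8 encoding of one character as a list of byte values."""
    return list(ch.encode("utf-8"))

# Illustrative samples: 1-, 2-, 3- and 4-byte encodings.
samples = [
    "A",           # U+0041, plain ASCII
    "\u00e9",      # U+00E9, e with acute accent
    "\u20ac",      # U+20AC, euro sign
    "\U0001F600",  # U+1F600, an emoji outside the Basic Multilingual Plane
]
for ch in samples:
    print("U+%04X" % ord(ch), utf8_bytes(ch))

# Pure-ASCII text is byte-for-byte identical in both encodings.
text = "Unicon"
assert text.encode("utf-8") == text.encode("ascii")
```

Only the first sample comes out as a single byte below 128; every byte
of the other three is 128 or above, so none of them can be mistaken for
an ASCII character.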

Perhaps you are hoping to filter out (discard) the multi-byte, non-ASCII
characters.  Is that what you want to do?
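
If discarding is what you want, the idea can be sketched as follows.
This is Python rather than Unicon, and strip_non_ascii is a hypothetical
name of mine, not a library function. It relies on the fact that every
byte of a multi-byte UTF-8 sequence has its high bit set.

```python
# Sketch: drop every non-ASCII character from UTF-8 encoded data.
def strip_non_ascii(data: bytes) -> bytes:
    """Keep only bytes below 0x80.

    Lead and continuation bytes of multi-byte UTF-8 sequences are all
    >= 0x80, so this removes whole non-ASCII characters and nothing else.
    """
    return bytes(b for b in data if b < 0x80)

# Assumed sample input: "naive cafe" written with two accented letters.
utf8_input = "na\u00efve caf\u00e9".encode("utf-8")
print(strip_non_ascii(utf8_input).decode("ascii"))  # prints "nave caf"
```

Note that this loses information rather than converting it: the
accented letters vanish instead of turning into "i" and "e". The same
byte test should carry over to Icon/Unicon, whose strings are sequences
of 8-bit characters, though I have not written that version here.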

On Sat, 2006-06-03 at 11:39 +0800, LU Song wrote:
> Dear Jeffery,
> 
> I tried to find the way in Unicon/Icon to transfer utf-8 to ascii.
> But I can't.
> 
> Do you know if there is a function to deal with it?
> 
> Thanks.
> 
> Song



_______________________________________________
Unicon-group mailing list
Unicon-group@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/unicon-group
