Thanks for the reply. Are you sure those characters don't exist n
shift-jis?  Please take a look at the attached text file. It contains
two characters ("1" in a circle and "2" in a circle). The file is in
shift-jis encoding.

Thanks,
Jianyang

-----Original Message-----
From: John Delacour [mailto:[EMAIL PROTECTED] 
Sent: Friday, July 07, 2006 6:24 AM
To: Jianyang Tai; perl-unicode@perl.org
Subject: Re: Problem with Encode module

At 10:31 am -0700 23/6/06, Jianyang Tai wrote:

>I encountered some problem with the Encode module when I convert some 
>Japanese contents from shift-jis to utf-8. Basically I am using the 
>from_to subroutine to do the job. All work well except for those number

>inside a circle characters (8740 ~ 8754). The unicode range for those 
>characters is 2460 ~ 2473. However, the from_to doesn't convert them 
>correctly. For 8740 (1 inside a little circle), what I got was "FFFD 
>0040".
>
>Does anyone have any idea what the problem is? Is this a known issue or

>there is something wrong with the original shift-jis text? Any advise 
>is very appreciated.

Those characters do not exist in shift-jis but only in GB18030 and in
the MacOS Japanese, Korean and Chinese (both) character sets.

JD

Reply via email to