Sorry, pasting the Chinese text into email wasn't very clever. I've attached a 
txt file with the Chinese saved as utf-8.

John


-----Original Message-----
From: Ronald Oussoren [mailto:ronaldousso...@mac.com]
Sent: Mon 5/25/2009 11:30 AM
To: John Newman
Cc: pythonmac-sig@python.org
Subject: Re: [Pythonmac-SIG] Chinese glyphs in Python 3
 

On 25 May, 2009, at 16:36, John Newman wrote:


        Ronald
        
        See the attachment, where most of the Chinese glyphs are correctly 
displayed, but not all.
        
        When I copy and paste these glyphs from Python 3 to JEdit, all the 
glyphs display correctly, so the correct unicode code points are there 
underlying the glyphs. I've tried most of the fonts available to me in the GUI 
and I still can't get them all to display properly.
        

Could you post the actual text-file as well? 

Ronald

        
        John
        
        
        -----Original Message-----
        From: Ronald Oussoren [mailto:ronaldousso...@mac.com]
        Sent: Sun 5/24/2009 11:48 PM
        To: John Newman
        Cc: pythonmac-sig@python.org
        Subject: Re: [Pythonmac-SIG] Chinese glyphs in Python 3
        
        John,
        
        On 25 May, 2009, at 1:34, John Newman wrote:
        
        
                Apologies if my question is ridiculously trivial - I'm not a 
developer, just a relatively new user of Python 3.0.1 on Mac OS 10.4.11. and I 
have only just now joined the list.
               
                When I read in a Chinese text in the Python 3 IDLE GUI on 
Windows XP I see all the Chinese glyphs displayed properly. On my Mac, about 
one third of the Chinese characters are not displayed correctly (just empty or 
black boxes in place of glyphs). It doesn't matter whether I save/open texts as 
utf-8, utf-16, etc. [The glyphs display fine in TextEdit, JEdit, Word etc.]  
And the same kind of problem occurs when I scroll through the list of font 
names in Preferences in IDLE: the names in Chinese glyphs have a number of 
white or black boxes instead of the glyphs.
               
                I just assume that this has something to do with the locale 
settings in IDLE? On my machine:
               
                >>> locale.getpreferredencoding()
                'mac-roman'
                >>> locale.getlocale()
                (None, None)
               
                In my Windows XP, the locale settings are (English, '1252') and 
I presume that this difference is relevant to understanding the different 
effects I get opening Chinese texts in my Windows XP and my Mac. 'mac-roman' 
would not be my natural choice of encoding if I am looking at Chinese text! I 
need an encoding which can handle the range of glyphs we find in GB 18030, say.
               
                Am I being naive in thinking that all I have to do in Python is 
somehow change the locale settings in some way which will display Chnese 
glyphs?  I'm at a loss to know what I should do in order to display Chinese 
glyphs properly on the Mac. I tried experimenting with "setlocale" but couldn't 
make progress.
               
                Any suggestions would be very welcome.
               
        
        
        This might be a font issue, although the default font (Courier) seems 
to be capable of displaying unicode text and therefore saving as UTF-8 should 
work.  Another possible souce for this problem is the GUI framework used by 
IDLE.
        
         Could you post an example of a file that shows the problem?
        
        Ronald
        
        
        
        

        <glyphs.jpg>




美       国   当   选 总 统 巴 拉 克 · 奥 巴 马 , 稍 早 与 
家 人 一 起 正 式 登 上 列 车 离 开 费 城 ( P h i l a d e l p h i 
a ) , 追 寻 废 奴 总 统 林 肯 当 年 前 往 赴 任 时 的 足 迹 
前 往 华 盛 顿 特 区 。 
  奥 巴 马 1 月 2 0 日 即 将 在 华 盛 顿 特 区 宣 誓 就 职 为 
美 国 第 4 4 任 总 统 。 
 自 林 肯 总 统 宣 布 废 奴 迄 今 近 1 5 0 年 , 美 国 终 于 
选 出 第 一 任 黑 人 总 统 , 因 此 纪 念 意 义 非 常 。 
_______________________________________________
Pythonmac-SIG maillist  -  Pythonmac-SIG@python.org
http://mail.python.org/mailman/listinfo/pythonmac-sig

Reply via email to