I believe topics related to Chinese text processing have been discussed here before, but I could not find the information I need in the mailing list archive.
My Python script reads some Chinese text and then splits each line on whitespace. I get lists like ['\xbc\xc7\xd5\xdf', '\xd0\xbb\xbd\xf0\xbb\xa2', '\xa1\xa2']. I have #-*- coding: gbk -*- at the top of the script. My Windows 2000 system's default language is Chinese (GB2312), and it displays Chinese perfectly. I don't know how to configure Python, or what else I need, to properly process such two-byte-character text. Thanks.
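For reference, the escaped bytes in that list are what Python shows when it prints the repr of a list of byte strings; the data itself is not necessarily damaged. A minimal sketch of one way to handle it, assuming Python 3 syntax and assuming the input really is GBK-encoded: decode the bytes to text first, then split. The names raw_line and split_gbk_line are hypothetical, chosen only for this illustration.

```python
# Raw GBK-encoded bytes, matching the three items in the list above
# (joined here with single spaces as a stand-in for one input line).
raw_line = b'\xbc\xc7\xd5\xdf \xd0\xbb\xbd\xf0\xbb\xa2 \xa1\xa2'


def split_gbk_line(raw, encoding='gbk'):
    """Decode a byte string to text, then split it on whitespace.

    The 'gbk' default is an assumption based on the coding line in the
    script; pass a different codec name if the file uses another encoding.
    """
    return raw.decode(encoding).split()


words = split_gbk_line(raw_line)

# Each item is now a text string measured in characters, not bytes.
lengths = [len(w) for w in words]

# Encoding back to GBK reproduces the original byte sequences.
round_trip = [w.encode('gbk') for w in words]

for w in words:
    print(w)  # prints the characters themselves, given a capable console
```

Printing each item individually, rather than printing the whole list, is what makes the characters appear instead of the \x escapes.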