Are you tring to recognized it by eyes or in your program? If the webpage is in unicode, it's hard to say. The bad thing is, unlike the "La", "The", "Die" in European languages, the most frequent ideographs in both Chinese text form are almost the same. Perhaps the ideograph for the meaning "for" (wei in Mandarin pinyin) is the most significant recognizable one. The traditional one 70BA looks like: http://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=70BA The simplified one 4E3A looks like: http://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=4E3A
And the most common measure word (a bit like the article "a" in English) is different. The traditional 500B http://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=500B The simplified 4E2A http://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=4E2A If the webpage isn't in unicode, a simple rule is most traditional Chinese webpages are coded "Big5", most simplified Chinese webpages are coded "GB2312" or "GB18030". Anyway, if you find a chunk of Chinese text looks complex, it is likely to be traditional. ================= Zhang Weiwu from Xiamen China ----- Original Message ----- From: "Paul Hastings" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Thursday, February 13, 2003 7:35 PM Subject: traditional vs simplified chinese > i suppose this is a really simple minded question but is there any way of > telling if an incoming chunk of text (say from a browser form) is > traditional or simplified chinese? > > thanks. > ---------------------------------------------------- > Paul Hastings [EMAIL PROTECTED] > Director Environmental Information Center > Thailand Environment Institute > Member Team Macromedia (Allaire) > http://www.tei.or.th/eic --------------------------- >