Re: FW: chinese conversion tables
At 11:21 AM -0700 5/1/01, Magda Danish (Unicode) wrote: -Original Message- From: Michal Gerling [mailto:[EMAIL PROTECTED]] Sent: Tuesday, May 01, 2001 7:24 AM To: '[EMAIL PROTECTED]' Subject: chinese conversion tables I am working with UNICODE and the CJK market and need to know: Is there any one table or formula for moving from simplified to traditional characters and back in UNICODE? thank you very much for your help! Michelle g. Partial data to interconvert between simplified and traditional characters is available through the Unihan database. However, the problem is not a simple one, as there are frequently multiple traditional forms that correspond to a single simplified form. Moreover, the vocabulary used in the PRC with simplified characters differs on occasion from the vocabulary used in Taiwan and elsewhere for traditional ones (e.g., the names of the chemical elements, until recently the word for computer). It really isn't possible to convert between simplified and traditional characters without doing a lexical analysis. -- = John H. Jenkins [EMAIL PROTECTED] [EMAIL PROTECTED] http://homepage.mac.com/jenkins/
RE: FW: chinese conversion tables
Hi Michal, Our company produces a product that addresses your problem, including all the issues mentioned by John Jenkins below. We call it our Chinese-to-Chinese Script Converter, or C2C for short. In particular it does not only code-point conversion but also orthographic and lexemic conversions, based on a set of cross-idiom dictionaries and word identification in streams of Chinese text. It is fully Unicode based internally, although conversion to and from other character sets is also supported. You can read more about it at http://www.basistech.com/products/Chinese-Converter.html, or contact me directly for more information. == Ted Peck Director of Product Management Basis Technology Corp. One Kendall Square Cambridge, MA 02139 tel: 617-386-7158 fax: 617-386-2021 [EMAIL PROTECTED] -Original Message- From: John H. Jenkins [mailto:[EMAIL PROTECTED]] Sent: Tuesday, May 01, 2001 2:54 PM To: Magda Danish (Unicode); [EMAIL PROTECTED] Subject: Re: FW: chinese conversion tables At 11:21 AM -0700 5/1/01, Magda Danish (Unicode) wrote: -Original Message- From: Michal Gerling [mailto:[EMAIL PROTECTED]] Sent: Tuesday, May 01, 2001 7:24 AM To: '[EMAIL PROTECTED]' Subject: chinese conversion tables I am working with UNICODE and the CJK market and need to know: Is there any one table or formula for moving from simplified to traditional characters and back in UNICODE? thank you very much for your help! Michelle g. Partial data to interconvert between simplified and traditional characters is available through the Unihan database. However, the problem is not a simple one, as there are frequently multiple traditional forms that correspond to a single simplified form. Moreover, the vocabulary used in the PRC with simplified characters differs on occasion from the vocabulary used in Taiwan and elsewhere for traditional ones (e.g., the names of the chemical elements, until recently the word for computer). It really isn't possible to convert between simplified and traditional characters without doing a lexical analysis. -- = John H. Jenkins [EMAIL PROTECTED] [EMAIL PROTECTED] http://homepage.mac.com/jenkins/
Re: FW: chinese conversion tables
On Tue, May 01, 2001 at 12:53:57PM -0600, John H. Jenkins wrote: I am working with UNICODE and the CJK market and need to know: Is there any one table or formula for moving from simplified to traditional characters and back in UNICODE? thank you very much for your help! Michelle g. Partial data to interconvert between simplified and traditional characters is available through the Unihan database. [...] You may want to look at zh-autoconvert (available at any Debian (www.debian.org) mirror). It probably uses different tables than the Unicode ones, because the initial version didn't support Unicode, but it's in actual use at a mailing list gateway at Debian. -- David Starner - [EMAIL PROTECTED] Pointless website: http://dvdeug.dhis.org I don't care if Bill personally has my name and reads my email and laughs at me. In fact, I'd be rather honored. - Joseph_Greg
Re: FW: chinese conversion tables
From: John H. Jenkins [EMAIL PROTECTED] It really isn't possible to convert between simplified and traditional characters without doing a lexical analysis. Word 2000/2002 both have a conversion utility that does the lexical analysis that John refers to here. Others will have to speak to how good it is, though. :-) I think there are also some other vendors who produce such tools? MichKa Michael Kaplan Trigeminal Software, Inc. http://www.trigeminal.com/