Quoting John Hudson [EMAIL PROTECTED]:
Amir, you are misunderstanding the nature of Unicode. Unicode is a
*character* encoding standard, and the glyphs in the charts are intended
only as a visual guide suggesting normative shapes for those characters. In
the case of Arabic, a single
At 22:57 5/12/2002, Amir Herman wrote:
If this is the case, why in Unicode it have Arabic Presentation A C to
present the final, medial, and initial form of Arabic characters?
Ah, this is an historical oddity. Character inclusion in Unicode is
governed by a number of principles, which are not
In chapter 15 of the Unicode specification is the statement that the Han
Radical-Stroke Index is available as a separate file. I have tried to find
it on the web site with no success. Is this file available on the web site
please?
William Overington
13 May 2002
I have been looking at the characters in the CJK Unified Ideographs
Extension B document. These are the characters from U+02 through to
U+02A6DF, which, as I understand it, are the rarer CJK characters.
I wonder if any of the people who read this list who understand the
languages involved
The 128 is from all 128 possible permutations of 0 and 1 as the seven least
significant bits of any high surrogate in the range U+DB80 through to
U+DBFF. The 1024 is from all possible permutations of the ten least
significant bits of any low surrogate in the range U+DC00 through to U+DFFF.
At 10:14 +0800 2002-05-13, Amir Herman wrote:
Dear Roozbeh,
I would strongly suggest that instead of correcting the U+6AC, we
add another glyph of 'GA' of letter 'KEHEH' (U+6A9) with dot above.
It is not 100% wrong of saying
that existing U+6AC represent 'GA' for old malay. Only that the
John Hudson [mailto:[EMAIL PROTECTED]]
Amir, you are misunderstanding the nature of Unicode. Unicode is a
*character* encoding standard, and the glyphs in the charts
are intended only as a visual guide suggesting normative shapes
for those characters.
On the other hand, we know that this
Doug Ewell wrote:
Michael Yau michael dot yau at oracle dot com wrote:
In Oracle9i, the NCHAR datatypes are exclusively Unicode datatypes
and supports both UTF-16 and UTF-8.
s/UTF-8/CESU-8/
update ORACLE set ENCODING='CESU-8' where ENCODING='UTF-8'
_ Marco
Take U+2A009 and U+2A00A as examples, I haven't seen anyone
use any of these in my life, yet from the radicals it seems
that both are names of some kind of birds.
Ping
William Overington wrote:
I have been looking at the characters in the CJK Unified Ideographs
Extension B document. These
On Mon, 13 May 2002, Amir Herman wrote:
As conclusion, I would say that we can still preserve the existing U+6AC
because it is not wrong, only the glyph is not standard and limited in
its use. Later on I might send some images to clarify my argument. The
task now is to add another glyph
- Original Message -
From: Markus Scherer [EMAIL PROTECTED]
To: Magda Danish (Unicode) [EMAIL PROTECTED]
Cc: unicode [EMAIL PROTECTED]
Sent: Friday, May 10, 2002 5:11 AM
Subject: Re: Useful Resources - Another round of spring cleaning
Google+ :
markus
Magda Danish (Unicode)
On Monday, May 13, 2002, at 04:02 AM, William Overington wrote:
In chapter 15 of the Unicode specification is the statement that the Han
Radical-Stroke Index is available as a separate file. I have tried to
find
it on the web site with no success. Is this file available on the web
site
On Monday, May 13, 2002, at 04:21 AM, William Overington wrote:
I have been looking at the characters in the CJK Unified Ideographs
Extension B document. These are the characters from U+02 through to
U+02A6DF, which, as I understand it, are the rarer CJK characters.
Actually, this is
On Mon, 13 May 2002, William Overington wrote:
I have been looking at the characters in the CJK Unified Ideographs
Extension B document. These are the characters from U+02 through to
U+02A6DF, which, as I understand it, are the rarer CJK characters.
I wonder if any of the people who
Rajesh,
The correct links are now posted at
http://www.unicode.org/unicode/onlinedat/resources.html
Magda.
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]
Sent: Monday, May 13, 2002 3:10 AM
To: Markus Scherer; Magda Danish (Unicode)
Cc: unicode
Subject: Re:
On Friday, May 10, 2002, at 06:29 PM, John Cowan wrote:
What is this about Qing taboo characters? Can someone point me to an
explanation (in English)? Thanks.
One source is Charles S. Gardner _Chinese Traditional Historiography_,
(Cambridge, Mass.: Harvard University Press, 1938, 2nd
At 04:03 5/13/2002, Marco Cimarosti wrote:
On the basis of this precedent, and on the basis of the fact that sample
glyphs on old copies Unicode book's will have a strong influence on font
designers for years, it may be wise in this case to leave the old
U+0643-like character alone and add a new
It may be helpful to know that USMARC is now known as MARC 21.
MARC 21 Specifications (including mapping tables to UCS/Unicode)
http://www.loc.gov/marc/specifications/spechome.html
Gary L. Smith
Software Architect
Database Quality Enrichment Department
OCLC
[EMAIL PROTECTED]
-Original
dear unicoder:
Could someone tell me the story about JIS x0213 and how could we encode
JIS x0213 ? Is there a spec about how to encode JIS x0213 into SJIS,
ISO-2022-JP or EUCJP ?
Thanks.
I have been reading the Unicode Normalization UTR and have a couple of
questions regarding it:
- will string comparison methods based on NFC and NFD always give the
same results?
- is it correct that methods based on NFKC and NFKD will give
different results from ones based on
Lars Marius Garshol scripsit:
- will string comparison methods based on NFC and NFD always give the
same results?
By intention, yes.
- is it correct that methods based on NFKC and NFKD will give
different results from ones based on NFC/NFD?
Yes.
- if NFC and NFD give the same
Hi Lars,
Some information below...
Addison
Addison P. Phillips
Globalization Architect
webMethods, Inc.
432 Lakeside Drive
Sunnyvale, California, USA
+1 408.962.5487 (phone)
+1 408.210.3569 (mobile)
-
Internationalization is an architecture.
It
22 matches
Mail list logo