On 4/27/07, Jeffrey Yasskin <[EMAIL PROTECTED]> wrote:
> On 4/18/07, Jim Jewett <[EMAIL PROTECTED]> wrote:

> > Agreed.  But there aren't 40K (alphabetic) letters in any particular
> > locale.  Most individual languages will have less than 100.

> Here's a relevant bunch of data from the CLDR:
> http://www.unicode.org/cldr/data/charts/by_type/misc.exemplarCharacters.html

http://www.unicode.org/Public/UNIDATA/Scripts.txt is also relevant,
but I can't quite interpret it.
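
For what it's worth, the data lines in that file seem to follow the layout "code point or range ; script name # comment". Here is a minimal sketch of tallying code points per script under that assumption. The sample lines are a hand-written excerpt in the file's layout, not the full file, and count_by_script is a hypothetical helper name.

```python
from collections import Counter

# Illustrative lines in the Scripts.txt layout (an excerpt, not the full file).
SAMPLE = """\
# comment lines and blank lines are skipped

0020          ; Common # Zs       SPACE
0030..0039    ; Common # Nd  [10] DIGIT ZERO..DIGIT NINE
0041..005A    ; Latin # L&  [26] LATIN CAPITAL LETTER A..LATIN CAPITAL LETTER Z
0061..007A    ; Latin # L&  [26] LATIN SMALL LETTER A..LATIN SMALL LETTER Z
"""


def count_by_script(text):
    """Return a Counter mapping script name -> number of code points."""
    counts = Counter()
    for line in text.splitlines():
        # Drop the trailing comment, then skip lines with no data left.
        data = line.split("#", 1)[0].strip()
        if not data:
            continue
        code_range, script = (field.strip() for field in data.split(";"))
        # A line holds either a single code point or an inclusive "first..last" range.
        first, _, last = code_range.partition("..")
        start = int(first, 16)
        end = int(last, 16) if last else start
        counts[script] += end - start + 1
    return counts


counts = count_by_script(SAMPLE)
print(counts["Latin"], counts["Common"])  # 52 11 for the excerpt above
```

Feeding the same function the real file's text should give the per-script totals, provided the full file sticks to this layout.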

There are 5020 "Common" code points.  These are mostly non-letters,
but I suppose they could appear in some languages.

Latin script has 1070 characters; most Latin-script languages use only
a small fraction of them.  The standard ASCII alphabet is still only
26 lower + 26 capital, but there are plenty of characters that get
used in some language or other.  (The largest single contiguous range
is the 208 letters from LATIN CAPITAL LETTER DZ WITH CARON to LATIN
SMALL LETTER EZH WITH CURL.)
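
A rough cross-check is possible from Python itself. The unicodedata module does not expose the Script property, so the sketch below assumes that "letter whose character name starts with LATIN" is a usable stand-in. It is only an approximation: it will miss Latin-script characters with other names (fullwidth forms, for example), and the totals depend on the Unicode version bundled with the interpreter, so they need not match the figures above. The names is_latin_letter and longest_run are hypothetical helpers.

```python
import sys
import unicodedata


def is_latin_letter(cp):
    """Approximation: general category L*, and a name starting with 'LATIN '."""
    ch = chr(cp)
    return (unicodedata.category(ch).startswith("L")
            and unicodedata.name(ch, "").startswith("LATIN "))


def longest_run(predicate, stop=sys.maxunicode + 1):
    """Return (first code point, length) of the longest run satisfying predicate."""
    best_start, best_len = None, 0
    run_start = None
    # One extra iteration past the end closes a run that reaches the last code point.
    for cp in range(stop + 1):
        inside = cp < stop and predicate(cp)
        if inside and run_start is None:
            run_start = cp
        elif not inside and run_start is not None:
            if cp - run_start > best_len:
                best_start, best_len = run_start, cp - run_start
            run_start = None
    return best_start, best_len


total = sum(1 for cp in range(sys.maxunicode + 1) if is_latin_letter(cp))
start, length = longest_run(is_latin_letter)
print("Latin-named letters:", total)
print("longest run: %d code points starting at U+%04X" % (length, start))
```

The run found this way will probably be longer than 208. As far as I can tell, Scripts.txt splits its range lines by general category, so one line in that file is not necessarily a maximal run of Latin letters.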

-jJ
_______________________________________________
Python-3000 mailing list
[email protected]
http://mail.python.org/mailman/listinfo/python-3000