On 4/27/07, Jeffrey Yasskin <[EMAIL PROTECTED]> wrote: > On 4/18/07, Jim Jewett <[EMAIL PROTECTED]> wrote:
> > Agreed. But there aren't 40K (alphabetic) letters in any particular > > locale. Most individual languages will have less than 100. > Here's a relevant bunch of data from the CLDR: > http://www.unicode.org/cldr/data/charts/by_type/misc.exemplarCharacters.html http://www.unicode.org/Public/UNIDATA/Scripts.txt is also relevant, but I can't quite interpret it. There are 5020 "Common" code points. These are mostly non-letters, but I suppose they could appear in some langauges. Latin script has 1070 characters; most Latin-script languages use only a small fraction of them. The standard ASCII alphabet is still only 26 lower + 26 capital, but there are plenty of characters that get used in some language or other. (The largest single block is 208 letters from LATIN CAPITAL LETTER DZ WITH CARON to LATIN SMALL LETTER EZH WITH CURL) -jJ _______________________________________________ Python-3000 mailing list [email protected] http://mail.python.org/mailman/listinfo/python-3000 Unsubscribe: http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com
