Forwarded to the Unicode mailing list.

From: ernestvandenbooga...@hotmail.com
To: jsb...@mimuw.edu.pl
Subject: RE: statistics
Date: Tue, 12 Oct 2010 10:13:17 +0200

In Unicode 5.2, Section 2.4, Table 2-3 lists which General Categories count as 
"characters". Excluded are Surrogates, Private Use, Noncharacters and Reserved 
code points. Note that Format characters (Cf) are included as characters. The 
code points in C0 and C1 that have formatting aspects are Controls (Cc), so 
they are excluded.

The total number of characters in 6.0 is therefore 109,242 (graphic) + 142 
(format) = 109,384.
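
As a rough cross-check, the same tally can be sketched in Python with the 
standard unicodedata module. This is only a sketch under assumptions: the 
result reflects whatever Unicode version the Python build ships 
(unicodedata.unidata_version), which is probably not 6.0, and the names 
EXCLUDED, is_counted and category_totals are made up for illustration. Note 
that unicodedata reports both noncharacters and reserved code points as Cn.

```python
import sys
import unicodedata
from collections import Counter

# Assumption: General Categories left out of the "character" count, following
# the reading above: Controls (Cc), Surrogates (Cs), Private Use (Co), and
# Unassigned (Cn, which covers both reserved code points and noncharacters).
EXCLUDED = {"Cc", "Cs", "Co", "Cn"}


def is_counted(cp: int) -> bool:
    """Return True if code point cp counts as a character under this reading."""
    return unicodedata.category(chr(cp)) not in EXCLUDED


def category_totals() -> Counter:
    """Tally every code point U+0000..U+10FFFF by its General Category."""
    return Counter(
        unicodedata.category(chr(cp)) for cp in range(sys.maxunicode + 1)
    )


totals = category_totals()

# Format characters (Cf) are counted, but reported separately from the rest.
format_chars = totals["Cf"]
not_graphic = EXCLUDED | {"Cf"}
graphic_chars = sum(n for cat, n in totals.items() if cat not in not_graphic)

print("Unicode data version:", unicodedata.unidata_version)
print("graphic:", graphic_chars)
print("format:", format_chars)
print("total:", graphic_chars + format_chars)
```

Run against Unicode 6.0 data, the last three lines should match the figures 
above; newer data will give larger numbers.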

Regards,
Ernest van den Boogaard

> From: jsb...@mimuw.edu.pl
> To: asm...@ix.netcom.com
> CC: unicode@unicode.org
> Subject: Re: statistics
> Date: Tue, 12 Oct 2010 09:14:21 +0200
> 
> On Mon, 11 Oct 2010  Asmus Freytag <asm...@ix.netcom.com> wrote:
> 
> >   On 10/11/2010 9:49 PM, Janusz S. "Bień" wrote:
> >> On Mon, 11 Oct 2010  announceme...@unicode.org wrote:
> >>
> >>>   The newly finalized Unicode Version 6.0 adds 2,088 characters,
> >> What is the current total? Are other statistics available
> >> somewhere?
> > The announcement gives a link to click through.
> >
> > There you will find more statistics.
> 
> I guess you mean "Character Assignment Overview" at
> 
>   http://www.unicode.org/versions/Unicode6.0.0/
> 
> However, it does not provide a precise answer to my primary question,
> which is not purely arithmetic but depends on the definition of a
> character. In particular, do noncharacters count as characters?
> 
> Regards
> 
> JSB
> 
> -- 
>                      ,   
> dr hab. Janusz S. Bien, prof. UW -  Uniwersytet Warszawski (Katedra 
> Lingwistyki Formalnej)
> Prof. Janusz S. Bien - Warsaw University (Department of Formal Linguistics)
> jsb...@uw.edu.pl, jsb...@mimuw.edu.pl, http://fleksem.klf.uw.edu.pl/~jsbien/
> 
> 