There's still my unanswered question about the third numeric field not filled for some numeric characters (notably Nl characters, i.e. number letters).
I accepted the fact of being unable to define it for the "numerator one less than the denominator", but the Latin Roman number 900 has NO defined numeric value, and I don't see why. I would accept a rationale based on contextual meaning of the number, where its actual value changed between sources, but I don't think that the Roman 900 number letter has another possible value than 900. As the first reason why scripts have been standardized, learned and tought during the history is its use for accounting purpose, I doubt that merchants would have accepted an ambiguous meaning of these characters. If this ever occured in some local cultures, which brought a foreign glyph in their script, the use of the glyph creates a new abstract character that merits another name in Unicode and other properties. So I suggest you load the UCD in any spreadsheet, sort it on the general category column, and look at the numeric characters (third column starting by N): - all "Nd" characters should have their 3 numeric defined equally between 0 and 9, - all "Ni" characters should have only their last two fields set equally with a simple integer, and - all "Nl" characters should have something set in the third field only (except possibly for the "numerator one less than the denominator" character, which could have its own "No" category for "Numeric, other".) Philippe. -----Message d'origine----- De : [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] la part de Mark Davis Envoyà : mardi 25 novembre 2003 20:10 à : Arcane Jill; [EMAIL PROTECTED] Objet : Re: numeric properties of Nl characters in the UCD The fields are the way they are for backwards compatibility. If you look at the UCD.html, you will see that the actual properties are separated: http://www.unicode.org/Public/UNIDATA/UCD.html#Numeric_Type I'd like to remind people again that you should read the documentation in UCD.html before trying to make sense of the raw data files. Mark __________________________________ http://www.macchiato.com ? ????????????????????? ? ----- Original Message ----- From: Arcane Jill To: [EMAIL PROTECTED] Sent: Tue, 2003 Nov 25 02:42 Subject: RE: numeric properties of Nl characters in the UCD Actually, I don't understand why UnicodeData.txt has no less than three different fields for numerical value anyway. I mean, it's not as though there exists EVEN A SINGLE CODEPOINT for which two or more of these fields exist and are defined differently from each other. One never sees, for example, a character for which "digit value" is 3 and "numeric value" is 4. It seems to me that one single numeric field would suffice. You may need a second field to establish what "kind" of number this is (decimal digit, whatever), but then maybe you could figure that out from the general category anyway. Jill > -----Original Message----- > From: Philippe Verdy [mailto:[EMAIL PROTECTED] > Sent: Sunday, November 23, 2003 2:58 AM > To: [EMAIL PROTECTED] > Subject: numeric properties of Nl characters in the UCD > > > I do understand why number letter characters with "Nl" > general category > don't have a "decimal value" property or a "integer value" > property, but why > they don't all have a "numeric value" property in the UCD. __________________________________________________________________ << ella for Spam Control >> has removed Spam messages and set aside Newsletters for me You can use it too - and it's FREE! http://www.ellaforspam.com
<<attachment: winmail.dat>>