sorting feature for Arabic Kurdish
Hello, the Arabic sorting feature implemented in Windows XP is not sufficient for sorting Arabc Kurdish. Do you know a sorting feature for Arabic Kurdish? Where can I look for it?Because I'm not able to do it myself. Regards, Ernst Tremel
Re: Codes for Individual Chinese Brushstrokes
On Thu, 19 Feb 2004 18:27:09 -0800 (PST), Kenneth Whistler wrote: Of the 64 entities listed on the page: http://www.chinavoc.com/arts/calligraphy/eightstroke.asp *none* of them are encoded, and *none* of them are standard enough to merit consideration -- if by consideration you mean separate encoding as characters. I'm not sure about *none* of them are encoded. As far as I can tell, pretty much most of the basic ideographic stroke forms are either already encoded in CJK and CJK-B or are proposed in CJK-C (where encoded here means encoded in their own right or can be represented by same-shaped ideographs). See for example the IRG document http://www.cse.cuhk.edu.hk/~irg/irg/irg19/N927_Add%202%20Strokes%20to%20C1.doc which states : quote Although most ideographic strokes have been encoded in CJK (including Ext.A and B) or submitted to CJK_C1 by IRG members, there are two ideographic strokes are found missing. Ideographic strokes are important for ideograph decomposition, analysis and for making ideographic strokes subset. Chinese linguists suggest to add these two ideographic strokes to CJK_C1. /quote I also remember reading one WG2 document that explicitly raised the question of how to deal with all the ideographic strokes proposed in CJK-C that are not distinct ideographs in their own right, although I can't seem to locate that document any more. All except one of the eight basic strokes mentioned at http://www.chinavoc.com/arts/calligraphy/eightstroke.asp are *representable* using existing characters in the CJK and/or Kangxi Radicals blocks : dot = U+4E36 or U+2F02 [KANGXI RADICAL DOT] dash = U+4E00 or U+2F00 [KANGXI RADICAL ONE] perpendicular downstroke = U+4E28 or U+2F01 [KANGXI RADICAL LINE] downstroke to the left or left-falling stroke = U+4E3F or U+2F03 [KANGXI RADICAL SLASH] wavelike stroke or right-falling stroke = U+4E40 hook = U+4E85 or U+2F05 [KANGXI RADICAL HOOK], as well as U+4E5A and U+2010C upstroke to the right = bend or twist = U+4E5B and U+200CC I concur with Ken that the 8x8 stroke categorization given at this web site is largely artificial. Whilst it may be useful to encode general ideographic stroke forms to help in the analysis and decomposition of ideographs, in my opinion the minute distinctions in the way that dots and dashes are written in various individual ideographs are beyond the scope of a character encoding system as the exact shape of a dot or length of a dash is irrelevant to any analysis of the compositional structure of an ideograph. Andrew
Re: Codes for Individual Chinese Brushstrokes
As a native Chinese person. I believe 1. The so called eight basic stroke is very standard in concept. But that is only 8. 2. They list 8 different varients for each of the 8 basic stroke. But if you read that page carefully, it does not mean that there are only 8 variants for each stroke, neither mean people can distinguish those variants from each others. For example, most Chinese will think the first Dot from the left is the same as the fourth Dot from the left. And the differents between them are really style. Therefore, it is not a good idea to encode those variants 3. There are more composit strokes if you really want to encode strokes. For example: http://people.netscape.com/ftang/chineselearning/strokes/refglyph_003.gif http://people.netscape.com/ftang/chineselearning/strokes/refglyph_004.gif Andrew C. West wrote: On Thu, 19 Feb 2004 18:27:09 -0800 (PST), Kenneth Whistler wrote: Of the 64 entities listed on the page: http://www.chinavoc.com/arts/calligraphy/eightstroke.asp *none* of them are encoded, and *none* of them are standard enough to merit consideration -- if by consideration you mean separate encoding as characters. I'm not sure about *none* of them are encoded. As far as I can tell, pretty much most of the basic ideographic stroke forms are either already encoded in CJK and CJK-B or are proposed in CJK-C (where encoded here means encoded in their own right or can be represented by same-shaped ideographs). See for example the IRG document http://www.cse.cuhk.edu.hk/~irg/irg/irg19/N927_Add%202%20Strokes%20to%20C1.doc which states : quote Although most ideographic strokes have been encoded in CJK (including Ext.A and B) or submitted to CJK_C1 by IRG members, there are two ideographic strokes are found missing. Ideographic strokes are important for ideograph decomposition, analysis and for making ideographic strokes subset. Chinese linguists suggest to add these two ideographic strokes to CJK_C1. /quote I also remember reading one WG2 document that explicitly raised the question of how to deal with all the ideographic strokes proposed in CJK-C that are not distinct ideographs in their own right, although I can't seem to locate that document any more. All except one of the eight basic strokes mentioned at http://www.chinavoc.com/arts/calligraphy/eightstroke.asp are *representable* using existing characters in the CJK and/or Kangxi Radicals blocks : dot = U+4E36 or U+2F02 [KANGXI RADICAL DOT] dash = U+4E00 or U+2F00 [KANGXI RADICAL ONE] perpendicular downstroke = U+4E28 or U+2F01 [KANGXI RADICAL LINE] downstroke to the left or left-falling stroke = U+4E3F or U+2F03 [KANGXI RADICAL SLASH] wavelike stroke or right-falling stroke = U+4E40 hook = U+4E85 or U+2F05 [KANGXI RADICAL HOOK], as well as U+4E5A and U+2010C upstroke to the right = bend or twist = U+4E5B and U+200CC I concur with Ken that the 8x8 stroke categorization given at this web site is largely artificial. Whilst it may be useful to encode general ideographic stroke forms to help in the analysis and decomposition of ideographs, in my opinion the minute distinctions in the way that dots and dashes are written in various individual ideographs are beyond the scope of a character encoding system as the exact shape of a dot or length of a dash is irrelevant to any analysis of the compositional structure of an ideograph. Andrew
Re: Codes for Individual Chinese Brushstrokes
At 18:27 -0800 2004-02-19, Kenneth Whistler wrote: If you want to know how many stroke types there really are and how their forms are modified in context in various Chinese characters, you should consult with Tom Bishop and Richard Cook, who have an extensive catalog of basic stroke types and forms based on the usage of CDL in the Wenlin system for constructing Chinese character glyphs. I was only trying to clarify what I thought the question was, Ken. I did forward it to Richard, who has responded offline.to Tim. -- Michael Everson * * Everson Typography * * http://www.evertype.com
Mother Language Day
23 February is the fifth International Mother Language Day. See http://tinyurl.com/2fdzc for details. -- Verbogeny is one of the pleasurettesJohn Cowan [EMAIL PROTECTED] of a creatific thinkerizer. http://www.reutershealth.com -- Peter da Silvahttp://www.ccil.org/~cowan
FW: Web Form: Fonts display on Palm OS
Bob, I am forwarding your email to the Unicode public list http://www.unicode.org/consortium/distlist.html for a possible answer from one of the list's subscribers. --- Magda Danish Administrative Director The Unicode Consortium +1 650-693-3921 -Original Message- Date/Time:Fri Feb 20 15:27:42 EST 2004 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Hello and great site. In short - is there a way to modify or make my own 256 character set? I am working on displaying fonts on Palm OS with a program called iSilo. It appears that Palm uses an Ascii Chart very similar to that used by Windows (with a few PDA specific characters added). It also appears that Palm is limited to 256 characters. iSilo is a program that translates web pages into a Palm readable format. Greek letters will not display on the Palm if you use character set 1252 on none at all on your web page. If you set the character set to 1253 (Greek) and use Greek characters on your page, the default Latin characters in those Ascii positions will be displayed. In turn, if you use a custom font with Greek characters in those spots - you get Greek letters on your web page and your Palm. My problem is that I use arrows and other characters (such as the Male and Female symbols) not on any specific Character set I have found. I can make a custom font for these and the Greek letters, but need to make a custom character map. Is there a way of doing this? Thanks for your time. -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
Re: Mother Language Day
Thanks for this. The usual observance of IMLD is 21 February, but the more days the better! (Apparently a ceremony at UNESCO will be on 2/23.) I had trouble accessing the URL you provided. The UNESCO page is http://www.unesco.org/education/IMLD2004 . It would be great to get an ICT angle introduced in future observances, including of course, Unicode. Don Osborn Bisharat.net - Original Message - From: John Cowan [EMAIL PROTECTED] To: [EMAIL PROTECTED] Sent: Friday, February 20, 2004 9:45 PM Subject: Mother Language Day 23 February is the fifth International Mother Language Day. See http://tinyurl.com/2fdzc for details. -- Verbogeny is one of the pleasurettesJohn Cowan [EMAIL PROTECTED] of a creatific thinkerizer. http://www.reutershealth.com -- Peter da Silvahttp://www.ccil.org/~cowan
RE: Mother Language Day
(Hmm, in Russian mother language (maternij jazik) means something *verry* different. Watch your language! ;-) Joe