sorting feature for Arabic Kurdish

2004-02-20 Thread Ernst Tremel



Hello,

the Arabic sorting feature implemented in Windows 
XP is not sufficient for sorting Arabc Kurdish.
Do you know a sorting feature for Arabic Kurdish? Where can I look for 
it?Because I'm not able to do it myself.

Regards,

Ernst Tremel



Re: Codes for Individual Chinese Brushstrokes

2004-02-20 Thread Andrew C. West
On Thu, 19 Feb 2004 18:27:09 -0800 (PST), Kenneth Whistler wrote:
 
 Of the 64 entities listed on the page:
 
 http://www.chinavoc.com/arts/calligraphy/eightstroke.asp
 
 *none* of them are encoded, and *none* of them are standard
 enough to merit consideration -- if by consideration you mean
 separate encoding as characters.
 

I'm not sure about *none* of them are encoded. As far as I can tell, pretty
much most of the basic ideographic stroke forms are either already encoded in
CJK and CJK-B or are proposed in CJK-C (where encoded here means encoded in
their own right or can be represented by same-shaped ideographs).

See for example the IRG document
http://www.cse.cuhk.edu.hk/~irg/irg/irg19/N927_Add%202%20Strokes%20to%20C1.doc
which states :

quote
Although most ideographic strokes have been encoded in CJK (including Ext.A and
B) or submitted to CJK_C1 by IRG members, there are two ideographic strokes are
found missing. Ideographic strokes are important for ideograph decomposition,
analysis and for making ideographic strokes subset. Chinese linguists suggest to
add these two ideographic strokes to CJK_C1.
/quote

I also remember reading one WG2 document that explicitly raised the question of
how to deal with all the ideographic strokes proposed in CJK-C that are not
distinct ideographs in their own right, although I can't seem to locate that
document any more.

All except one of the eight basic strokes mentioned at
http://www.chinavoc.com/arts/calligraphy/eightstroke.asp are *representable*
using existing characters in the CJK and/or Kangxi Radicals blocks :

dot = U+4E36 or U+2F02 [KANGXI RADICAL DOT]
dash = U+4E00 or U+2F00 [KANGXI RADICAL ONE]
perpendicular downstroke = U+4E28 or U+2F01 [KANGXI RADICAL LINE]
downstroke to the left or left-falling stroke = U+4E3F or U+2F03 [KANGXI RADICAL
SLASH]
wavelike stroke or right-falling stroke = U+4E40
hook = U+4E85 or U+2F05 [KANGXI RADICAL HOOK], as well as U+4E5A and U+2010C
upstroke to the right = 
bend or twist = U+4E5B and U+200CC

I concur with Ken that the 8x8 stroke categorization given at this web site is
largely artificial. Whilst it may be useful to encode general ideographic stroke
forms to help in the analysis and decomposition of ideographs, in my opinion the
minute distinctions in the way that dots and dashes are written in various
individual ideographs are beyond the scope of a character encoding system as the
exact shape of a dot or length of a dash is irrelevant to any analysis of the
compositional structure of an ideograph.

Andrew



Re: Codes for Individual Chinese Brushstrokes

2004-02-20 Thread Frank Yung-Fong Tang

As a native Chinese person. I believe
1. The so called eight basic stroke is very standard in concept.
But that is only 8.
2. They list 8 different varients for each of the 8 basic stroke. But 
if you read that page carefully, it does not mean that there are only 8 
variants for each stroke, neither mean people can distinguish those 
variants from each others. For example, most Chinese will think the 
first Dot from the left is the same as the fourth Dot from the left. 
  And the differents between them are really style. Therefore, it is 
not a good idea to encode those variants
3. There are more composit strokes if  you really want to encode 
strokes. For example: 
http://people.netscape.com/ftang/chineselearning/strokes/refglyph_003.gif
http://people.netscape.com/ftang/chineselearning/strokes/refglyph_004.gif


Andrew C. West wrote:

  On Thu, 19 Feb 2004 18:27:09 -0800 (PST), Kenneth Whistler wrote:
  
   Of the 64 entities listed on the page:
  
   http://www.chinavoc.com/arts/calligraphy/eightstroke.asp
  
   *none* of them are encoded, and *none* of them are standard
   enough to merit consideration -- if by consideration you mean
   separate encoding as characters.
  
 
  I'm not sure about *none* of them are encoded. As far as I can tell,
  pretty
  much most of the basic ideographic stroke forms are either already
  encoded in
  CJK and CJK-B or are proposed in CJK-C (where encoded here means
  encoded in
  their own right or can be represented by same-shaped ideographs).
 
  See for example the IRG document
  
http://www.cse.cuhk.edu.hk/~irg/irg/irg19/N927_Add%202%20Strokes%20to%20C1.doc 

 
  which states :
 
  quote
  Although most ideographic strokes have been encoded in CJK (including
  Ext.A and
  B) or submitted to CJK_C1 by IRG members, there are two ideographic
  strokes are
  found missing. Ideographic strokes are important for ideograph
  decomposition,
  analysis and for making ideographic strokes subset. Chinese linguists
  suggest to
  add these two ideographic strokes to CJK_C1.
  /quote
 
  I also remember reading one WG2 document that explicitly raised the
  question of
  how to deal with all the ideographic strokes proposed in CJK-C that
  are not
  distinct ideographs in their own right, although I can't seem to
  locate that
  document any more.
 
  All except one of the eight basic strokes mentioned at
  http://www.chinavoc.com/arts/calligraphy/eightstroke.asp are
  *representable*
  using existing characters in the CJK and/or Kangxi Radicals blocks :
 
  dot = U+4E36 or U+2F02 [KANGXI RADICAL DOT]
  dash = U+4E00 or U+2F00 [KANGXI RADICAL ONE]
  perpendicular downstroke = U+4E28 or U+2F01 [KANGXI RADICAL LINE]
  downstroke to the left or left-falling stroke = U+4E3F or U+2F03
  [KANGXI RADICAL
  SLASH]
  wavelike stroke or right-falling stroke = U+4E40
  hook = U+4E85 or U+2F05 [KANGXI RADICAL HOOK], as well as U+4E5A and
  U+2010C
  upstroke to the right =
  bend or twist = U+4E5B and U+200CC
 
  I concur with Ken that the 8x8 stroke categorization given at this web
  site is
  largely artificial. Whilst it may be useful to encode general
  ideographic stroke
  forms to help in the analysis and decomposition of ideographs, in my
  opinion the
  minute distinctions in the way that dots and dashes are written in
  various
  individual ideographs are beyond the scope of a character encoding
  system as the
  exact shape of a dot or length of a dash is irrelevant to any analysis
  of the
  compositional structure of an ideograph.
 
  Andrew
 





Re: Codes for Individual Chinese Brushstrokes

2004-02-20 Thread Michael Everson
At 18:27 -0800 2004-02-19, Kenneth Whistler wrote:

If you want to know how many stroke types there really are
and how their forms are modified in context in various
Chinese characters, you should consult with Tom Bishop and
Richard Cook, who have an extensive catalog of basic stroke
types and forms based on the usage of CDL in the Wenlin
system for constructing Chinese character glyphs.
I was only trying to clarify what I thought the question was, Ken. I 
did forward it to Richard, who has responded offline.to Tim.
--
Michael Everson * * Everson Typography *  * http://www.evertype.com



Mother Language Day

2004-02-20 Thread John Cowan
23 February is the fifth International Mother Language Day.
See http://tinyurl.com/2fdzc for details.

-- 
Verbogeny is one of the pleasurettesJohn Cowan [EMAIL PROTECTED]
of a creatific thinkerizer. http://www.reutershealth.com
   -- Peter da Silvahttp://www.ccil.org/~cowan



FW: Web Form: Fonts display on Palm OS

2004-02-20 Thread Magda Danish \(Unicode\)
 
Bob,

I am forwarding your email to the Unicode public list
http://www.unicode.org/consortium/distlist.html for a possible answer
from one of the list's subscribers.


---
Magda Danish
Administrative Director
The Unicode Consortium
+1 650-693-3921
 

-Original Message-
Date/Time:Fri Feb 20 15:27:42 EST 2004
Contact:  [EMAIL PROTECTED]
Report Type:  Other Question, Problem, or Feedback

Hello and great site.

In short - is there a way to modify or make my own 256 character set?
   

I am working on displaying fonts on Palm OS with a program called iSilo.

It appears that Palm uses an Ascii Chart very similar to that used by
Windows (with a few PDA specific characters added).  It also appears
that Palm is limited to 256 characters.

iSilo is a program that translates web pages into a Palm readable
format.

Greek letters will not display on the Palm if you use character set 1252
on none at all on your web page.

If you set the character set to 1253 (Greek) and use Greek characters on
your page, the default Latin characters in those Ascii positions will be
displayed.  In turn, if you use a custom font with Greek characters in
those spots - you get Greek letters on your web page and your Palm.

My problem is that I use arrows and other characters (such as the Male
and Female symbols) not on any specific Character set I have found.  I
can make a custom font for these and the Greek letters, but need to make
a custom character map.  Is there a way of doing this?

Thanks for your time.


-- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)




Re: Mother Language Day

2004-02-20 Thread Don Osborn
Thanks for this.  The usual observance of IMLD is 21 February, but the more
days the better!  (Apparently a ceremony at UNESCO will be on 2/23.)  I had
trouble accessing the URL you provided.  The UNESCO page is
http://www.unesco.org/education/IMLD2004 .

It would be great to get an ICT angle introduced in future observances,
including of course, Unicode.

Don Osborn
Bisharat.net

- Original Message - 
From: John Cowan [EMAIL PROTECTED]
To: [EMAIL PROTECTED]
Sent: Friday, February 20, 2004 9:45 PM
Subject: Mother Language Day


 23 February is the fifth International Mother Language Day.
 See http://tinyurl.com/2fdzc for details.

 -- 
 Verbogeny is one of the pleasurettesJohn Cowan
[EMAIL PROTECTED]
 of a creatific thinkerizer. http://www.reutershealth.com
-- Peter da Silvahttp://www.ccil.org/~cowan






RE: Mother Language Day

2004-02-20 Thread joe

(Hmm, in Russian mother language (maternij jazik) means something *verry* different.

Watch your language! ;-)

Joe