Change the input language in java

2000-06-28 Thread Nicolas Toussaint
Unicoders, Java programers, Hello I wich changed the local language (from my keyBoard) under my java application. I know now how to dislpay all the characteres of Unicode in my JTextField, but now I want to be able to ENTER some characteres in my JEditoPane (like greek, russian). I can do thi

Re: UTF-8N?

2000-06-28 Thread Doug Ewell
Asmus Freytag <[EMAIL PROTECTED]> wrote: >> Yes. The Unicode Standard will deprecate the use of U+FFEF (Note: not >> U+FFFE) as a zero-width non-breaking space (despite its formal name). >> >> And U+FFEF should *only* be used as a byte order mark and/or >> signature. (That is already ambiguous an

Re: Do you have these characters?

2000-06-28 Thread Antoine Leca
[EMAIL PROTECTED] wrote: > > Has Unicode, by any name, the two mutant digits in the attached file? What about the pairs 0041;LATIN CAPITAL LETTER A 0042;LATIN CAPITAL LETTER B and 0061;LATIN SMALL LETTER A 0062;LATIN SMALL LETTER B (which will be the one chosen by almost any software for this u

Re: Not all Arabics are created equal...

2000-06-28 Thread brendan_murray
"Michael Kaplan (Trigeminal Inc.)" <[EMAIL PROTECTED]> wrote > Issues such as this one can obviously cause major issues since it even > affects logical vs. visual order of numbers! I don't think there was any suggestion that the logical order would differ: AFAIK, only the display varies. B=

Not all Arabics are created equal...

2000-06-28 Thread Michael Kaplan (Trigeminal Inc.)
I have heard the same thing, and think it is underscores a point that MANY companies forget: not all dialects of Arabic are the same, despite the fact that most software packages have *one* Arabic version. Issues such as this one can obviously cause major issues since it even affects logical vs.

RE: UTF-8 and UTF-16 issues

2000-06-28 Thread Jones, Bob
Has anyone out there taken a cross platform non-Unicode enabled legacy application and converted it to run UTF-8 instead of UTF-16? I've read Markus Kuhn's UTF-8/Unicode FAQ at http://www.cl.cam.ac.uk/~mgk25/unicode.html and while it was helpful, it only addresses Unix. I also have to consider W

Re: Looking For Information

2000-06-28 Thread brendan_murray
[EMAIL PROTECTED] (Timothy Partridge) wrote: > Do IBM DBCS strings assume starting in single byte mode? > And would the presence of certain bytes in UTF-16 trigger a switch from > double to single byte mode? Yes and yes. There are a number of Asian EBCDIC codepages that follow this structure. The

Re: Looking up han characters

2000-06-28 Thread John Hudson
At 11:00 AM 6/28/00 -0800, [EMAIL PROTECTED] wrote: >How do I look up a han character if I don't know its codepoint? What if all I >have is its shape, or its EUC-JP or Shift-JIS number? There are a couple I >want to see. If you know the characters you are looking for in their Japanese (Kanji) fo

RE: Plane 14 language tags

2000-06-28 Thread Murray Sargent
Note that in C, it's essentially just as fast to make character comparisons with (ch | 0x20) as with ch alone, i.e., if you know ch is in an ASCII range (0 - 0x7F or 0xE - 0xE007F), you can do a case insensitive compare as quickly as a case sensitive one. The problem with assuming lower case

RE: Bidi Examples of math, going right to left

2000-06-28 Thread Jonathan Rosenne
I had been told that in Egypt math is right to left, at least in school books. I have no first hand knowledge. Jony > -Original Message- > From: Roozbeh Pournader [mailto:[EMAIL PROTECTED]] > Sent: Wednesday, June 28, 2000 6:40 PM > To: Unicode List > Cc: Unicode List; [EMAIL PROTECTED]

Re: Looking For Information

2000-06-28 Thread Timothy Partridge
Harry Aufderheide recently said: > I work for a large global firm in the transportation industry and we are > taking a high-level look of our future business requirements for and the > I.S. effort to properly handle all the characters of all the languages > currently in use on the planet earth.

Re: Looking up han characters

2000-06-28 Thread brendan_murray
[EMAIL PROTECTED] wrote: > How do I look up a han character if I don't know its codepoint? What if all I > have is its shape, or its EUC-JP or Shift-JIS number? There are a couple I You can use the information in the unihan.txt file (link to it from http://www.unicode.org/unicode/faq/mappings/ma

Re: Looking up han characters

2000-06-28 Thread Rick McGowan
28L;56L;84L;112L;140L;168L;196L;224L;252L;280L;308L;336L;Rampshot asked... > How do I look up a han character if I don't know its codepoint? > What if all I have is its shape, or its EUC-JP or Shift-JIS number? > There are a couple I want to see. The people at Sanseido have just now made it really

Re: Plane 14 language tags

2000-06-28 Thread Kenneth Whistler
Doug Ewell asked: > 2. (Ken and Glenn) Can you explain in a little more detail the rationale > for lowercasing the entire language tag? It seems that if RFC 1766 > is the model to be followed, then the RFC 1766 casing convention > (lowercase for language, uppercase for country) migh

Looking up han characters

2000-06-28 Thread rampshot
How do I look up a han character if I don't know its codepoint? What if all I have is its shape, or its EUC-JP or Shift-JIS number? There are a couple I want to see. Robert Lozyniak 01 02 03 04 05 06 "Don't stop movin', 07 08 09 10 11 12 13 14 It's your life, keep on groovin', 1

Re: Plane 14 language tags

2000-06-28 Thread John Cowan
Doug Ewell wrote: > 2. (Ken and Glenn) Can you explain in a little more detail the rationale > for lowercasing the entire language tag? It seems that if RFC 1766 > is the model to be followed, then the RFC 1766 casing convention > (lowercase for language, uppercase for country) migh

Re: C # character model

2000-06-28 Thread Torsten Mohrin
Antoine Leca <[EMAIL PROTECTED]> wrote: >Torsten Mohrin wrote: >> Antoine Leca <[EMAIL PROTECTED]> wrote: >> [...] >> >> > APIs use and return single 16-bit values. >> > >> >Ah, that may be a problem (what is the ToUpper return value of ß?) >> >> I don't know the mentioned API, but it could retu

Re: italic variants of CYRILLIC SMALL LETTER I

2000-06-28 Thread Pierpaolo Bernardi
On Wed, 28 Jun 2000, Gary P. Grosso wrote: > A user's query has been passed on to me, regarding > CYRILLIC SMALL LETTER I and CYRILLIC SMALL LETTER SHORT I > (U+0438 and U+0439). They pointed out that they > noticed that when they are italicized, they look like Us > instead of backward Ns. A c

Re: Bidi Examples of math, going right to left

2000-06-28 Thread Roozbeh Pournader
On Tue, 27 Jun 2000, Tex Texin wrote: > I have been asked to gather some examples of mathematical > expressions used in bidirectional languages, where the > expressions go right to left rather than left to right. Persian and Hebrew math are left to right. At least some Arabic math is right to

italic variants of CYRILLIC SMALL LETTER I

2000-06-28 Thread Gary P. Grosso
A user's query has been passed on to me, regarding CYRILLIC SMALL LETTER I and CYRILLIC SMALL LETTER SHORT I (U+0438 and U+0439). They pointed out that they noticed that when they are italicized, they look like Us instead of backward Ns. I'm pretty sure this is as it should be (the font in quest

Re: C # character model

2000-06-28 Thread Mark Davis
Almost all international functions (upper-, lower-, titlecasing, case folding, drawing, measuring, collation, transliteration, grapheme-, word-, linebreaks, etc.) should take *strings* in the API, NOT single code-points. Single code-point APIs almost always malfunction once you get outside of s

Plane 14 language tags

2000-06-28 Thread Doug Ewell
I have two questions about Plane 14 language tags as specified in Technical Report #7: 1. Does anyone know of any implementation that interprets language tags and actually does something with the result? I'm not looking for code, just information and ideas. 2. (Ken and Glenn) Can you

Re: Do you have these characters?

2000-06-28 Thread Doug Ewell
> Unicode does not have these two characters (dozenal digit 10 {a turned > digit 2} and dozenal digit 11 [a reversed digit 3}). I see two slightly different forms for this DUODECIMAL DIGIT ELEVEN on the Dozenal Society's Web page. The PDF referenced by Robert shows a *reversed* 3 (rotated about

Can you help me fill out the submission form?

2000-06-28 Thread rampshot
Please e-mail me and I'll e-mail you a Word document with the form so you can help me fill it out. I already have most of it. Robert Lozyniak 01 02 03 04 05 06 "Don't stop movin', 07 08 09 10 11 12 13 14 It's your life, keep on groovin', 15 16 17 18 19 20 21 Get it right,

Phonetic?

2000-06-28 Thread rampshot
Exactly what constitutes a phonetic sound, besides being made by a human being? I mean, clapping isn't phonetic, is it? Robert Lozyniak 01 02 03 04 05 06 "Don't stop movin', 07 08 09 10 11 12 13 14 It's your life, keep on groovin', 15 16 17 18 19 20 21 Get it right, 22

Re: C # character model

2000-06-28 Thread Antoine Leca
Torsten Mohrin wrote: > > Antoine Leca <[EMAIL PROTECTED]> wrote: > > [...] > >> > APIs use and return single 16-bit values. > > > >Ah, that may be a problem (what is the ToUpper return value of ß?) > > I don't know the mentioned API, but it could return 0x00DF or (to > indicate it as an error)

Re: The Roman numerals for dozenal digits??

2000-06-28 Thread Michael Everson
Ar 02:09 -0800 2000-06-28, scríobh [EMAIL PROTECTED]: >Only if you call "V" the same glyph as "5" >I want the new digits to be: >U+218A DUODECIMAL DIGIT TEN aka TURNED DIGIT TWO >U+218B DUODECIMAL DIGIT ELEVEN aka REVERSED DIGIT THREE >Don't worry; I'll be filling out the form in my best cursi

Re: C # character model

2000-06-28 Thread Torsten Mohrin
Antoine Leca <[EMAIL PROTECTED]> wrote: [...] >> > APIs use and return single 16-bit values. > >Ah, that may be a problem (what is the ToUpper return value of ß?) I don't know the mentioned API, but it could return 0x00DF or (to indicate it as an error) 0x. I don't see a problem. --Torsten

The Roman numerals for dozenal digits??

2000-06-28 Thread rampshot
Only if you call "V" the same glyph as "5" I want the new digits to be: U+218A DUODECIMAL DIGIT TEN aka TURNED DIGIT TWO U+218B DUODECIMAL DIGIT ELEVEN aka REVERSED DIGIT THREE Don't worry; I'll be filling out the form in my best cursive :) The reason I chose these codepoints is because they e

Re: C # character model

2000-06-28 Thread Antoine Leca
Markus Scherer wrote: > > John O'Conner wrote: > > It appears that this new product is not adopting UTF-32...and is > > sticking with UTF-16 (or more appropriately UCS-2?). Not very surprising given the commitment of MS with 16-bit Unicode. > > APIs use and return single 16-bit values. Ah, tha

RE: Looking For Information

2000-06-28 Thread Marco . Cimarosti
Harry R Aufderheide wrote: > 1. Is the UTF-8's character set equal to the Latin-1 (ASCII) > Code Page's? If not, what are the differences? As Brendan Murray already mentioned, UTF-8 is an encoding form of Unicode, so it supports *all* Unicode characters. In case you are wondering how this is po

Re: Do you have these characters?

2000-06-28 Thread Antoine Leca
Kenneth Whistler wrote: > > Brendan suggested: > > > Antoine Leca <[EMAIL PROTECTED]> wrote: > > > What about the pairs > > > 0041;LATIN CAPITAL LETTER A > > > > > (which will be the one chosen by almost any software for this use). > > > > This is a little too simplistic: these characters have