Re: newbie: unicode (when used as a coding) = UTF16LE?

2003-02-12 Thread starner
>Also it seems to me when ContentType in a html page is "unicode", IE tends to >understand it as UTF16LE. So it seems UTF16LE is (or was) the standard coding for >unicode. Just because IE does something doesn't mean it's the standard. The whole world doesn't run IE. The legal content types are

Re: newbie: unicode (when used as a coding) = UTF16LE?

2003-02-12 Thread Jungshik Shin
On Thu, 13 Feb 2003, Zhang Weiwu wrote: > Very newbie question: > 1) I noticed when I save a file as "unicode" in Windows 2000, or > other editor like EditPlus, the file begins with FF FE, which looks > like UTF16LE. Also it seems to me when ContentType in a html page is > "unicode", IE tends to u

newbie: unicode (when used as a coding) = UTF16LE?

2003-02-12 Thread Zhang Weiwu
Very newbie question: 1) I noticed when I save a file as "unicode" in Windows 2000, or other editor like EditPlus, the file begins with FF FE, which looks like UTF16LE. Also it seems to me when ContentType in a html page is "unicode", IE tends to understand it as UTF16LE. So it seems UTF16LE is

RE: Indic Vowel/Consonant combinations

2003-02-12 Thread Andy White
Michael Everson wrote: > At 22:20 + 2003-02-12, [EMAIL PROTECTED] wrote: > >Andy White wrote, > >> >How then would you encode a visual U+0930, U+094D, U+090B (which > >> may be >needed when a glottal stop is intended between > the 'ra' and > >> 'ri')? > >> > >> U+0930, U+090B, of cours

Oriya WA

2003-02-12 Thread Michael Everson
At 23:47 + 2003-02-12, Andy White wrote: Michael Everson wrote: > These are A-VIRAMA-YA-AA and E-VIRAMA-Y-AA, which are sequences of an independent vowel plus a subjoined consonant plus a dependent vowel. Those sequences are used to represent foreign sounds in Bengali. Since the ya-phala

Bengali WA etc.

2003-02-12 Thread Michael Everson
I have put up some samples of the Bengali version of the letter Wa here: http://www.exnet.btinternet.co.uk/uniprop/tsample.htm I will comment on this in another post. Sailendra Biswas' transliterations seem odd. o = a-y-aa (with dot above y), ô = a, oo = au, o-short = o s = j-ri, t = tta-virama

RE: Indic Vowel/Consonant combinations

2003-02-12 Thread Michael Everson
At 22:20 + 2003-02-12, [EMAIL PROTECTED] wrote: . Andy White wrote, >How then would you encode a visual U+0930, U+094D, U+090B (which may be >needed when a glottal stop is intended between the 'ra' and 'ri')? U+0930, U+090B, of course. This would not be expected to show a visible viram

RE: separate code positions for Malayalam 'chillu' characters

2003-02-12 Thread Andy White
K.G Sulochana wrote: > > It is proposed that the five pure consonants (Chillu > characters) in Malayalam > ()be given separate code positions > in Malayalam Unicode block Who is it that is proposing that these should be separate charaters? > In ISCII based

RE: Never say never

2003-02-12 Thread Michael Everson
At 00:23 + 2003-02-13, Andy White wrote: When ever I have written a paper or mentioned errors and problems with the Indic FAQ, there is always an overall lack of response on this list. This has lead me to believe in the past, that you guys are not interested. If this were not the case I am su

Re: Never say never

2003-02-12 Thread Asmus Freytag
At 08:13 AM 2/12/03 -0800, Doug Ewell wrote: > Even then, you may be behind a time lag of more than one month because > the UTC meetings minutes are posted a little late. So, to be fully > aware, apart from becoming a member, you should also attend UTC > meetings. I would imagine that issues like

RE: Indic Vowel/Consonant combinations

2003-02-12 Thread Michael Everson
At 22:20 + 2003-02-12, [EMAIL PROTECTED] wrote: . Andy White wrote, >How then would you encode a visual U+0930, U+094D, U+090B (which may be >needed when a glottal stop is intended between the 'ra' and 'ri')? U+0930, U+090B, of course. This would not be expected to show a visible viram

RE: Never say never

2003-02-12 Thread Andy White
Michael Everson wrote: > At 06:53 + 2003-02-12, Andy White wrote: > > >What gave you the power to suddenly come up with encoding > innovations > >such as this? > > Andy, you're being silly. No, not silly, but yes, I was overreacting a little bit maybe ;-) (sometimes I puzzle over why suc

RE: Never say never

2003-02-12 Thread Andy White
Michael Everson wrote: > At 02:27 + 2003-02-12, Andy White wrote: > >I said: > >? (I am talking about the letters mentioned in the Unicode Indic FAQ, > > > http://www.unicode.org/faq/indic.html#13) > > > >Just to be clear, I mean the letters called 'Vowel_A_zophola_AA' & > >'Vowel_E_zophola_

Re: Indic Vowel/Consonant combinations

2003-02-12 Thread Jjagenbroad
  Wed.Feb.12, 2003   No halant is needed.  What appears as the stand alone R vowel with a superscript reph hook above is the rendering of the r consonant (U+0930) and the vowel sign vocalic R (U+0943).  This is demonstrated by the chart

RE: Indic Vowel/Consonant combinations

2003-02-12 Thread jameskass
. Andy White wrote, > >How then would you encode a visual U+0930, U+094D, U+090B (which may be > >needed when a glottal stop is intended between the 'ra' and 'ri')? > > U+0930, U+090B, of course. This would not be expected to show a visible virama. Best regards, James Kass .

RE: Indic Vowel/Consonant combinations

2003-02-12 Thread Michael Everson
At 07:57 + 2003-02-12, Andy White wrote: >> An example is the Devanagri Letter Vocalic R with Superscript Letter >> Ra (aka Vowel Ru with Reph). Despite many discussions, no one has >> been able to come to any agreement as to how to encode it. Is this a >> candidate? If not; how to encode

RE: Never say never

2003-02-12 Thread Michael Everson
At 02:27 + 2003-02-12, Andy White wrote: I said: ? (I am talking about the letters mentioned in the Unicode Indic FAQ, > http://www.unicode.org/faq/indic.html#13) Just to be clear, I mean the letters called 'Vowel_A_zophola_AA' & 'Vowel_E_zophola_AA' as mentioned in the above mentioned FAQ.

RE: Never say never

2003-02-12 Thread Michael Everson
At 06:53 + 2003-02-12, Andy White wrote: What gave you the power to suddenly come up with encoding innovations such as this? Andy, you're being silly. The FAQ has continually been queried and criticised since it was put up, but you are seriously going to stand by it aren't you. Andy, if

RE: Never say never

2003-02-12 Thread Peter_Constable
On 02/12/2003 06:48:57 AM Roozbeh Pournader wrote: >That is not the Unicode procedure for encoding new characters. The public >Unicode list is informal and informational. If you want to be aware of all >standardisation decisions of Unicode, you should become a member of the >Unicode Consortium. I

Re: Glyph of Pipeline Characters ?

2003-02-12 Thread Rick McGowan
Roozbeh pined: > Actually, I lost the email with the link, and cried a lot [;)] when I > couldn't find the link anywhere. The Yahoo archive was not that good for > searching. I guess someone should volunteer to host an archive of unicode > list in mailbox format somewhere. Have you checked the

Re: Never say never

2003-02-12 Thread Rick McGowan
Marco Cimarosti mentioned... > if it had a canonical composition, I would have posted a > formal error report to "[EMAIL PROTECTED]", rather than > just a lazy comment on the Unicode List. By the way, there is no such address any more. It was discontinued a long time ago. Please report errata u

Re: Glyph of Pipeline Characters ?

2003-02-12 Thread Roozbeh Pournader
On Wed, 12 Feb 2003, Doug Ewell wrote: > Also, when you need to search the Unicode mailing list archives, the > search engine on the Unicode site is often more useful than Yahoo: > > http://www.unicode.org/mail-arch/ Thanks a lot! I was misdirected to the Yahoo archives from the mailing list pag

Re: Glyph of Pipeline Characters ?

2003-02-12 Thread Doug Ewell
Roozbeh Pournader wrote: > Actually, I lost the email with the link, and cried a lot [;)] when I > couldn't find the link anywhere. The Yahoo archive was not that good > for searching. I guess someone should volunteer to host an archive of > unicode list in mailbox format somewhere. To prevent f

[OT] Re: Converting EBCDIC to Unicode

2003-02-12 Thread Doug Ewell
Carl W. Brown wrote: > That is true. The half width katakana did replace the small roman > letters. In those days one rarely used lower case and Japanese > support was usually limited to katakana. It let people replace the > print train with an English/Japanese one and not make any other > ap

Re: Never say never

2003-02-12 Thread Roozbeh Pournader
On Wed, 12 Feb 2003, Doug Ewell wrote: > I would imagine that issues like this are also discussed on the > members-only UnicoRe list. It's not about discussions, but final decisions. Discussions happen, but from my personal experience I can tell you that what will be decided finally is not always

RE: Converting EBCDIC to Unicode

2003-02-12 Thread Carl W. Brown
Markus, > There are some more characters that have the same codes in most > EBCDIC codepages, but there are also > some where the Latin letters are not all present. (I think some > old Japanese EBCDIC codepages > replace small Latin letters with Katakana ones.) > That is true. The half width

Re: Never say never

2003-02-12 Thread Doug Ewell
Roozbeh Pournader wrote: > Even then, you may be behind a time lag of more than one month because > the UTC meetings minutes are posted a little late. So, to be fully > aware, apart from becoming a member, you should also attend UTC > meetings. I would imagine that issues like this are also disc

Re: Glyph of Pipeline Characters ?

2003-02-12 Thread Roozbeh Pournader
On Tue, 11 Feb 2003, Doug Ewell wrote: > > The new character code charts are available for review at > > http://www.unicode.org/charts/u40-beta.html > > (Note: It would be nice if the Unicode site had a link to the beta > charts somewhere. The Pipeline page still says, "Charts of the > character

RE: Never say never

2003-02-12 Thread Kenneth Whistler
Andy continued: > > In principle, at some point in the future, either the > > phonology or the orthography or both could evolve to > > the point where the entire constructs start to get handled > > as basic orthographic units (or "letters") for Bengali, > > but it isn't really the place of the Un

Re: Never say never

2003-02-12 Thread Michael Everson
At 07:09 -0500 2003-02-12, John Cowan wrote: Kenneth Whistler scripsit: An example which comes to mind is using "kl'" to try to represent a lateral affricate, for example. Notably (though without the apostrophe) as in "Klingon". Written orthographically "tlh" in that language. -- Michael

RE: Never say never

2003-02-12 Thread Roozbeh Pournader
On Wed, 12 Feb 2003, Andy White wrote: > I do regularly check the Unicode site for proposed new characters and > scripts. I did not notice the ORIYA WA before. This must have only just > recently been added. You are right, it was accepted in last December's ISO/IEC JTC1/SC2/WG2 meeting, in Tokyo

Re: Never say never

2003-02-12 Thread John Cowan
Kenneth Whistler scripsit: > An example which comes to mind is > using "kl'" to try to represent a lateral affricate, > for example. Notably (though without the apostrophe) as in "Klingon". -- John Cowan http://www.ccil.org/~cowan [EMAIL PROTECTED] To say that Bilbo's br

RE: Never say never

2003-02-12 Thread Marco Cimarosti
Kenneth Whistler wrote: > > Marco Cimarosti wrote: > > > It has been repeated a lot of times that no more > precomposed character > > will > > > never ever ever ever be added. ... > > I trust the clarification from John Cowan helped on this -- there > is no prohibition against adding characters w

RE : Unicode and Encoding Problems in Browsers

2003-02-12 Thread Azzedine Ait Khelifa
Thank you very much Otto, Your link is really useful, and now I don't have any problem. Best regards, Azzedine Ait Khelifa -Message d'origine- De : [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] De la part de Otto Stolz Envoyé : vendredi 7 février 2003 12:59 À : Muhammad Asif Cc

separate code positions for Malayalam 'chillu' characters

2003-02-12 Thread K.G Sulochana
Hello,      It is proposed that the five pure consonants (Chillu characters)  in Malayalam (    )be given separate code positions in Malayalam Unicode block. In ISCII based applications, three keys are used to form each of these characters as shown below.

RE: Indic Vowel/Consonant combinations

2003-02-12 Thread jameskass
. Andy White wrote, > Ah, but 0B35 didn't used to look the way it does in the charts today. It > used to look the same as 0B2C. > (Which I think that you probably already know) I have only a low-level understanding of the Oriya script and need to do an in-depth study of the conjunct forms. > > U

RE: Indic Vowel/Consonant combinations

2003-02-12 Thread Andy White
Jameskass wrote: > Andy White wrote, > > > The ORIYA LETTER WA (A.K.A. ORIYA VOWEL O WITH SUBSCRIPT > CONSONANT VA) > > It looks like a squashed U+0B13 atop the base of U+0B2A and > doesn't seem to resemble U+0B35. Ah, but 0B35 didn't used to look the way it does in the charts today. It used

Re: Converting EBCDIC to Unicode

2003-02-12 Thread Doug Ewell
Markus Scherer wrote: >> They are all the same in the A-Z, a-z, and 0-9 >> ranges, but beyond that they can differ substantially. > > There are some more characters that have the same codes in most EBCDIC > codepages, but there are also some where the Latin letters are not all > present. (I think