>Also it seems to me when ContentType in a html page is "unicode", IE tends to
>understand it as UTF16LE. So it seems UTF16LE is (or was) the standard coding for
>unicode.
Just because IE does something doesn't mean it's the standard. The whole
world doesn't run IE. The legal content types are
On Thu, 13 Feb 2003, Zhang Weiwu wrote:
> Very newbie question:
> 1) I noticed when I save a file as "unicode" in Windows 2000, or
> other editor like EditPlus, the file begins with FF FE, which looks
> like UTF16LE. Also it seems to me when ContentType in a html page is
> "unicode", IE tends to u
Very newbie question:
1) I noticed when I save a file as "unicode" in Windows 2000, or other editor like
EditPlus, the file begins with FF FE, which looks like UTF16LE. Also it seems to me
when ContentType in a html page is "unicode", IE tends to understand it as UTF16LE. So
it seems UTF16LE is
Michael Everson wrote:
> At 22:20 + 2003-02-12, [EMAIL PROTECTED] wrote:
> >Andy White wrote,
> >> >How then would you encode a visual U+0930, U+094D, U+090B (which
> >> may be >needed when a glottal stop is intended between
> the 'ra' and
> >> 'ri')?
> >>
> >> U+0930, U+090B, of cours
At 23:47 + 2003-02-12, Andy White wrote:
Michael Everson wrote:
> These are A-VIRAMA-YA-AA and E-VIRAMA-Y-AA, which are sequences of an
independent vowel plus a subjoined consonant plus a dependent vowel.
Those sequences are used to represent foreign sounds in Bengali.
Since the ya-phala
I have put up some samples of the Bengali version of the letter Wa here:
http://www.exnet.btinternet.co.uk/uniprop/tsample.htm
I will comment on this in another post.
Sailendra Biswas' transliterations seem odd.
o = a-y-aa (with dot above y), ô = a, oo = au, o-short = o
s = j-ri, t = tta-virama
At 22:20 + 2003-02-12, [EMAIL PROTECTED] wrote:
.
Andy White wrote,
>How then would you encode a visual U+0930, U+094D, U+090B (which may be
>needed when a glottal stop is intended between the 'ra' and 'ri')?
U+0930, U+090B, of course.
This would not be expected to show a visible viram
K.G Sulochana wrote:
>
> It is proposed that the five pure consonants (Chillu
> characters) in Malayalam
> ()be given separate code positions
> in Malayalam Unicode block
Who is it that is proposing that these should be separate charaters?
> In ISCII based
At 00:23 + 2003-02-13, Andy White wrote:
When ever I have written a paper or mentioned errors and problems with
the Indic FAQ, there is always an overall lack of response on this list.
This has lead me to believe in the past, that you guys are not
interested. If this were not the case I am su
At 08:13 AM 2/12/03 -0800, Doug Ewell wrote:
> Even then, you may be behind a time lag of more than one month because
> the UTC meetings minutes are posted a little late. So, to be fully
> aware, apart from becoming a member, you should also attend UTC
> meetings.
I would imagine that issues like
At 22:20 + 2003-02-12, [EMAIL PROTECTED] wrote:
.
Andy White wrote,
>How then would you encode a visual U+0930, U+094D, U+090B (which may be
>needed when a glottal stop is intended between the 'ra' and 'ri')?
U+0930, U+090B, of course.
This would not be expected to show a visible viram
Michael Everson wrote:
> At 06:53 + 2003-02-12, Andy White wrote:
>
> >What gave you the power to suddenly come up with encoding
> innovations
> >such as this?
>
> Andy, you're being silly.
No, not silly, but yes, I was overreacting a little bit maybe ;-)
(sometimes I puzzle over why suc
Michael Everson wrote:
> At 02:27 + 2003-02-12, Andy White wrote:
> >I said:
> >? (I am talking about the letters mentioned in the Unicode Indic FAQ,
> > > http://www.unicode.org/faq/indic.html#13)
> >
> >Just to be clear, I mean the letters called 'Vowel_A_zophola_AA' &
> >'Vowel_E_zophola_
Wed.Feb.12, 2003
No halant is needed. What appears as the stand alone R vowel with a superscript reph hook above is the rendering of the r consonant (U+0930) and the vowel sign vocalic R (U+0943). This is demonstrated by the chart
.
Andy White wrote,
> >How then would you encode a visual U+0930, U+094D, U+090B (which may be
> >needed when a glottal stop is intended between the 'ra' and 'ri')?
>
> U+0930, U+090B, of course.
This would not be expected to show a visible virama.
Best regards,
James Kass
.
At 07:57 + 2003-02-12, Andy White wrote:
>> An example is the Devanagri Letter Vocalic R with Superscript Letter
>> Ra (aka Vowel Ru with Reph). Despite many discussions, no one has
>> been able to come to any agreement as to how to encode it. Is this a
>> candidate? If not; how to encode
At 02:27 + 2003-02-12, Andy White wrote:
I said:
? (I am talking about the letters mentioned in the Unicode Indic FAQ,
> http://www.unicode.org/faq/indic.html#13)
Just to be clear, I mean the letters called 'Vowel_A_zophola_AA' &
'Vowel_E_zophola_AA' as mentioned in the above mentioned FAQ.
At 06:53 + 2003-02-12, Andy White wrote:
What gave you the power to suddenly come up with encoding innovations
such as this?
Andy, you're being silly.
The FAQ has continually been queried and criticised since it was put up,
but you are seriously going to stand by it aren't you.
Andy, if
On 02/12/2003 06:48:57 AM Roozbeh Pournader wrote:
>That is not the Unicode procedure for encoding new characters. The public
>Unicode list is informal and informational. If you want to be aware of all
>standardisation decisions of Unicode, you should become a member of the
>Unicode Consortium. I
Roozbeh pined:
> Actually, I lost the email with the link, and cried a lot [;)] when I
> couldn't find the link anywhere. The Yahoo archive was not that good for
> searching. I guess someone should volunteer to host an archive of unicode
> list in mailbox format somewhere.
Have you checked the
Marco Cimarosti mentioned...
> if it had a canonical composition, I would have posted a
> formal error report to "[EMAIL PROTECTED]", rather than
> just a lazy comment on the Unicode List.
By the way, there is no such address any more. It was discontinued a long
time ago. Please report errata u
On Wed, 12 Feb 2003, Doug Ewell wrote:
> Also, when you need to search the Unicode mailing list archives, the
> search engine on the Unicode site is often more useful than Yahoo:
>
> http://www.unicode.org/mail-arch/
Thanks a lot! I was misdirected to the Yahoo archives from the mailing
list pag
Roozbeh Pournader wrote:
> Actually, I lost the email with the link, and cried a lot [;)] when I
> couldn't find the link anywhere. The Yahoo archive was not that good
> for searching. I guess someone should volunteer to host an archive of
> unicode list in mailbox format somewhere.
To prevent f
Carl W. Brown wrote:
> That is true. The half width katakana did replace the small roman
> letters. In those days one rarely used lower case and Japanese
> support was usually limited to katakana. It let people replace the
> print train with an English/Japanese one and not make any other
> ap
On Wed, 12 Feb 2003, Doug Ewell wrote:
> I would imagine that issues like this are also discussed on the
> members-only UnicoRe list.
It's not about discussions, but final decisions. Discussions happen, but
from my personal experience I can tell you that what will be decided
finally is not always
Markus,
> There are some more characters that have the same codes in most
> EBCDIC codepages, but there are also
> some where the Latin letters are not all present. (I think some
> old Japanese EBCDIC codepages
> replace small Latin letters with Katakana ones.)
>
That is true. The half width
Roozbeh Pournader wrote:
> Even then, you may be behind a time lag of more than one month because
> the UTC meetings minutes are posted a little late. So, to be fully
> aware, apart from becoming a member, you should also attend UTC
> meetings.
I would imagine that issues like this are also disc
On Tue, 11 Feb 2003, Doug Ewell wrote:
> > The new character code charts are available for review at
> > http://www.unicode.org/charts/u40-beta.html
>
> (Note: It would be nice if the Unicode site had a link to the beta
> charts somewhere. The Pipeline page still says, "Charts of the
> character
Andy continued:
> > In principle, at some point in the future, either the
> > phonology or the orthography or both could evolve to
> > the point where the entire constructs start to get handled
> > as basic orthographic units (or "letters") for Bengali,
> > but it isn't really the place of the Un
At 07:09 -0500 2003-02-12, John Cowan wrote:
Kenneth Whistler scripsit:
An example which comes to mind is
using "kl'" to try to represent a lateral affricate,
for example.
Notably (though without the apostrophe) as in "Klingon".
Written orthographically "tlh" in that language.
--
Michael
On Wed, 12 Feb 2003, Andy White wrote:
> I do regularly check the Unicode site for proposed new characters and
> scripts. I did not notice the ORIYA WA before. This must have only just
> recently been added.
You are right, it was accepted in last December's ISO/IEC JTC1/SC2/WG2
meeting, in Tokyo
Kenneth Whistler scripsit:
> An example which comes to mind is
> using "kl'" to try to represent a lateral affricate,
> for example.
Notably (though without the apostrophe) as in "Klingon".
--
John Cowan http://www.ccil.org/~cowan [EMAIL PROTECTED]
To say that Bilbo's br
Kenneth Whistler wrote:
> > Marco Cimarosti wrote:
> > > It has been repeated a lot of times that no more
> precomposed character
> > will
> > > never ever ever ever be added. ...
>
> I trust the clarification from John Cowan helped on this -- there
> is no prohibition against adding characters w
Thank you very much Otto,
Your link is really useful, and now I don't have any problem.
Best regards,
Azzedine Ait Khelifa
-Message d'origine-
De : [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] De
la part de Otto Stolz
Envoyé : vendredi 7 février 2003 12:59
À : Muhammad Asif
Cc
Hello,
It is proposed that the five pure consonants
(Chillu characters) in Malayalam
(
)be given
separate code positions in Malayalam Unicode block. In ISCII based applications,
three keys are used to form each of these characters as shown below.
.
Andy White wrote,
> Ah, but 0B35 didn't used to look the way it does in the charts today. It
> used to look the same as 0B2C.
> (Which I think that you probably already know)
I have only a low-level understanding of the Oriya script and need
to do an in-depth study of the conjunct forms.
> > U
Jameskass wrote:
> Andy White wrote,
>
> > The ORIYA LETTER WA (A.K.A. ORIYA VOWEL O WITH SUBSCRIPT
> CONSONANT VA)
>
> It looks like a squashed U+0B13 atop the base of U+0B2A and
> doesn't seem to resemble U+0B35.
Ah, but 0B35 didn't used to look the way it does in the charts today. It
used
Markus Scherer wrote:
>> They are all the same in the A-Z, a-z, and 0-9
>> ranges, but beyond that they can differ substantially.
>
> There are some more characters that have the same codes in most EBCDIC
> codepages, but there are also some where the Latin letters are not all
> present. (I think
38 matches
Mail list logo