On Sun, Jan 20, 2002 at 10:44:00PM -0500, Aman Chawla wrote:
For sites providing archives
of documents/manuscripts (in plain text) in Devanagari, this factor could be
as high as approx. 3 using UTF-8 and around 1 using ISCII.
Uncompressed, yes. It shouldn't be nearly as bad compressed - gzip
- Original
Message -From: "David Starner" [EMAIL PROTECTED]To: "Aman Chawla" [EMAIL PROTECTED]Cc: "James Kass" [EMAIL PROTECTED]; "Unicode"[EMAIL PROTECTED]Sent: Monday, January 21, 2002 12:19
AMSubject: Re: Devanagari What's your point
On Sun, 20 Jan 2002, Aman Chawla wrote:
Taking the extra links into account the sizes are:
English: 10.4 Kb
Devanagari: 15.0 Kb
Thus the Dev. page is 1.44 times the Eng. page. For sites providing archives
of documents/manuscripts (in plain text) in Devanagari, this factor could be
as high
unjustified.
Devanagari text encoded in SCSU occupies exactly 1 byte per character, plus
an additional byte near the start of the file to set the current window (0x14
= SC4).
-Doug Ewell
Fullerton, California
In a message dated 2002-01-20 21:49:02 Pacific Standard Time,
[EMAIL PROTECTED] writes:
The issue was originally brought up to gather opinion from members of this
list as to whether UTF-8 or ISCII should be used for creating Devanagari web
pages. The point is not to criticise Unicode
On Mon, Jan 21, 2002 at 12:57:39AM -0500, [EMAIL PROTECTED] wrote:
This is why I really wish that SCSU were considered a truly standard
encoding scheme. Even among the Unicode cognoscenti it is usually
accompanied by disclaimers about private agreement only and not suitable
for use on the
Iwould be grateful if I could get opinions on
the following:
1. Which encoding/character set is most suitable
for using Hindi/Marathi (both of which use Devanagari) on the internet as well
asin databases, and why? In your response, please refer to: http://www.iiit.net/ltrc/Publications
This is with reference to the Unicode Devanagari
(Hindi) Range. Is there a way to overcome/override the automatic glyph
substitution that occurs when one types a pure consonant (eg. 0926 द) + halant (094D ् ) + another consonant (0918 घ) ?
When one types the previously indicated sequence
d are seen
in fonts used for literary publications that might not have the same
reproduction constraints as newspapers.
Thanks, -apurva
-Original Message-From: Aman Chawla
[mailto:[EMAIL PROTECTED]] Sent: Wednesday, January 16, 2002
2:45 PMTo: UnicodeSubject: Unicode
Aman Chawla wrote:
[EMAIL PROTECTED]">
This is with reference to the Unicode
Devanagari (Hindi) Range. Is there a way to overcome/override the automatic
glyph substitution that occurs when one types a pure consonant (eg. 0926
द) + halant (094D
्) + another consonant
like in that temporary
state, you could probably get it to do it. I have used Keyman to create
some input methods that did some similar things (though not for Devanagari
script).
Moreover, it is tricly to implement the deletion of the danda: when the
user hits backspace near 0917 the keyboard
Hello,
I am a new user of unicode for devanagari (Hindi) in Microsoft Word. I am
very impressed with this font, but I'm wondering if there is a way to remap
the keyboard, so that I don't have to use shortcut keys which require
multiple keystrokes in order to type devanagari.
Also, I noticed
From: Rick McGowan [mailto:[EMAIL PROTECTED]]
Mike Ayers wrote:
The last I knew,
computer-savvy Taiwan and Hong Kong were continuing to invent new
characters. In the end, the onus is on the computer to
support the user.
Yes, the computer should support the user, but... The
On Tue, Nov 14, 2000 at 08:22:21AM -0800, D.V. Henkel-Wallace wrote:
Sadly, it seems unlikely that any furture change or adoption of orthography
will use characters not already supported by the then major computer
systems. In fact the trend seems to be the other way, viz Spain's changing
"D.V. Henkel-Wallace" wrote:
For a minority language (which all remaining unwritten languages are) the
pressure will be strong to use existing combinations (since they won't
constitute a large enough community for people to write special rendering
support).
OTOH minority languages have
Mark Davis wrote:
The Unicode Standard does define the rendering of such combinations, which
is in the absence of any other information to stack outwards.
A dumb implementation would simply move
the accent outwards if there was in the same position. This will not
necessarily produce an
[EMAIL PROTECTED] wrote:
Unfortunately, there's no corresponding LATIN CAPITAL LETTER N WITH LONG
RIGHT LEG, which Lakota needs.
To my knowledge, the discussion in September between John Cowan and Curtis Clark
didn't terminate with any actual proposal, and I'm not clear on whether the above
From: D.V. Henkel-Wallace [mailto:[EMAIL PROTECTED]]
At 06:30 2000-11-14 -0800, Marco Cimarosti wrote:
But my point was: not even Mr. Ethnologue himself knows
exactly *which*
combinations are meaningful, in all orthographic system.
And, clearly, no
one can figure out which combinations
Mike Ayers wrote:
The last I knew,
computer-savvy Taiwan and Hong Kong were continuing to invent new
characters. In the end, the onus is on the computer to support the user.
Yes, the computer should support the user, but... The invention of new characters to
serve multitudes is OK, and
On Tue, 14 Nov 2000, Rick McGowan wrote:
Mike Ayers wrote:
The last I knew,
computer-savvy Taiwan and Hong Kong were continuing to invent new
characters. In the end, the onus is on the computer to support the user.
Yes, the computer should support the user, but... The invention of new
Antoine Leca wrote:
My understanding is that there are a number of similar cases,
which are not
officially prohibited (AFAIK), but does not carry any sense.
For example, how about digits followed by accents (as
combining marks)?
Or the kana voicing/voiceless combining marks, when they
Monday, November 13, 2000 10:11
Subject: Re: Devanagari question
Marco Cimarosti wrote:
Antoine Leca wrote:
My understanding is that there are a number of similar cases,
which are not
officially prohibited (AFAIK), but does not carry any sense.
I think that the original idea beh
, slayer of demons)
= 0928 090B Ra[sup] 0924 0940
NaiRiTYa (south-west)
= 0928 090B Ra[sup] 0924 094D 092F
The Devanagari shaping engine in Uniscribe currently recognises a 0930 094D
preceding only consonants, to be duely reordered to the end of the syllable
and replaced with Ra[sup
Thursday, November 8, 2000
After sending a comment on the Ra(sup) + independent vowel discussion two
more general Devanagari questions occurred to me:
1. Is a halant/virama ever valid following other than a consonant (or
consonant and nukta)? My
1. Is a halant/virama ever valid following other than a consonant (or
consonant and nukta)?
Legal? In the sense of "any string is legal", yes; as is anything else. The
implementation question to answer is whether it's useful or renderable, and if so, how.
The independent vowel followed by
On Thu, 9 Nov 2000, Rick McGowan wrote:
1. Is a halant/virama ever valid following other than a consonant (or
consonant and nukta)?
Legal? In the sense of "any string is legal", yes; as is anything else.
The implementation question to answer is whether it's useful or
renderable, and
Hello,
In the Devanagari section of the standard, rule R2, on page 217 of the
version 3.0 standard, states, "If the dead consoant RA[d] preecesd either a
consonant *or an independent vowel,* then it is replaced by the superscript
nonspacing mark RA[sup]..."
I've never seen a RA[su
0940
NaiRiTYa (south-west)
= 0928 090B Ra[sup] 0924 094D 092F
The Devanagari shaping engine in Uniscribe currently recognises a 0930 094D
preceding only consonants, to be duely reordered to the end of the syllable
and replaced with Ra[sup]. Whether this be extended to independent vowels
had figured
You will find examples of Devanagari on the ICU locale explorer pages..
http://oss.software.ibm.com/icu/demo/
Try Marathi, Konkani, and Hindi.
The encoding should be UTF-8 by default or you can change it at the
bottom of the page.
Hindi especialy has an extensive but incomplete list
Anyone know of any Devanagari documents (Sanskrit, Hindi, Nepali) on the Web
using UTF-8 (other than the pages at
http://titus.uni-frankfurt.de/unicode/samples/rvbeispx.htm ) - especially any
using Dynamic fonts?
I am not interested in Devanagri sites using font based encodings.
- Chris
: ? Devanagari
Anyone know of any Devanagari documents (Sanskrit, Hindi, Nepali) on the Web
using UTF-8 (other than the pages at
http://titus.uni-frankfurt.de/unicode/samples/rvbeispx.htm ) - especially any
using Dynamic fonts?
I am not interested in Devanagri sites using font based encodings
201 - 231 of 231 matches
Mail list logo