N2555

2004-11-05 Thread Alexander Savenkov
Hello,

the proposal N2555 titled Revised proposal for encoding the Glagolitic
script in the UCS by Michael Everson and Ralph Cleminson contains a
number of inaccuracies and errors according to my source.

Is it the most up-to-date version of the document?

Alexander
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re[2]: Writing Tatar using the Latin script; new characters to encode?

2004-07-27 Thread Alexander Savenkov
I didn't expect this pointless discussion to get that far. Hopefully,
this will be the end of it.

2004-07-19T02:39:47+03:00 Peter Kirk <[EMAIL PROTECTED]> wrote:

> Conclusion:

> 1) The Republic of Tatarstan passed a law in 1999 and coming into
> force in 2001 establishing a Tatar Latin alphabet.

True.

> 2) A Russian federal law (a monstrous piece of linguistic
> imperialism)

Peter, your estimates of Russian federal law are inadequate and go
beyond the scope of this list.

> overrode this in 2002, so after the Tatarstan law had come into
> force. Therefore this Latin alphabet was in some sense officially in
> force for a period. And it is still considered to be officially in
> force by many in Tatarstan including top government officials.

I can't guess what is considered by "many in Tatarstan". And I think
you shouldn't be guessing too as it makes no difference in our case.
If someone, in spite of the law, consider killing people to be ok,
it's a matter of court.

> 3) As the people of Tatarstan are independent-minded

Wow, where did you get this epithet? I agree with that, especially given
that 43 percent of Tatarstan's population are Russians. Just a quick
question, how is this sentence related to the discussion described in
the header?

> and more likely to follow their local leaders than the linguistic
> imperialists in Moscow, it is highly likely that at least some of
> them use the published Latin script even if it is not permitted to
> have official status.

True again. No one is banning you from using, e.g., pig Latin, online
or offline.

> 4) Not all speakers of the Tatar language live in the Russian
> Federation, and some live in countries like Azerbaijan where the
> official alphabets use Latin script. In such areas they are clearly
> likely to use the Latin script.

That seems to be the only relevant point of your letter, Peter.
Russian Kurds use Cyrillic script, others use Latin or Arabic... Same
for the Gipsies, same for Tatars. For this reason (and also for
historical reason) Latin alphabet for Tatar language exists.

Btw, I remember reading you visited Azerbaijan, so you know
the situation there better. I.e., you should know that many Azerbaijan
officials write their public speeches in Cyrillic script, so
the secretararies need to transliterate them into Latin before
publishing.

> 5) This is an alphabet which has been used, even in official
> websites, and very likely continues to be used by some. Decisions
> made in Moscow do not change this, especially because they are in
> practice widely ignored in Tatarstan

Once again, Peter, you're going off the topic. You're invited to prove
your assumptions with facts or withdraw them. I personally consider
statements of this kind as veiled attacks to Russia's statehood.
Please, stop that.

> and have no force in some other places where Tatars live. This
> alphabet therefore needs to be supported by Unicode. But fortunately
> this is not a problem as all the characters are already defined.

True, the alphabet is already supported. I've no idea about the point
of your latest letter.

Alexander
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re[2]: Writing Tatar using the Latin script; new characters to encode?

2004-07-18 Thread Alexander Savenkov
Hello,

(delayed response)

2004-05-12T19:37:51+03:00 Ernest Cline <[EMAIL PROTECTED]> wrote:

>  From: Alexander Savenkov <[EMAIL PROTECTED]>
>>
>> 2004-05-12T03:08:59+03:00 Eric Muller <[EMAIL PROTECTED]> wrote:
>>
>> > According to , there is a currently an effort to convert the
>> > writing of Tatar from Cyrillic to Latin.
>>
>> > 1. Does somebody have more information about that effort?
>>
>> Perhaps it's their own effort.
>>
>> > Eki lists four characters as needed but missing in Unicode (see 
>> >
>> <http://www.eki.ee/letter/chardata.cgi?lang=tt+Tatar&script=latin>).
>>
>> > 2. The case pair for barred o is encoded (U+019F and U+0275), and it
>> > seems that their confusion comes from less-than-perfect but annotated
>> > name for U+019F, and from the usage remark "African". Can we 
>> > authoritatively tell them that those two characters are the ones they
>> > want? Can we add a "Tatar" usage remark to both?
>>
>> Is there a need for this? You don't want to tell everyone on the net
>> about his or her wrong assumptions. There's one sentence in the page
>> you mentioned that gives a good description of this resource:
>>
>> "The conversion from Cyrillic to Latin script is planned within years
>> 2001-2011."
>>
>> This is false.
>>
>> > 3. The case pair n with descender is definitely not encoded, and from my
>> > memory of the discussion of ghe with descender, we would want to encode
>> > them as separate characters (rather than with combining descenders on
>> > "n"). Is anybody working on that proposal?
>>
>> There's no Latin Tatar script. It's the law. Full stop.
>>
>> It's the Institute of Estonian language. I hope they know more about
>> Estonian than about other languages and Unicode.

> They are numerous sites on the web about the change from Cyrillic to
> Latin for Tatar that is planned for completion by 2011 by the Republic
> of Tatarstan (a part of the Russian Federation).

Ernest, I fail to see how the fact that there are numerous sites about
Latin for Tatar proves it really exists. There are numerous sites
about Babylon 5 and Frankenstein. What are your thoughts about these?

> There is legal wrangling
> over wether Tatarstan can make the change back to Latin script official
> for Tatar as it is used there, but no final decision has been reached and
> there is probably at least several more years of legal shenanigans
> before it is reached.

You're wrong and the facts you give here are outdated. Legal wrangling
is over. See links below (in Russian).

...

> As for the merits of the proposed change back to Latin, I think
> it is silly for Tatarstan to make the change and it is silly for the
> Russian Federation to oppose it.

Your clever thoughts are really helpful. I wonder what Russians and
Tatars would do without them.

Links in Russian:
http://www.tatar.ru/?DNSID=0627096ec5c075004c0d219207f349de&node_id=978
http://www.tatar.ru/1296_c.html
http://www.tatar.ru/index.php?node_id=1006
http://www.tatar.ru/?DNSID=0627096ec5c075004c0d219207f349de&node_id=2610
http://www.tatar.ru/?node_id=2611
http://peoples.org.ru/proekt.html
http://peoples.org.ru/stenogramma.html

Alexander.
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re[2]: Importance of diacritics

2004-07-15 Thread Alexander Savenkov
2004-07-14T19:20:35+03:00 Peter Kirk <[EMAIL PROTECTED]> wrote:

>>>For in Russian these dots are considered highly optional, and
>>>e with dots (pronounced o or yo - a spelling rule prescribes this 
>>>instead of o after certain letters when stressed) is not a separate
>>>letter of the alphabet (contrast i kratkoe, Cyrillic i with breve, which
>>>is a fully separate letter from i).
>>>
>>
>>That’s wrong, Peter. The letter «ё» is a separate letter. Please don’t
>>spread your wrong assumptions in the list.
>>
> I meant this in the sense that the two letters are interfiled in
> dictionaries, e.g.

> елейный
> ёлка
> еловый
> ёлочный
> ель

> At least this is the ordering in my Collins Russian dictionary, and I
> understand it to be the standard Russian ordering. Am I wrong here?

Not at all, though different dictionaries use different approaches.
The practice of filing «е*» and «ё*» entries under one section is quite
common, it however doesn’t imply that one of these letters is not
a separate letter. There are two reasons for this kind of ordering:

1. The words with the letter «ё» are almost always written with «е»
so readers will tend to mix them.
2. The aren’t many words with initial «ё», so it's safe to file all
the «ё*» entries along with «е*».

> By contrast, и and й are not interfiled.

I can’t see why you put these as an example. They are completely
different letters (vowel and consonant), notwithstanding their similar
look. One of my dictionaries have a section called «Ъ-Ь» which
obviously means that the entries cover the letters «ъ», «ы», and «ь».
Unlike «Е, Ё» section the entries are sorted alphabetically. Just like
«е» and «ё» all of these letters are meant to be separate.

Alexander
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re[2]: Importance of diacritics

2004-07-14 Thread Alexander Savenkov
Hello,

2004-07-13T13:57:37+03:00 Peter Kirk <[EMAIL PROTECTED]> wrote:

> In the original Russian, the two dots would appear over the Cyrillic e
> only in rather specialised circumstances or in texts marked up 
> beginners.

Correct. Some people however would like to change that (i.e. so that
the dots are no longer optional).

> For in Russian these dots are considered highly optional, and
> e with dots (pronounced o or yo - a spelling rule prescribes this 
> instead of o after certain letters when stressed) is not a separate 
> letter of the alphabet (contrast i kratkoe, Cyrillic i with breve, which
> is a fully separate letter from i).

That’s wrong, Peter. The letter «ё» is a separate letter. Please don’t
spread your wrong assumptions in the list.

> And indeed the dotless e is
> reflected in the commonest English transcription, Khrushchev (and 
> similarly Gorbachev etc).

Regards,
Alexander
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re: Writing Tatar using the Latin script; new characters to encode?

2004-05-12 Thread Alexander Savenkov
Hello,

2004-05-12T03:08:59+03:00 Eric Muller <[EMAIL PROTECTED]> wrote:

> According to , there is a currently an effort to convert the
> writing of Tatar from Cyrillic to Latin.

> 1. Does somebody have more information about that effort?

Perhaps it's their own effort.

> Eki lists four characters as needed but missing in Unicode (see 
> <http://www.eki.ee/letter/chardata.cgi?lang=tt+Tatar&script=latin>).

> 2. The case pair for barred o is encoded (U+019F and U+0275), and it
> seems that their confusion comes from less-than-perfect but annotated
> name for U+019F, and from the usage remark "African". Can we 
> authoritatively tell them that those two characters are the ones they
> want? Can we add a "Tatar" usage remark to both?

Is there a need for this? You don't want to tell everyone on the net
about his or her wrong assumptions. There's one sentence in the page
you mentioned that gives a good description of this resource:

"The conversion from Cyrillic to Latin script is planned within years
2001-2011."

This is false.

> 3. The case pair n with descender is definitely not encoded, and from my
> memory of the discussion of ghe with descender, we would want to encode
> them as separate characters (rather than with combining descenders on
> "n"). Is anybody working on that proposal?

There's no Latin Tatar script. It's the law. Full stop.

It's the Institute of Estonian language. I hope they know more about
Estonian than about other languages and Unicode.

> PS: sorry for the double post to unicode and unicore. However, given the
> current state of [EMAIL PROTECTED], this seems the best course of action.

What's up with [EMAIL PROTECTED]
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re[4]: Fixed Width Spaces (was: Printing and Displaying DependentVowels)

2004-04-03 Thread Alexander Savenkov
2004-04-03T02:34:38+03:00 D. Starner <[EMAIL PROTECTED]> wrote:

>> > It only affects its (visual) aesthetic 
>> > quality. 
>>  
>> That is arguable. An aural user agent could pronounce "1, 2, 3" a bit
>> different from "1, 2, 3" if there is a (say) thin space between the
>> digits in the latter case. It could pronounce it quicker, for example.
 
> And it could read it as "thin space", too.

Yeah, and it could read it as “all your base are belong to us”. And
your browser renders it as a hollow square. Hey, and my cat can’t read
it at all. What’s the point of this?

> But it's questionable if any
> speech reader is going to try and interpret such ambiguous and rarely
> used characters specially.

As I already have said in another message, they’re not that ambiguous
and rarely used. If someone misuses them, that’s his problem.

> Even if it does, that doesn't make it plain
> text; italics and  can be interpreted by speech
> readers much more usefully, but are clearly not plain text.

You can’t markup everything just like you can’t make everything
plain text. I’ve no objections to . That’s just
another level.

Alexander.
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re[4]: Fixed Width Spaces (was: Printing and Displaying Depen dentVowels)

2004-04-03 Thread Alexander Savenkov
Hello,

2004-04-03T02:01:34+03:00 /|/|ike Ayers <[EMAIL PROTECTED]> wrote:

>> That is arguable. An aural user agent could pronounce "1, 2, 3" a bit
>> different from "1, 2, 3" if there is a (say) thin space between the
>> digits in the latter case. It could pronounce it quicker, for example.

>     It *could* do that, but, frankly, that would be a bad
> idea.  Speech synthesis devices have enough trouble with plain text
> as it is - adding special interpretation for neo-markup characters
> would just make things worse.  This belongs in the realm of
> (surprise!) markup.

I wonder why you call them neo-markup characters while they have been
used for years, not on the Web of course. Speech synthesizers have to
learn how to read (or at least skip) those characters in order to
facilitate listening comprehension just like a proof-read book
facilitates visual comprehension.

>     This seems to be international
> let's-merge-markup-into-plaintext month.

I'm not one of these.

Alexander.
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re[2]: Fixed Width Spaces (was: Printing and Displaying DependentVowels)

2004-04-03 Thread Alexander Savenkov
> Alexander Savenkov suggested:

>> Why not? I think Peter needs a good book on typesetting to find out
>> what is inserted inserted between "Louis" and "XIV". In this case IIRC
>> there should be the following sequence: Louis,ZWNBSP,SP,ZWNBSP,XIV.

Kenneth Whistler replied:

> Uh, no.  is equivalent to . In either
> case, the SPACE or the NBSP would be (potentially) subject to
> justification which would change their width.

> The argument was about how (or whether) to constrain the space
> between "Louis" and "XIV" to a *fixed* width under certain
> assumptions about justification. To do that, you would need to
> use a *fixed*-width space as a starting point (and then hope that
> the justification algorithm used for rendering doesn't ignore
> or override your choice, anyway).

First of all, you don’t want a fixed-width space in this particular
case, just a non-breaking one (ask the typesetters).

A fixed-width space is desirable in cases like:

“As he was setting the ion gun in place, his fingers had slipped
through the portal—ordinarily no problem, but this morning his hip
had also brushed the toggle switch on the control panel at the left
of the portal.”

You’d type “portal,ZWNBSP,THIN SP,ZWNBSP,EM DASH,THIN SP,ordinarily”.

Alexander.
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re[2]: Fixed Width Spaces (was: Printing and Displaying DependentVowels)

2004-04-02 Thread Alexander Savenkov
Hello,

sorry for the late response.

2004-04-01T03:47:40+03:00 Kenneth Whistler <[EMAIL PROTECTED]> wrote:

> Other possible approaches that any industrial-strength
> typesetting program ought to provide:

...

> The point is that looking to encode a special character in
> Unicode for every distinct visual effect in typesetting is
> not necessarily the first, best solution to settle on. It
> might not even be seventh or eighth best on the possible
> list of alternative approaches to solve the problem.

Why not? I think Peter needs a good book on typesetting to find out
what is inserted inserted between "Louis" and "XIV". In this case IIRC
there should be the following sequence: Louis,ZWNBSP,SP,ZWNBSP,XIV.

Alexander.
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re[2]: Fixed Width Spaces (was: Printing and Displaying DependentVowels)

2004-04-02 Thread Alexander Savenkov
Hello,

and sorry for the late response.

2004-04-01T05:41:02+03:00 fantasai <[EMAIL PROTECTED]> wrote:

>>>> But, as Ken has just clarified, with NBSP Louis' neck may be
>>>> stretched rather uncomfortably, if not cut completely. Here is what I
>>>> don't want to see (fixed width font required):
>>>>
>>>> Louis   XVI   was
>>>> guillotinedin
>>>> 1793.
>>>
>>> This, however, is a matter of presentation rather than semantics, and
>>> as such fitly belongs in the realm of presentational markup.  In HTML,
 >>> one might specify   to generate a fixed-width space.
>>
>> I disagree. Surely there is something SEMANTICALLY different about the
>> space in "Louis XVI". One semantic difference is that it is 
>> non-breaking. But another one is that these words should not be split
>> apart. An additional semantic distinction might be that they should be
>> treated as one word for the purposes of word breaking algorithms.

> non-breaking and non-stretching are presentational properties, not
> semantic ones. They don't change the meaning of the space: it's still
> just a space, not a hyphen or the letter "g". They don't affect
> non-visual media; we don't break lines in spoken speech. "Louis XVI"
> is semantically different from "Louis' head" because the former is a
> bare noun whereas the latter is a noun phrase, but as far as the reader
> is concerned, they're both separated with "a space". Whether the space
> breaks or not or stretches or not has no effect on either the meaning
> or correctness of the text. It only affects its (visual) aesthetic
> quality.

That is arguable. An aural user agent could pronounce "1, 2, 3" a bit
different from "1, 2, 3" if there is a (say) thin space between the
digits in the latter case. It could pronounce it quicker, for example.

Alexander.
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re: Tajik alphabet code

2004-03-01 Thread Alexander Savenkov
Hello Aso,

2004-03-01T11:18:34+03:00 Asomiddin Atoev <[EMAIL PROTECTED]> wrote:

> I am emailing on behalf of the Tajikistani state
> working group on localizing software for Tajik
> language. Could you please kindly guide us to be in
> right direction. What shall be the procedure of
> standartization of alphabet symbols? Tajik alphabet
> makes use of cyrillic symbols and contains of 35
> letters.

As far as I know all of the Cyrillic letters contained in the Tajik
alphabet are already encoded in the Unicode standard. See the code
charts at: http://www.unicode.org/charts/ and particularly
http://www.unicode.org/charts/PDF/U0400.pdf.

I heard the Tajik alphabet was reformed, if you could send a picture
or a link to the modern alphabet then someone (e.g., me) could provide
the exact codepoints for you if you're in doubt.

Best regards,
Alexander.
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re[2]: American English translation of character names

2003-12-18 Thread Alexander Savenkov
Hello,

2003-12-18T18:38:28Z Michael Everson <[EMAIL PROTECTED]> wrote:

> At 16:21 +0100 2003-12-18, Philippe Verdy wrote:
>>John Cowan wrote:
>>>  The most mysterious term is "caron" for the hacek accent: this word
>>>  seems to exist only in ISO standards, and nobody has any idea where it
>>>  came from.
>>
>>I think it may have occured in some typographic terminology, because
>>the intial glyph looked more like a crochet hook than to a reversed 
>>circumflex, i.e. caron was not angular in handwritten form, as it is
>>now in typesetted fonts, but looked like a rounded and oblique check
>>mark (a slight variation of the accute accent with a small rounded 
>>hook on its bottom end, but still much more distinctful from the 
>>lower half-circle form used by breve).

> This doesn't make any sense to me, but in any case it does not 
> explain the origin of the word "caron". The most plausible suggestion
> I've ever come up with is folk-etymological: It's a CARet that sits 
> ON the vowel. :-(

The first time I saw the word caron it reminded me of the Russian word
«корона» (korona, cf. crown, coronet, or Corona beer). The accent
obviously looks like a small crown, hence may be the name. I guess we
could find the mysterious word in one of the languages using haceks.

Native speakers are needed.

-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re[2]: [OT] CJK -> CJC (Re: Corea?)

2003-12-17 Thread Alexander Savenkov
Hello,

2003-12-17T14:36:37Z Philippe Verdy <[EMAIL PROTECTED]> wrote:

> Marco Cimarosti wrote:
>> Doug Ewell wrote:
>> > I'll go farther than that.  It's always bothered me that speakers of
>> > European languages, including English but especially French, have seen
>> > fit to rename the cities and internal subdivisions of other countries.
>> 
>> Rightly said!
>> 
>> There is reason to rename "Colonia" to "Koln", "Augusta" to "Augsburg",
>> "Eboraco" to "York", "Provincia" to "Provence", and so on.

> Or even "Aix-la-Chapelle" to "Aachen" because that's its _current_ German
> name (the French name was official in the history, and is still used in
> French).

> Cities sometimes change name, some of theme being famous like the _current_
> Saint-Petersbourg (French name revived in Russia with just a

It's Saint-Petersburg (or St. Petersburg) if you write in English.
The name has German roots, not French ones.

> transliteration, the Latin transcription being also widely used by Russians)

Why would Russians use "the Latin transcription" for a Russian name?

> which has also been Leningrad or Petrograd or Stalingrad

Stalingrad was the previous name for Volgograd, not St. Petersburg.
The initial name was Tsaritsyn.

Petrograd on the other hand *was* the name of St. Petersburg in
1914-1924. Leningrad was the name of it in 1924-1991.

> (in the Latin
> transliteration of the official and changing Russian script name, this Latin
> transliteration changing a bit among various languages which used them), and
> even Saint-Petersbourg officially for some time in the tsar's Russia.

I wonder what you meant by the "some time" part. St. Petersburg was
founded in 1703, and therefore stayed St. Petersburg for more than 200
years, that is it was St. Petersburg *most* of the time.

You mixed everything up, Phillippe.

Regards,
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re: Stability of scientific names, was Stability of WG2

2003-12-17 Thread Alexander Savenkov
Hello,

2003-12-17T11:06:32Z Curtis Clark <[EMAIL PROTECTED]> wrote:

> on 2003-12-16 15:27 Peter Kirk wrote:

>> I'm no expert on this... 

> I am. :-)

>> but I thought that species could be transferred 
>> from genus to genus as knowledge advances. 

> As John pointed out, the epithet stays the same.

>> And presumably obvious 
>> spelling mistakes are corrected (contrast "FHTORA" in U+1D0C5), or are
>> you saying that if the first publication had "Brontosuarus" as a typo
>> this error would remain for ever?

> There are errors and then there are errors. Some are correctable, some
> are not, and botanists and zoologists have different rules about this.
> An example that's not entirely OT: There was a Russian physician with
> the last name Эшшолц - a "cyrillicization" of his German family name

He was Эшшольц actually. You forgot the "soft sign".

(I'm not sure everyone will see the name - the editor replaced the
encoding with windows-1251, and there's no UTF-8 support).

Regards,
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re: [OT] Re[2]: Swastika to be banned by Microsoft?

2003-12-14 Thread Alexander Savenkov
Michael,

2003-12-14T18:48:17Z Michael Everson <[EMAIL PROTECTED]> wrote:

>>References: [1] ISBN 5-85141-016-7, p. 38.

> What is this and how does one find it? Amazon.ru? :-)

A book on history, as a matter of fact. Try visiting ozon.ru if you
can read Russian. I personally got if from the author at the
exhibition here in Moscow recently.

I though I should post the reference to the exact source instead of
sending the unfounded assertions, nothing more than that.

Alexander.
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




[OT] Re[2]: Swastika to be banned by Microsoft?

2003-12-14 Thread Alexander Savenkov
Hello Philippe,

2003-12-14T16:56:10Z Philippe Verdy <[EMAIL PROTECTED]> wrote:

Snip...

> I find nothnig wrong in proposing a font which does not have these symbols
> for use in European scripts, where the occurence of the symbol is almost
> always associated to the Nazi's party, but I think it would be wrong to
> remove them from fonts designed for Asian markets that need it to represent
> their script, in a context where such association is not self-evident.

AFAIK the use of the symbol is forbidden in the names/logos
of organizations in Russia. It, unsurprisingly, was present here before,
e.g., it was printed on the Russian money in 1917 or so.

...snip...

> However there's still a problem with the ancient scandinavian usage:

I guess it is more appropriate to speak about the ancient Babylonian
or rather ancient Russian (Boreal) use. See, for instance, [1].

> it's
> not clear that the symbol would only fit in Asian fonts. However the symbols
> could be present in fonts made to represent old European scripts such as
> Runic, even if they have been used in translations to Roman-Latin or
> Church-Latin of these texts, with a Latin or Gothic script, or even in some
> other Uralic languages.

References: [1] ISBN 5-85141-016-7, p. 38.

Regards,
Alexander.
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re[2]: Berber/Tifinagh (was: Swahili & Banthu)

2003-11-10 Thread Alexander Savenkov
Hello,

2003-11-09T21:41:25Z Michael Everson <[EMAIL PROTECTED]> wrote:

> At 19:30 +0100 2003-11-09, Philippe Verdy wrote:

>>So my question is, once again: would a font that would display pointed Latin
>>glyphs from Tifinagh script code points really break the Unicode model?

> Yes, Philippe. It is the same thing as mapping Cyrillic to ASCII 
> letters. It is a hack. It is to be avoided. It is the Wrong Thing To 
> Do.

I'm not sure I'm not taking your words out of the context, Michael.
"The Wrong Thing To Do" can be seen everywhere in the newspapers when
the names and some other words originally written in Cyrillic and
other scripts are letter-by-letter (mapped?) transliterated to the
resulting script.

I can guess you're aware of the Russian GOST (state standard) accepted
IIRC by the ISO which maps the Russian letters to the Latin letters or
their combinations for different scripts. It allows people to
recognize the original name and to trace the original orthography
back. It also lets the people without the knowledge of Cyrillic
letters to read and write the Cyrillic names. Therefore such mapping
isn't a hack but a Right Thing To Do.

Best regards,
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/




Re[2]: GDP by language

2003-10-23 Thread Alexander Savenkov
Hello everyone,

2003-10-22T21:53:44Z Peter Kirk <[EMAIL PROTECTED]> wrote:

...snip...
> The data doesn't support addition to this degree of accuracy because of 
> the effect of the "others" area. Cyrillic may even overtake Arabic, 
> because there are several countries using the Cyrillic alphabet, but not 
> Russian or Ukrainian, which might each contribute 0.1-0.2%, but no 
> countries as far as I know using Arabic script but not Arabic, Persian 
> or Urdu as official languages (except perhaps Pashto in Afghanistan).

I know that Tajik is currently written and taught in Arabic script in
some schools. That's another one. There may be more.

Best regards,
-- 
  Alexander Savenkovhttp://www.xmlhack.ru/
  [EMAIL PROTECTED] http://www.xmlhack.ru/authors/croll/