Re: Terms constructed script, invented script (was: FW: Re: Shavian)

2001-07-11 Thread Marcin 'Qrczak' Kowalczyk

7 Jul 2001 11:01:18 GMT, Marcin 'Qrczak' Kowalczyk [EMAIL PROTECTED] pisze:

 I put a sample at http://qrczak.ids.net.pl/vi-001.gif

Now I put a prettier version there: with variable line width, serifs,
and by a slightly improved sizing engine (enlargement of rounded parts
to make them look the same size as straight parts happens locally
instead of only at the top and bottom of a letter), and with all
dots looking exactly the same due to rounding coordinates of their
centers to whole pixels (or whole pixels and a half, in case of an
even dot size).

I still can't have serifs on ends of slanted lines, but they happen
only in ASCII shapes, not in my script, so I'm not sure that I want
them badly enough. Serifs are really triangles, so they look like
traditional serifs only in small pixel sizes like that one.

It would be nice to be able to draw it with TeX, but I don't know
TeX well enough. I will not reimplement the whole Metafont myself
either:-)

-- 
 __("  Marcin Kowalczyk * [EMAIL PROTECTED] http://qrczak.ids.net.pl/
 \__/
  ^^  SYGNATURA ZASTPCZA
QRCZAK





RE: Terms constructed script, invented script (was: FW: Re: Shavian)

2001-07-09 Thread Ayers, Mike


 From: Edward Cherlin [mailto:[EMAIL PROTECTED]] 

 The 'tsu' sign in reduced form is traditionally used in Japanese for 
 consonant doubling (chyotto is written chi yo tsu to), but 
 has been adapted 
 for glottal stops at the end of words.

Odd.  I've always considered Japanese double consonants to be
glottal stops.  Could anyone please explain the difference?


Thanks,

/|/|ike




RE: Terms constructed script, invented script (was: FW: Re: Shavian)

2001-07-09 Thread [EMAIL PROTECTED]
   Odd.  I've always considered Japanese "double consonants" to be
 glottal stops.  Could anyone please explain the difference?

They are glottal stops.  But Japanese writing doesn't have a (standard)  
means of expressing a glottally stopped vowel pair.  It only can express  
consonants.  One supposes that a small "tsu" would suffice, e.g.  
$B%O%t%!%$%C%$(B = hawai'i... And probably has already been used somewhere  
to that effect.  As Ed Cherlin pointed out, "tsu" has been adapted for  
word-final consonants... in that sense, "tsu" is effectively used as a virama  
already.

I still don't know if there's any Japanese phonetic scholarship that  
distinguishes "L" and "R"...

Rick


Re: Terms constructed script, invented script (was: FW: Re: Shavian)

2001-07-07 Thread DougEwell2

In a message dated 2001-07-06 0:31:39 Pacific Daylight Time, [EMAIL PROTECTED] 
writes:

 I wonder: why aren't languages with simple syllabic structures written in 
 hiragana? It seems to be built for them. 

Hiragana (and katakana) assume certain things about the syllabic structure, 
specifically that syllables are of the form [C] V [C], where the trailing 
consonant (if any) must be n.  Pairs of consonants like st and tr 
within a single syllable aren't supported in kana.  Neither are consonants 
like th and vowels like short a as in ash.

The kana are not built for languages with simple syllabic structures in 
general, but for a specific language of that type: Japanese.  It serves that 
language very well, but would not work so well for, say, Spanish or Italian 
(which could also be said to have relatively simple syllabic structures).

11 later wrote:

 Latin letters were not invented at any one time, they evolved, sort of.
 Same with our European digits.

That's an interesting point.  As we have been saying, all scripts are created 
by humans; none are natural in the sense that they sprout from the ground 
or wash up on the shore.  But is there a difference, in some people's minds 
at least, between a script that is invented more or less intact versus one 
that evolves over time?  Does this kind of evolution somehow add credibility 
to a script?  What about Cyrillic; isn't it pretty much the same as when it 
was introduced?

-Doug Ewell
 Fullerton, California




Re: Terms constructed script, invented script (was: FW: Re: Shavian)

2001-07-07 Thread Marcin 'Qrczak' Kowalczyk

In a message dated 2001-07-06 0:31:39 Pacific Daylight Time, [EMAIL PROTECTED] 
writes:
 
 I wonder: why aren't languages with simple syllabic structures
 written in hiragana? It seems to be built for them.

I am using my own script inspired by hiragana 10 years ago for writing
Polish. It looks very differently, I only liked the idea of having
letters for consonant+vowel pairs and stretched it a bit.

I put a sample at http://qrczak.ids.net.pl/vi-001.gif (resolution
suitable for printing at 300dpi). For example the subject says:
Re: vi (Re: O wyższości znaku zachęty nad GUI), i.e. Re: vi (Re:
About the superiority of command-line prompt over GUI), which has
only 11 letters between the second Re: and GUI.

I won't dare proposing to encode it in Unicode. The number of users
is approaching two. But technically it's an interesting script with
a non-trivial rendering engine. I implemented the rendering engine
and a translator from standard Polish orthography (not perfect due to
ambiguities in our orthography - I modified the orthography a little
to resolve them). I did it to practice reading. I could only practice
writing before - it's hard to read what you just wrote, because you
remember what you wrote!

Letters are composed from core characters by the engine. There
are 35 consonants, 8 normal vowels, 1 extra vowel, joiner, and
non-joiner. They produce an unbounded number of letters.

(1) Adjacent consonants are joined up to some limit (2 is a good
choice, but there is no semantic difference here) and they are joined
with the following vowel if present (this is mandatory).

(2) A consonant+vowel pair must be split if this is a border
between a prefix and a stem or the like. Such pairs are also split
in some foreign words to force correct pronunciation (pronunciation
of a consonant sometimes depends on the following vowel and vice
versa). Non-joiner is used to encode such splitting in the stream of
core characters.

(3) The default (greedy) splitting of chunks of consonants is not
always perfect, e.g. when it would join a final part of a prefix with
the beginning of the stem. Joiner and non-joiner are used to prevent or
force splitting at certain points between consonants. Forced joining
overrides the limit of joined consonants.

(4) Any two letters can be joined by writing one above another with a
dot between. This is never required by the orthography but is sometimes
a good style, e.g. in the od prefix and in diphtongs. Joiner is
used to encode that.

Finally there are cases where a consonant+vowel pair is split according
to (2) and then joined according to (4). I am encoding such case with
joiner + non-joiner + joiner. I think that there is already a similar
practice in Unicode used for Arabic ligatures.

Actually I'm not using even PUA characters but an ASCII-based escaping
scheme, because I don't have an editor capable of editing text in
such a script. But simple non-joined letters put in a font with the
ability to directly edit joiners and non-joiners would be technically
workable. The meaning of a text file would then be unambiguous modulo
PUA assignment (the ASCII-based escaping is a hack).

-- 
 __(  Marcin Kowalczyk * [EMAIL PROTECTED] http://qrczak.ids.net.pl/
 \__/
  ^^  SYGNATURA ZASTĘPCZA
QRCZAK





Re: Shavian

2001-07-07 Thread Michael Everson

At 01:25 -0400 2001-07-07, [EMAIL PROTECTED] wrote:

Furthermore, the Tengwar and the Cirth have been used and are being used
   to write English as well as JRRT's invented languages, and in such a way
   that (unlike Pigpen, e.g.) they cannot be taken as mere ciphers.

Very true.  How did these usages develop?  Are they mentioned in Tolkien's
manuscripts, or did they develop later?

The text on the title page of the Lord of the Rings is written (by 
Tolkien) in English in both Tengwar and Cirth.
-- 
Michael Everson




Re: Shavian

2001-07-07 Thread Michael Everson

At 13:19 -0700 2001-07-06, Kenneth Whistler wrote:

Aha! I see you are more of a true believer than the true believers.
So your problem is that the *wrong* group is claiming jurisdiction
here, and you would prefer to wait for the Númenoreans to show up
in person at WG2 before progressing any Tengwar proposal.

Perhaps I'll go to a ring of standing stones during the next eclipse 
and call some hither. Or perhaps they will just name Ireland as proxy.

Lots of other minorities have

:-)
-- 
Michael Everson




Re: Shavian

2001-07-07 Thread Michael Everson

At 12:25 -0700 2001-07-06, Michael \(michka\) Kaplan wrote:

While fully recognizing the importance of Middle Earth to some people it is
difficult for me to get past the fact that there Middle Earth has no
national representative to WG2?

Neither did Old Italic, which was sponsored by Ireland and the 
Unicode Consortium. Not, you know, Italy.

And the same people who push so hard for these scripts may never know why it
is so hard to find experts for other scripts who are willing to help.
Perhaps those experts just did not want their scripts to be given equal
footing with those of the _Elvish Linguistic Fellowship_ or similar
organizations?

Michka, the only thing I can say is nonsense. You've given no 
evidence whatsoever that any particular group eschewed Unicode 
because it had scripts in it they didn't like. Some members of WG2 
complained that there were too many minority scripts roadmapped, to 
which only one answer could be given: be happy that your script 
needs are met, and remember that others' rights to their scripts are 
just as important as yours. The UCS is, indeed, intended to be 
Universal, and there's no getting around that.

Sometimes it's hard to find Nabataean experts because there aren't 
very many, and a lot of traditional philologists don't use e-mail. 
Sometimes the real experts are dead (like Lytkin, who published 
definitively on Old Permic). I can't think of a case where anyone 
said, oh, we can't bother with Unicode because they plan to encode 
Tengwar.

You can just call me a consciencious objector to having anyone who
subscribes to Vinyar Tengwar considering themselves to be among the
Númenoreans (a.k.a. the Dúnedain), who alone of all the races of Men knew
Elvish tongues. :-)

Most subscribers think of themselves as linguists interested in 
having a bit of fun.
-- 
Michael Everson




Re: Terms constructed script, invented script (was: FW: Re: Shavian)

2001-07-07 Thread DougEwell2

In a message dated 2001-07-07 9:30:56 Pacific Daylight Time, 
[EMAIL PROTECTED] writes:

  Cherokee is another syllabary that is very language-specific. And Etruscan 
  was pretty much forced into an alphabet--with its long consonant clusters, 
  a syllabary would have been unwieldy.

Another example is Canadian Syllabics.  VERY difficult to write English using 
that script.

-Doug Ewell
 Fullerton, California




Re: Terms constructed script, invented script (was: FW: Re: Shavian)

2001-07-07 Thread [EMAIL PROTECTED]
Hiragana (and katakana) assume certain things about the syllabic structure,
specifically that syllables are of the form [C] V [C], where the trailing
consonant (if any) must be "n".

Yes, but, kana _has_ been used even natively in comics and so forth, to end  
words with other consonants (i.e., eliding the last vowel) for example:   
$B%$%s%9%?%s%H%C!&%9!<%W%C(B

The biggest problem with using kana for a wide variety of languages, aside  
from having a severely limited number of consonants  vowels even with  
extension, is that it doesn't express adjacent non-identical consonants at  
all.

Kana should be quite adequate for some other languages...  Hawaiian?  Oh,  
hmmm, well, except for that darned L/R distinction which kana doesn't have...  


Uh... Never mind...

Rick


Re: Terms constructed script,invented script (was: FW: Re: Shavian)

2001-07-07 Thread Edward Cherlin

At 12:20 PM 2001-07-07, Michael Everson wrote:
At 11:16 -0700 2001-07-07, [EMAIL PROTECTED] wrote:

Kana should be quite adequate for some other languages...  Hawaiian?  Oh,
hmmm, well, except for that darned L/R distinction which kana doesn't have...

Neither does Hawai'ian.

So then all you need is a kana rendering for a glottal stop-vowel syllable, 
like 'i in hava'i or 'a in a'a (a type of lava). Based on the manga I have 
seen, the most likely combination is the small 'tsu' followed by a vowel. 
The 'tsu' sign in reduced form is traditionally used in Japanese for 
consonant doubling (chyotto is written chi yo tsu to), but has been adapted 
for glottal stops at the end of words.

If anyone cares, that is.

--
Michael Everson


Edward Cherlin
Generalist
A knot! Oh, do let me help to undo it.
Alice in Wonderland





Re: Shavian

2001-07-06 Thread Michael \(michka\) Kaplan

From: [EMAIL PROTECTED]

 What's bad is that work seems to get done on fictional scripts while
there
 are still millions of real people (some of whom even have access to
 computers) who can't express texts of their natively-used languages with
 Unicode because we don't have their scripts encoded.

This is (I think) one large reason for the contention -- a combination of
the reaction of people who argue against fictional scripts due to what they
fear will be the appearance of the above, and of course the people who
actually do not like the appearance. Can we truly blame them?

 The most common reason for not having enough information is that
 we can't shlep enough experts to us, nor shlep enough of us to the
experts,
 to complete any encoding proposals... a matter of time and funds.

It is even quite likely that work on fictional scripts can even serve to
undermine attempts to get alternate funding to assist with the difficulties
you cite here.

Example: I do not think I would find fault in the character of an
organization or individual that would choose not to assist in the work to
encode Egyptian Hieroglyphics if they saw their script in the same
consideration list as something like Klingon.

The only valid justification I have seen for encoding scripts like Desert is
the one from John Jenkins:

The problem was, a vicious circle quickly arose. Nobody
started to implement surrogates because there were no
characters encoded using them, and nobody wanted their
characters to be encoded using surrogates because nobody
was implementing them.

To break the vicious circle, we needed a writing system
which was at once real and at the same time so incredibly
rare and/or dead that nobody would object to its being
encoded with surrogates. I volunteered the Deseret Alphabet
as such as script, and it was quickly accepted.


Well, now with the strength of tens of thousands of new characters that
*are* important to a large number of people, this argument cannot really be
applied to fictional scripts as effectively. No one needs Tengwar or Cirth
(or Shavian!) to make people believe that surrogates need to be supported,
if indeed they ever truly did.

MichKa

Michael Kaplan
Trigeminal Software, Inc.
http://www.trigeminal.com/






Re: Shavian

2001-07-06 Thread John Cowan

[EMAIL PROTECTED] scripsit:

 Shavian and Deseret are examples of  
 scripts that needn't have been encoded now, and aren't very widely used, and  
 aren't _NEEDED_ by anyone at all, but were encoded because a while back  
 someone just happened to have done the work, and the proposals have just been  
 sitting around gathering dust.  Might as well get them in, because nothing  
 more needs to be done to the proposals.

I think that talking of _NEED_ quickly gets you into trouble.  Why do people
NEED to use their native script on computers at all?  Given all the other
things they have to learn, learning a romanization of it should be no sweat.
Every educated person in Greece, e.g., knows the Latin alphabet and can apply
it to Greek text, so there is no NEED to encode Greek at all.

Instead, we should look at what people WANT to use.  People WANT to use
hieroglyphics for Egyptian text, Gothic for Gothic text, rovasiras or
ogham for pseudo-secret communications.  Therefore these scripts
should go in, given that there are enthusiasts who will do the work.

 What's bad is that work seems to get done on fictional scripts while there  
 are still millions of real people (some of whom even have access to  
 computers) who can't express texts of their natively-used languages with  
 Unicode because we don't have their scripts encoded.  There are various  
 reasons for that, the most common being that we can't get enough information  
 about them.  The most common reason for not having enough information is that  
 we can't shlep enough experts to us, nor shlep enough of us to the experts,  
 to complete any encoding proposals... a matter of time and funds.

Just so, which means that the energy spent on invented scripts is nowise
taken away from the energy that could be spent on obscure-but-real scripts.
Would that it were otherwise.

-- 
John Cowan   [EMAIL PROTECTED]
One art/there is/no less/no more/All things/to do/with sparks/galore
--Douglas Hofstadter




Re: Shavian

2001-07-06 Thread DougEwell2

In a message dated 2001-07-05 21:02:05 Pacific Daylight Time, 
[EMAIL PROTECTED] writes:

  A lot of the arguments against Klingon weren't specificially against
  Klingon; they were more against any fictional scripts in Unicode. The
  editorial response to comments from national groups, in the public archive
  of ISO 10646 stuff that you linked to at the start of this message, 
included
  a complaint about Deseret from the German Standards body, in that it was
  inappropriate for being a fictional script. The response to that was
  bascially Not really, IIRC. That does not bode well for lack of 
contention
  for later scripts.

I do feel that there is a difference between:

(a) scripts like Shavian and Deseret, which were invented in a completely 
serious vein, in an attempt to provide an alternative and presumably better 
means of writing a real language, but didn't quite catch on; and

(b) truly fictional scripts like Klingon, Tengwar, Cirth, and such that 
appear in novels or TV or movies and were never intended to be used seriously.

Both G.B. Shaw and the Mormons had genuine, if not universally shared, 
reasons for wanting to abandon the Latin script for writing English in favor 
of something better.  Shaw thought English literacy could be improved with 
a more regular writing system to take the place of the convoluted Latin-based 
orthography.  (There are also rumors of darker motives, but the intent was 
still for serious use.)  Brigham Young wanted to isolate the Mormons from the 
rest of the corrupt world of written English.

Compare the motivations behind these scripts to that of scripts that appear 
in fictional literature and popular culture.  Although nobody denies the 
greatness of J.R.R. Tolkein as an author and scholar, it is extremely 
unlikely that he intended the beautiful and carefully designed Tengwar and 
Cirth scripts to be used by real humans to write real languages for use in 
everyday life.  Nor did Marc Okrand and other creators of Star Trek likely 
intend the Klingon script and/or language to be used seriously, in the same 
sense as Shaw or Young.  This goes double for some of the other scripts 
listed in the ConScript registry.  Some appear *only* on the author's Web 
pages, alongside elaborate descriptions of fantasy worlds.

I do believe that original intent has something to do with the legitimacy 
of a script for consideration in Unicode.

Remember that all scripts, including Latin, Arabic, Han, Shavian, and 
Klingon, were invented by humans.  There are differences having to do with 
how long ago, by whom, for what purpose, and how widely adopted, but there is 
no such thing as a natural script against which artificial scripts may be 
contrasted.

-Doug Ewell
 Fullerton, California




Re: Shavian

2001-07-06 Thread Michael \(michka\) Kaplan

From: John Cowan [EMAIL PROTECTED]

 Just so, which means that the energy spent on invented scripts is nowise
 taken away from the energy that could be spent on obscure-but-real
scripts.
 Would that it were otherwise.

No one is arguing the FACTUAL basis for the above, but it is quite
reasonable to argue the perception, and its effect on credibility.
Especially if the desire is to inspire more wallets to be opened and help
with the effort to encode these historical scripts?


MichKa

Michael Kaplan
Trigeminal Software, Inc.
http://www.trigeminal.com/






Terms constructed script, invented script (was: FW: Re: Shavian)

2001-07-06 Thread $B$F$s$I$&$j$e$&$8(B



$B$i$s$^(B $B!z$8$e$&$$$C$A$c$s!z(B
$B!!!_$"$+$M(B 
$B!

Re: Shavian

2001-07-06 Thread David Starner

[EMAIL PROTECTED] writes:
 Shavian and Deseret are examples of
 scripts that needn't have been encoded now, and aren't very widely used,
and
 aren't _NEEDED_ by anyone at all, but were encoded because a while back
 someone just happened to have done the work, and the proposals have just
been
 sitting around gathering dust.

Shavian, at least, has a body of users that are ready and willing to use
Unicode. I have never seen or heard of actual use of Unicode for Cherokee
(and I have done some browsing of the appropriate sites and a little talking
to users); after sticking Shavian in a web search early today, I've seen
several examples of Unicode being used for Shavian (Conscript encoding), and
several comments about being prepared to switch to a real Unicode encoding.

 What's bad is that work seems to get done on fictional scripts while
there

Really? There's only one fictional script encoded, and one on the fast track
to encoding. Both those are simple non-shaping, non-combining LTR scripts
with a very well defined closed set of characters. It probably only took an
afternoon to write up either of them. I think that more effort has been
wasted debating fictional scripts on [EMAIL PROTECTED] than it will take
to get them encoded.

 There are various
 reasons for that, the most common being that we can't get enough
information
 about them.  The most common reason for not having enough information is
that
 we can't shlep enough experts to us, nor shlep enough of us to the
experts,
 to complete any encoding proposals... a matter of time and funds.

How does encoding fictional scripts affect this one way or another?

--
David Starner - [EMAIL PROTECTED]





Re: Shavian

2001-07-06 Thread David Starner

From: Michael (michka) Kaplan [EMAIL PROTECTED]
 Example: I do not think I would find fault in the character of an
 organization or individual that would choose not to assist in the work to
 encode Egyptian Hieroglyphics if they saw their script in the same
 consideration list as something like Klingon.

Could you find fault with some one who chose not to work with Cham because
it was in the same consideration list as the (dead) Egyptian
Hieroglyphics?

--
David Starner - [EMAIL PROTECTED]





Re: Shavian

2001-07-06 Thread David Starner

[EMAIL PROTECTED] writes:
 I do feel that there is a difference between:

 (a) scripts like Shavian and Deseret, which were invented in a completely
 serious vein, in an attempt to provide an alternative and presumably
better
 means of writing a real language, but didn't quite catch on; and

 (b) truly fictional scripts like Klingon, Tengwar, Cirth, and such that
 appear in novels or TV or movies and were never intended to be used
seriously.

I agree. The first are monuments to men's ego, like the great statue to
Ozymandius, failed attempts to improve the world by fiat. The second are
monuments to men's greatness, offerings of joy and entertainment to the
world, offerings that were gladly accepted, offerings that actually improved
the world.

 I do believe that original intent has something to do with the
legitimacy
 of a script for consideration in Unicode.

Why? IMO, characters and scripts should be encoded based on whether people
are actually using them today, and will actually be using them tomorrow.
(Roughly the same principle as not [...] idiosyncratic, personal, novel,
rarely exchanged, or private-use characters, nor [...] logos or graphics.)
There are a lot of political reasons why some characters or scripts
shouldn't be encoded - they shouldn't matter.

--
David Starner - [EMAIL PROTECTED]





Re: Shavian

2001-07-06 Thread James Kass


Doug Ewell wrote:

 
 Both G.B. Shaw and the Mormons had genuine, if not universally shared, 
 reasons for wanting to abandon the Latin script for writing English in favor 
 of something better.  Shaw thought English literacy could be improved with 
 a more regular writing system to take the place of the convoluted Latin-based 
 orthography.  (There are also rumors of darker motives, but the intent was 
 still for serious use.)  Brigham Young wanted to isolate the Mormons from the 
 rest of the corrupt world of written English.
 

The intent of the Deseret alphabet may have had more to do
with promoting literacy.  The vagaries of English spelling were
possibly perceived as too much of a burden for recent European
immigrants when combined with the hardship of learning spoken
English.   At the time the Deseret script was being promoted,
the Saints were pretty much geographically isolated from most
of the Gentiles, so censorship wouldn't have been much of a problem.

Best regards,

James Kass.
 





Re: Shavian

2001-07-06 Thread Michael Everson

At 03:03 -0400 2001-07-06, [EMAIL PROTECTED] wrote:

Compare the motivations behind these scripts to that of scripts that appear
in fictional literature and popular culture.  Although nobody denies the
greatness of J.R.R. Tolkein as an author and scholar, it is extremely
unlikely that he intended the beautiful and carefully designed Tengwar and
Cirth scripts to be used by real humans to write real languages for use in
everyday life.

Tolkien knew that others shared his secret vice and was delighted 
when school children wrote him to say they had made an Elvish 
vocabulary. He wrote letters to some of them using his alphabets. He 
published an appendix explaining the writing system. Why? For the joy 
of it. Why do people devise Tengwar modes for Hebrew and Yiddish, and 
practice Tengwar calligraphy in Russian and Sindarin?

And they encode texts using it. Would Tolkien have approved? Very 
much so, I am sure -- he would have been fascinated by it.

We don't use Egyptian or Luvian or Ogham for use in every day life, 
well at least most of you don't (I confess to refrigerator notes 
written in the script-of-the-week).
-- 
Michael Everson




Re: Shavian

2001-07-06 Thread Michael Everson

At 08:00 +0100 2001-07-06, David Starner wrote:

Could you find fault with some one who chose not to work with Cham because
it was in the same consideration list as the (dead) Egyptian
Hieroglyphics?

I don't believe the Vietnamese national body (which originally 
sponsored Cham, providing me with a very big Cham dictionary to help) 
holds this view. The reason the Vietnamese are not pressing Cham 
encoding is that they don't have funds for it now. I can't press it 
because I need their input.
-- 
Michael Everson




Re: Shavian

2001-07-06 Thread Michael Everson

At 03:50 +0100 2001-07-06, David Starner wrote:

A lot of the arguments against Klingon weren't specificially against
Klingon; they were more against any fictional scripts in Unicode.

But arguments don't hold water. Criteria for encoding scripts or 
symbols are that (1) they are used by enough people who need to 
transfer data, (2) they are important enough historically with regard 
to the representation of the recorded data of humankind. Now some of 
(2) have a handful of documents and hardly any users. Some of (1) are 
fictional (whatever that means -- all writing systems are 
artifacts) but have a great many users.

Klingon failed not because it was Klingon, but because speakers of 
that language themselves don't really use it except ornamentally, as 
gifs in web pages and so on. Translations of Hamlet are published in 
the Latin script. Grammars and dictionaries use the Latin script.

The editorial response to comments from national groups, in the public archive
of ISO 10646 stuff that you linked to at the start of this message, included
a complaint about Deseret from the German Standards body, in that it was
inappropriate for being a fictional script. The response to that was
bascially Not really, IIRC. That does not bode well for lack of contention
for later scripts.

The folks at DIN were wrong about Deseret, in my opinion. It seems to 
me that they did not know what Deseret was. Whether it had a long 
life is irrelevant. That script is of cultural importance to a rather 
sizeable community of people, and, is of interest to students of 
writing systems, English phonology, and the history of Mormonism.
-- 
Michael Everson




Re: Shavian

2001-07-06 Thread Michael Everson

At 07:15 +0100 2001-07-06, David Starner wrote:

What's bad is that work seems to get done on fictional scripts while there

Really? There's only one fictional script encoded, and one on the fast track
to encoding.

Deseret isn't fictional. And, to be precise, we don't fast track 
encoding. Shavian will go through the normal ballotting procedure.

Both those are simple non-shaping, non-combining LTR scripts
with a very well defined closed set of characters. It probably only took an
afternoon to write up either of them. I think that more effort has been
wasted debating fictional scripts on [EMAIL PROTECTED] than it will take
to get them encoded.

Well there are only two on the roadmap yet to encode, Tengwar and 
Cirth. Tengwar is complex, and it'd be better for us to make sure 
Lepcha and Limbu and Tai and Cham are sorted out. (Now, Phoenician 
and Old Persian Cuneiform will probably be pushed through sooner 
rather than later, because they are very simple scripts.)

There are various reasons for that, the most common being that we 
can't get enough information about them.  The most common reason 
for not having enough information is that we can't shlep enough 
experts to us, nor shlep enough of us to the experts, to complete 
any encoding proposals... a matter of time and funds.

How does encoding fictional scripts affect this one way or another?

There are a handful of us really interested in working on getting new 
stuff into the standard (entire scripts, anyway). It takes time and 
expertise to do these. For some scripts, it's hard to find experts to 
discuss the issues with. Or when we do find them, they don't use 
e-mail or don't understand Unicode, or whatever. I'm sure that with 
adequate funding, for instance, somebody like me could spend a couple 
of weeks in London libraries and cultural organizations finding the 
right people and nailing down the answers for a good many of the 
Brahmic scripts currently roadmapped. It'd be so easy if one were 
independently wealthy. :-) In the meantime, one has to pay the rent. 
:-)

This problem isn't new. It's going to get worse, too, as possibly 
some of the bigger companies may stop paying their employees to work 
on Unicode stuff since all the major living scripts are encoded (if 
not implemented).

Not that one intends to give up
-- 
Michael Everson




Re: Shavian

2001-07-06 Thread Michael Everson

At 02:34 -0400 2001-07-06, John Cowan wrote:

The most common reason for not having enough information is that we 
can't shlep enough experts to us, nor shlep enough of us to the 
experts, to complete any encoding proposals... a matter of time and 
funds.

Just so, which means that the energy spent on invented scripts is nowise
taken away from the energy that could be spent on obscure-but-real scripts.
Would that it were otherwise.

I try to balance work on the dead ones with work on the living ones.
-- 
Michael Everson




Re: Shavian

2001-07-06 Thread Michael Everson

At 22:03 -0700 2001-07-05, [EMAIL PROTECTED] wrote:

That was in WG2, I guess... The most recent discussion material that UTC saw
is a document I wrote, which is solely about Klingon and reasons for
rejecting it.

WG2 never discussed Klingon, formally.

Fictional or invented scripts aren't in and of themselves bad candidates for 
encoding, they should just be, in general, of low priority because, pretty 
much without exception, they are toys.  Shavian and Deseret are examples of 
scripts that needn't have been encoded now, and aren't very widely used, and 
aren't _NEEDED_ by anyone at all, but were encoded because a while back 
someone just happened to have done the work, and the proposals have just been 
sitting around gathering dust.  Might as well get them in, because nothing
more needs to be done to the proposals.

I don't think Deseret was fictional. Even if it was marginal. But 
they were simple to encode.

What's bad is that work seems to get done on fictional scripts while there
are still millions of real people (some of whom even have access to 
computers) who can't express texts of their natively-used languages with
Unicode because we don't have their scripts encoded.

Shavian and Deseret didn't take a lot of work. We did, however, a 
fair bit of work on Linear B, Ugaritic, Cypriot, Old Italic, and 
Gothic recently. I've tried to get progress on the Tai scripts, but 
as the Chinese haven't sent delegations to the last two WG2 meetings, 
naturally no progress has been made. (I am sure they will be turning 
up to Singapore, and won't they be unhappy if I don't turn up, due to 
the cost of getting there Sigh.)

There are various reasons for that, the most common being that we 
can't get enough information about them. The most common reason for 
not having enough information is that we can't shlep enough experts 
to us, nor shlep enough of us to the experts, to complete any 
encoding proposals... a matter of time and funds.

Indeed. Cham (from Vietnam) should probably have been encoded long 
ago. There are some outstanding questions, and there is the question 
of improving the font for it.
-- 
Michael Everson




Re: Shavian

2001-07-06 Thread John H. Jenkins

At 3:50 AM +0100 7/6/01, David Starner wrote:
A lot of the arguments against Klingon weren't specificially against
Klingon; they were more against any fictional scripts in Unicode.

True, but the criterion being applied by the UTC is whether or not 
there are users of the script who wish to exchange data using it. 
There are also periodic objections from various states to providing 
characters or scripts used by minority languages which they wish 
would go away.  Unicode's criteria are different.

The
editorial response to comments from national groups, in the public archive
of ISO 10646 stuff that you linked to at the start of this message, included
a complaint about Deseret from the German Standards body, in that it was
inappropriate for being a fictional script.

Excuse me??!? 

The response to that was
bascially Not really, IIRC. That does not bode well for lack of contention
for later scripts.


Deseret is a bad example to use.  Deseret is *not* a fictional 
script.  It *is* a modern invented script with slight current 
utility, but it *does* have people who want to use it, and there is a 
surprisingly large body of diaries and other historical records from 
the 19th century written using it.
-- 
=
John H. Jenkins
[EMAIL PROTECTED]
[EMAIL PROTECTED]
http://homepage.mac.com/jenkins/




Re: Shavian

2001-07-06 Thread DougEwell2

In a message dated 2001-07-06 3:23:51 Pacific Daylight Time, 
[EMAIL PROTECTED] writes:

  The folks at DIN were wrong about Deseret, in my opinion. It seems to 
  me that they did not know what Deseret was. Whether it had a long 
  life is irrelevant. That script is of cultural importance to a rather 
  sizeable community of people, and, is of interest to students of 
  writing systems, English phonology, and the history of Mormonism.

Indeed, it appeared as the sole script on a beautiful $5 gold coin issued in 
1860 by a private mint in Salt Lake City that was owned and operated by 
Brigham Young.  The coin is listed in the respected Guide Book of U.S. 
Coins, along with earlier Mormon gold pieces that were issued before the 
invention of the Deseret alphabet.

-Doug Ewell
 Fullerton, California




Re: Shavian

2001-07-06 Thread James Kass

Doug Ewell wrote:

 Indeed, it appeared as the sole script on a beautiful $5 gold coin issued in
 1860 by a private mint in Salt Lake City that was owned and operated by
 Brigham Young.  The coin is listed in the respected Guide Book of U.S.
 Coins, along with earlier Mormon gold pieces that were issued before the
 invention of the Deseret alphabet.


The coin in question is quite rare and valuable.  It's also listed
in (Krause) Standard Catalog of World Coins.  The pictures
in both Yeoman and Krause are a bit blurry, but the inscription
appears to say holiness to the lord.

Using the ConScript encoding,     .

Best regards,

James Kass.







Re: Shavian

2001-07-06 Thread Michael \(michka\) Kaplan

From: Michael Everson [EMAIL PROTECTED]

 The editorial response to comments from national groups, in the
 public archive of ISO 10646 stuff that you linked to at the start
 of this message, included a complaint about Deseret from the German
 Standards body, in that it was inappropriate for being a fictional
 script.
 
 Excuse me??!?

 That's what you get for missing WG2 meetings, John. Yup, that's what
 the German ballot comments said. We dismissed them. Politely. :-)

It seems both of you missed the subtext here, and the reason that the
attempt to introduce fictional scripts might cause consternation (the
original point here)?

The German ballot comments about Deseret were dismissed on the basis that it
was not, in fact, a fictional script. Obviously a dismissal worded the same
way for an actual fictional script would not be possible, and the resistance
to fictional scripts will have to be directly addressed?


MichKa

Michael Kaplan
Trigeminal Software, Inc.
http://www.trigeminal.com/






Re: Shavian

2001-07-06 Thread Michael Everson

At 09:53 -0700 2001-07-06, Michael \(michka\) Kaplan wrote:

It seems both of you missed the subtext here, and the reason that the
attempt to introduce fictional scripts might cause consternation (the
original point here)?

The German ballot comments about Deseret were dismissed on the basis that it
was not, in fact, a fictional script. Obviously a dismissal worded the same
way for an actual fictional script would not be possible, and the resistance
to fictional scripts will have to be directly addressed?

No, if, or when, they make the same objection to Tengwar we will just 
have to dismiss the objection on other grounds, namely, the 
importance of Tolkien's work and the rather large community of users 
who wish to exchange data using these scripts.
-- 
Michael Everson




Re: Shavian

2001-07-06 Thread Kenneth Whistler

MichKa wrote:

 It seems both of you missed the subtext here, and the reason that the
 attempt to introduce fictional scripts might cause consternation (the
 original point here)?
 
 The German ballot comments about Deseret were dismissed on the basis that it
 was not, in fact, a fictional script. Obviously a dismissal worded the same
 way for an actual fictional script would not be possible, and the resistance
 to fictional scripts will have to be directly addressed?

I've been lurking on this discussion, but have to chime in here. A
couple correspondents have pointed out the problems with the misnomer
fictional script. Part of the reason, in my opinion, why we keep
going round and round on this topic is that the terminology itself
bends peoples' minds inappropriately.

I get the strong impression that for some people the argument seems to
go:

   P: Fictional scripts should not be encoded in Unicode.
   Q: Klingon, Tengwar, Cirth, (and Deseret, ...) are fictional scripts.

Therefore R: Klingon, Tengwar, Cirth should not be encoded in Unicode.

Simple and irrefutable, unless you deny both the premises and the
assertions of P and Q.

Everson pointed out (as have others) that *all* scripts are artifacts.

Some, like Latin and Greek, are cultural transmissions from earlier
forms, which gradually changed through use until they came to have
separate identities from their antecedents.

Others were more or less created on the spot, as it were, by a
single individual or group working intentionally on the creation
of an entire script. Of those, some, like Hangul, were magnificent successes,
and have come to see everyday use by millions of people. Some, like
Shavian, were magnificent failures, admired in the attempt, and
still used by small groups of enthusiasts, but of no significant
commercial or economic worth, and of marginal literary worth.

Some, like Tengwar, have taken a somewhat different path.
Tolkien constructed it for aesthetic and literary purposes,
and certainly never had the intent of someone like Shaw, to
use it for the reform or replacement of an existing orthography.
However, unlike Shavian, Tengwar has had a kind of organic success
of a sort, spreading in its aesthetic and literary realm, and
gaining a group of adherents. The fact that Tengwar is used to
express a language that itself was also consciously constructed
does not, as I see it, render it any less suitable for the purposes
it is intended and used. After all, the Latin script is also
used to express constructed languages such as Esperanto. I see
no *moral* distinction here, even if Tengwar is more often put
to the purpose of writing romantic nature poetry, whereas
Esperanto tends to discussions of world government. ;-)

And as I have said before, I see nothing inherently less worthwhile
in a well-constructed Elvish poem expressed in Tengwar than in
a warehouse record from Uruk expressed in Sumero-Akkadian cuneiform.

Jenkins hit this nail on the head when he pointed out that
the basic Unicode criterion for coding-worthiness is the
existence of a significant group of people who wish to exchange
text in the script in question. We can quibble about what it means
to be a significant group in this context, and we certainly
should want to weigh factors such as how large a group, how
much text, and the costs of encoding and implementation versus
the needs. But I think it is just counterproductive to keep
obsessing on the imagined distinction of fictional versus
(what?) real status of the scripts in question.

Klingon (the pseudoscript) was rejected because it failed
the basic criterion for coding-worthiness, as well as a whole
list of other criteria that Rick listed in his document.

Tengwar is a completely different case. It meets all the
criteria, and is appropriate to have on the Roadmap. That doesn't
mean that it is a high priority -- in fact, as has been pointed
out, more energy has been spent gassing about it on this
list than has actually gone into encoding it. But in any
case I consider it inappropriate to keep labelling it a
fictional script. It is a real script and people use it.

If you want to toss around the term fictional script, I
suggest you apply it to things like Jindai, where we know
that a bunch of purported alphabets were simply hoaxes
invented to further political agendas. They were truly
fictional from the beginning, and weren't used except to
discuss the fake examples and make up more fake examples.

If *that* is what the term fictional script means, then,
yes, I agree that fictional scripts should not be encoded
in Unicode.

But if DIN (or somebody else) objects in the future to the
encoding of Tengwar because it is a fictional script, then
I think the answer simply has to involve the rectification
of terms, since Tengwar is *not* a fictional script. And DIN
has no business objecting to the inclusion of a real script
that has a demonstrated body of users who wish to exchange
textual data on the Internet and by other 

Re: Shavian

2001-07-06 Thread Michael \(michka\) Kaplan

From: Kenneth Whistler [EMAIL PROTECTED]

 I've been lurking on this discussion, but have to chime in here.

I do appreciate it, for what its worth. The chime was very much in tune.

While fully recognizing the importance of Middle Earth to some people it is
difficult for me to get past the fact that there Middle Earth has no
national representative to WG2? :-)  And of course anything that is
difficult for me will likely be difficult for others.

And the same people who push so hard for these scripts may never know why it
is so hard to find experts for other scripts who are willing to help.
Perhaps those experts just did not want their scripts to be given equal
footing with those of the _Elvish Linguistic Fellowship_ or similar
organizations?

It certainly gives me pause.

Obviously this will not be enough to away any of those who are truly
committed to representing the interests of the elves, or the people who want
to write like elves, or whoever it is they are in fact representing.

You can just call me a consciencious objector to having anyone who
subscribes to Vinyar Tengwar considering themselves to be among the
Númenoreans (a.k.a. the Dúnedain), who alone of all the races of Men knew
Elvish tongues. :-)

michka
(who loved Tolkien yet nevertheless decided to leave middle earth)





Re: Shavian

2001-07-06 Thread Kenneth Whistler

MichKa responded:

 While fully recognizing the importance of Middle Earth to some people it is
 difficult for me to get past the fact that there Middle Earth has no
 national representative to WG2? :-) 

But this is, of course, nearly irrelevant. If you prowl through the
unencoded scripts listed in the Roadmap documents now, there is almost
nothing left where we would expect a national representative to show
up at WG2 -- or are you expecting Babylonians, Hittites, and Nabataeans
to show up any time soon?

Even for the minority scripts, we no longer have national representatives --
instead we have the big national governments showing up speaking
for minorities on their territories (if they care). Sometimes those
minorities are ethnic minorities in the classic sense, and other times
they are scholarly minorities who happen to claim a certain jurisdiction
in the area by virtue of academic expertise. Hence Germany is claiming
expertise on a number of historic scripts of the Middle East, not because
of any territorial association, but simply because the German scholars
care about the subject matter.

I see no defensible line here, if American, British, and Irish scholars
claim expertise in something like Tengwar and wish to see it encoded. 

 And of course anything that is
 difficult for me will likely be difficult for others.

Of course. ;-)

 
 And the same people who push so hard for these scripts may never know why it
 is so hard to find experts for other scripts who are willing to help.
 Perhaps those experts just did not want their scripts to be given equal
 footing with those of the _Elvish Linguistic Fellowship_ or similar
 organizations?

I think this is an utter nonissue. As Rick pointed out, the reason why
Cham is not progressing has to do with funding for minority language
support and standardization work in Vietnam, not because some key
Cham scholar is miffed about Elvish. The reasons for funding problems
in Vietnam are economic and political, and don't have any demonstrable
connection to arguments on the Unicode list about the decoration of
Star Trek sets at Paramount.

 
 It certainly gives me pause.

...that refreshes.

 Obviously this will not be enough to away any of those who are truly
 committed to representing the interests of the elves, or the people who want
 to write like elves, or whoever it is they are in fact representing.
 
 You can just call me a consciencious objector to having anyone who
 subscribes to Vinyar Tengwar considering themselves to be among the
 Númenoreans (a.k.a. the Dúnedain), who alone of all the races of Men knew
 Elvish tongues. :-)

Aha! I see you are more of a true believer than the true believers.
So your problem is that the *wrong* group is claiming jurisdiction
here, and you would prefer to wait for the Númenoreans to show up
in person at WG2 before progressing any Tengwar proposal.

--Ken

 
 michka
 (who loved Tolkien yet nevertheless decided to leave middle earth)
 
 





Re: Shavian

2001-07-06 Thread Michael \(michka\) Kaplan

From: Kenneth Whistler [EMAIL PROTECTED]

  You can just call me a consciencious objector to having anyone who
  subscribes to Vinyar Tengwar considering themselves to be among the
  Númenoreans (a.k.a. the Dúnedain), who alone of all the races of Men
knew
  Elvish tongues. :-)

 Aha! I see you are more of a true believer than the true believers.
 So your problem is that the *wrong* group is claiming jurisdiction
 here, and you would prefer to wait for the Númenoreans to show up
 in person at WG2 before progressing any Tengwar proposal.

Heh heh heh, I would no quite *that* far -- I don't believe in Middle
Earth the way some of those folks do. I just think they are great stories
and would hate to disprove any aspect of the myth surrounding them, thats
all.

MichKa

Michael Kaplan
Trigeminal Software, Inc.
http://www.trigeminal.com/






Re: Shavian

2001-07-06 Thread John Cowan

[EMAIL PROTECTED] scripsit:

 Although nobody denies the 
 greatness of J.R.R. Tolkien as an author and scholar, it is extremely 
 unlikely that he intended the beautiful and carefully designed Tengwar and 
 Cirth scripts to be used by real humans to write real languages for use in 
 everyday life.  

Well, real humans *have* done so, starting with JRRT and his son Christopher,
and going on to lots of enthusiasts, and if they are not exactly used for
laundry lists, the same is true of many other scripts as well.
(Elfling is just discussing whether we know how to write *Gandalf Tea Wednesday*
in Elvish.)

Furthermore, the Tengwar and the Cirth have been used and are being used
to write English as well as JRRT's invented languages, and in such a way
that (unlike Pigpen, e.g.) they cannot be taken as mere ciphers.

 This goes double for some of the other scripts 
 listed in the ConScript registry.  Some appear *only* on the author's Web 
 pages, alongside elaborate descriptions of fantasy worlds.

I don't think that anyone, certainly not Michael or I, ever intended the
CSUR as a sort of vestibule or waiting area for Unicode registry.  Lots of
the scripts there should never move from it.  But some few have Real
World justifications which ought not to be dismissed out of hand.

 I do believe that original intent has something to do with the legitimacy 
 of a script for consideration in Unicode.

Something, but not everything; how the script has come to be used is
also relevant.  The original intent of Americai Spek was the same as
Shavian, but it didn't take off even in the limited way that Shavian
has, and it should stay in the CSUR.

 Remember that all scripts, including Latin, Arabic, Han, Shavian, and 
 Klingon, were invented by humans.  

Hear, hear.

-- 
John Cowan   [EMAIL PROTECTED]
One art/there is/no less/no more/All things/to do/with sparks/galore
--Douglas Hofstadter




Re: Shavian

2001-07-06 Thread John Cowan

Kenneth Whistler scripsit:

[much good sense snipped]

 However, unlike Shavian, Tengwar has had a kind of organic success
 of a sort, spreading in its aesthetic and literary realm, and
 gaining a group of adherents. 

It turns out that Shavian too has its group of adherents: see
previous postings, and particularly the [EMAIL PROTECTED]
mailing list, which has been running some 60 postings a month lately.

-- 
John Cowan   [EMAIL PROTECTED]
One art/there is/no less/no more/All things/to do/with sparks/galore
--Douglas Hofstadter




Re: Shavian

2001-07-06 Thread DougEwell2

In a message dated 2001-07-06 17:25:33 Pacific Daylight Time, 
[EMAIL PROTECTED] writes:

 Although nobody denies the 
 greatness of J.R.R. Tolkien as an author and scholar, it is extremely 
 unlikely that he intended the beautiful and carefully designed Tengwar and 
 Cirth scripts to be used by real humans to write real languages for use in 
 everyday life.  

  Well, real humans *have* done so, starting with JRRT and his son 
Christopher,
  and going on to lots of enthusiasts, and if they are not exactly used for
  laundry lists, the same is true of many other scripts as well.

Perhaps I should have said that real-life use was not Tolkien's *primary* 
intent for the scripts.  Of course there are some who might want to use them, 
more or less seriously.

I can identify with Michael Everson and his refrigerator notes because I do 
the same thing, although my repertoire is certainly much more limited than 
his.  I sometimes write shopping lists and such in Cyrillic, Greek, Runic, or 
my own conscript.  If someone wants to do the same with Tengwar or Cirth, 
more power to them.

The difference is that Deseret and Shavian were designed with this as the 
*primary* goal -- that people would use these scripts on *at least* equal 
footing with the Latin script, for everything from shopping lists and love 
letters to traffic signs, marriage licenses and government proclamations.

  Furthermore, the Tengwar and the Cirth have been used and are being used
  to write English as well as JRRT's invented languages, and in such a way
  that (unlike Pigpen, e.g.) they cannot be taken as mere ciphers.

Very true.  How did these usages develop?  Are they mentioned in Tolkien's 
manuscripts, or did they develop later?

 This goes double for some of the other scripts 
 listed in the ConScript registry.  Some appear *only* on the author's Web 
 pages, alongside elaborate descriptions of fantasy worlds.

  I don't think that anyone, certainly not Michael or I, ever intended the
  CSUR as a sort of vestibule or waiting area for Unicode registry.  Lots of
  the scripts there should never move from it.  But some few have Real
  World justifications which ought not to be dismissed out of hand.

I definitely agree, and did not intend to sound otherwise.  In addition to 
the scripts that have been proposed for Unicode (Deseret, Klingon, Tengwar, 
Cirth), there are lots of scripts that haven't been and never will be.  As I 
wrote some months ago, I proposed my own conscript for CSUR but would never 
dream of proposing it for Unicode, because it doesn't belong there.

-Doug Ewell
 Fullerton, California




Re: Shavian

2001-07-05 Thread Michael Everson

At 12:38 -0700 2001-07-04, Michael \(michka\) Kaplan wrote:

Fictional scripts have been, are, and will likely continue to be a constant
source of contention for both Unicode and 10646 for years to come.

Klingon is such a script, but didn't meet certain criteria and was 
rejected. Aiha didn't meet them either (the only mention of it on the 
web was the Roadmap!), so we took it out. Tengwar and Cirth meet the 
criteria admirably -- far better than some of the historical scripts 
like Carian.

There aren't any other fictional scripts roadmapped, except Sarati, 
grandmother of Tengwar, which waits in the wings at present.

So what is the source of contention?

I would welcome evidence that there are in fact supplementary character
fonts that will be produced, and of course evidence that the user
community would actually have the software needed to use these fonts or
input methods to type the characters?

Fonts for Shavian and Deseret exist. Input methods exist at least for Deseret.

Of course the bonus would be having
Microsoft and IBM support the conversion of legacy data. My heart palpitates
at seeing that build of ICU!

You might say the same for Gothic or Phoenician.

As for whether your script would be encoded, where it ends up vis-a-vis the
potetial roadmap is more a side effect of who you know than anything else.
:-)

I don't think that's really fair. (Yes, I see the smiley.) What, 
Michka, is on the Roadmap that you think oughtn't be there? What 
scripts were proposed that didn't get in because the Ad-Hoc wasn't, 
ah, properly motivated? :-) :-)
-- 
Michael Everson




Re: Shavian

2001-07-05 Thread David Starner

Michael Everson [EMAIL PROTECTED] wrote:
 At 12:38 -0700 2001-07-04, Michael \(michka\) Kaplan wrote:

 Fictional scripts have been, are, and will likely continue to be a
constant
 source of contention for both Unicode and 10646 for years to come.

 Klingon is such a script, but didn't meet certain criteria and was
 rejected.
[...]
 So what is the source of contention?

A lot of the arguments against Klingon weren't specificially against
Klingon; they were more against any fictional scripts in Unicode. The
editorial response to comments from national groups, in the public archive
of ISO 10646 stuff that you linked to at the start of this message, included
a complaint about Deseret from the German Standards body, in that it was
inappropriate for being a fictional script. The response to that was
bascially Not really, IIRC. That does not bode well for lack of contention
for later scripts.

--
David Starner - [EMAIL PROTECTED]





Re: Shavian

2001-07-05 Thread [EMAIL PROTECTED]

David Starner - [EMAIL PROTECTED] wrote...

 A lot of the arguments against Klingon weren't specificially against
 Klingon;

That was in WG2, I guess... The most recent discussion material that UTC saw  
is a document I wrote, which is solely about Klingon and reasons for  
rejecting it.

Fictional or invented scripts aren't in and of themselves bad candidates for  
encoding, they should just be, in general, of low priority because, pretty  
much without exception, they are toys.  Shavian and Deseret are examples of  
scripts that needn't have been encoded now, and aren't very widely used, and  
aren't _NEEDED_ by anyone at all, but were encoded because a while back  
someone just happened to have done the work, and the proposals have just been  
sitting around gathering dust.  Might as well get them in, because nothing  
more needs to be done to the proposals.

What's bad is that work seems to get done on fictional scripts while there  
are still millions of real people (some of whom even have access to  
computers) who can't express texts of their natively-used languages with  
Unicode because we don't have their scripts encoded.  There are various  
reasons for that, the most common being that we can't get enough information  
about them.  The most common reason for not having enough information is that  
we can't shlep enough experts to us, nor shlep enough of us to the experts,  
to complete any encoding proposals... a matter of time and funds.

Rick





Re: Shavian (was: Re: UTF-17)

2001-07-04 Thread Richard Cook

Michael Everson wrote:
 
 UTC approved it and there's a new document from John Jenkins and me
 on Shavian for WG2, so it should get approved for ballotting at the
 next meeting of WG2.

Hi Michael,

I'm new to the idea that anyone would care to have Shavian encoded. Will
you enlighten me?

Best,
Richard




Re: Shavian (was: Re: UTF-17)

2001-07-04 Thread Michael Everson

At 11:10 -0700 2001-07-04, Richard Cook wrote:
Michael Everson wrote:

  UTC approved it and there's a new document from John Jenkins and me
  on Shavian for WG2, so it should get approved for ballotting at the
  next meeting of WG2.

Hi Michael,

I'm new to the idea that anyone would care to have Shavian encoded. Will
you enlighten me?

Easily: just read http://www.dkuug.dk/jtc1/sc2/wg2/docs/n2362.pdf.
-- 
Michael Everson




Re: Shavian (was: Re: UTF-17)

2001-07-04 Thread Richard Cook

Michael Everson wrote:
 
 At 11:10 -0700 2001-07-04, Richard Cook wrote:
 Michael Everson wrote:
 
   UTC approved it and there's a new document from John Jenkins and me
   on Shavian for WG2, so it should get approved for ballotting at the
   next meeting of WG2.
 
 Hi Michael,
 
 I'm new to the idea that anyone would care to have Shavian encoded. Will
 you enlighten me?
 
 Easily: just read http://www.dkuug.dk/jtc1/sc2/wg2/docs/n2362.pdf.

 C. Technical -- Justification
 1. Contact with the user community?
 Yes, such as it is.

funny :-)

now, I know of other phonemic alphabets for English ... e.g., I think
Ben Franklin invented one, ... and I have one of my own. Are any of
these slated for encoding too?




Re: Shavian (was: Re: UTF-17)

2001-07-04 Thread Michael \(michka\) Kaplan

From: Richard Cook [EMAIL PROTECTED]

 now, I know of other phonemic alphabets for English ... e.g., I think
 Ben Franklin invented one, ... and I have one of my own. Are any of
 these slated for encoding too?

Fictional scripts have been, are, and will likely continue to be a constant
source of contention for both Unicode and 10646 for years to come.

I would welcome evidence that there are in fact supplementary character
fonts that will be produced, and of course evidence that the user
community would actually have the software needed to use these fonts or
input methods to type the characters? Of course the bonus would be having
Microsoft and IBM support the conversion of legacy data. My heart palpitates
at seeing that build of ICU!

As for whether your script would be encoded, where it ends up vis-a-vis the
potetial roadmap is more a side effect of who you know than anything else.
:-)

MichKa

Michael Kaplan
Trigeminal Software, Inc.
http://www.trigeminal.com/






Re: Shavian (was: Re: UTF-17)

2001-07-04 Thread John H. Jenkins

At 12:38 PM -0700 7/4/01, Michael (michka) Kaplan wrote:
I would welcome evidence that there are in fact supplementary character
fonts that will be produced, and of course evidence that the user
community would actually have the software needed to use these fonts or
input methods to type the characters?

FWIW, there is a small but non-zero Shavian user community, and a 
number of fonts are available, some of them very pretty.
-- 
=
John H. Jenkins
[EMAIL PROTECTED]
[EMAIL PROTECTED]
http://homepage.mac.com/jenkins/




Re: Shavian (was: Re: UTF-17)

2001-07-04 Thread Michael \(michka\) Kaplan

From: John H. Jenkins [EMAIL PROTECTED]

 FWIW, there is a small but non-zero Shavian user community, and a
 number of fonts are available, some of them very pretty.

Of this I have no doubt -- but this was true of Klingon, also. g

I was expressing doubt that the majority of the community are:
1) truly unhappy with their current fonts, and
2) eagerly awaiting encoding so that they can use supplementary character
fonts, and
3) will upgrade software as needed to accomplish #2

And then of course seeing folks provide means for conversion from their
current code pages to Unicode seems pretty far off, too would ICU
really provide it? Would Microsoft? Would Apple? Will Oracle have a legacy
database encoding for it?

Maybe someone can provide evidence that my [low key] concern is unfounded?


MichKa

Michael Kaplan
Trigeminal Software, Inc.
http://www.trigeminal.com/






Re: Shavian (was: Re: UTF-17)

2001-07-04 Thread Michael \(michka\) Kaplan

From: John Cowan [EMAIL PROTECTED]

  As for whether your script would be encoded, where it ends up vis-a-vis
the
  potential roadmap is more a side effect of who you know than anything
else.

 Smiley or not, someone might actually believe that, and it
 isn't true.  Michael Everson is more than open-minded about such things.

Well, I did mean in reference to fictional scripts (the subtopic of the
current conversation, such as it is). Michael is certainly someone you would
want to be aware of a fictional script if you wanted it encoded? Is anyone
aware of such a script whose proposal did not benefit from his involvement?

I think the work speaks for itself, and people can draw their own
conclusions from it (for good or evil or other!), so I will not cast
dispersions on any of it. :-)

MichKa

Michael Kaplan
Trigeminal Software, Inc.
http://www.trigeminal.com/






Re: Shavian (was: Re: UTF-17)

2001-07-04 Thread John Cowan

Michael \(michka\) Kaplan scripsit:

 As for whether your script would be encoded, where it ends up vis-a-vis the
 potential roadmap is more a side effect of who you know than anything else.

Smiley or not, someone might actually believe that, and it
isn't true.  Michael Everson is more than open-minded about such things.

-- 
John Cowan   [EMAIL PROTECTED]
One art/there is/no less/no more/All things/to do/with sparks/galore
--Douglas Hofstadter




Re: Shavian (was: Re: UTF-17)

2001-07-04 Thread David Starner

 I was expressing doubt that the majority of the community are:
 1) truly unhappy with their current fonts, and
 2) eagerly awaiting encoding so that they can use supplementary character
 fonts, and
 3) will upgrade software as needed to accomplish #2

If you check out the Shavian group on groups.yahoo.com, there are messages
discussing how to use Unicode for Shavian, links to places with Unicode
(Conscript) encoded Shavian fonts, and actual messages encoded in Unicode (I
believe using the Conscript encoding - I fail to have any Shavian fonts
installed to check). Some of the websites linked to include several
documents in Shavian in Unicode (Conscript) with notes about how they were
going to have to be updated when Shavian actually gets into Unicode. Of all
the communities I've seen discussing Unicodization, this one seems the most
eager and ready.

As for conversions - we're not talking a large amount of data in databases
and what not. Most of what I saw was UTF-8 text files, and it take all of 15
minutes for me (not exactly a programming god) to write a program to convert
that from Conscript encoding to Unicode, and most of that deciding whether
to use the full force of a mighty Unicode library, or go at the byte values
with tr.

--
David Starner - [EMAIL PROTECTED]





Re: Shavian (was: Re: UTF-17)

2001-07-01 Thread Michael Everson

UTC approved it and there's a new document from John Jenkins and me 
on Shavian for WG2, so it should get approved for ballotting at the 
next meeting of WG2.
-- 
Michael Everson