RE: [unicode] Re: FW: Inappropriate Proposals FAQ

2002-07-17 Thread Peter_Constable


On 07/17/2002 02:06:11 PM David Possin wrote:

>I was just trying to draw out something useful from a rather
>useless long thread that wasted a lot of time.

I certainly won't object to any attempt to redeem value from what may have
seemed worthless.



- Peter


---
Peter Constable

Non-Roman Script Initiative, SIL International
7500 W. Camp Wisdom Rd., Dallas, TX 75236, USA
Tel: +1 972 708 7485
E-mail: <[EMAIL PROTECTED]>








RE: [unicode] Re: FW: Inappropriate Proposals FAQ

2002-07-17 Thread David Possin

Marco,

I see your point, you are probably right.

Peter,

I agree with you that color or other attributes are not Unicode issues
when each entity has a different meaning. Each just gets their own
codepoint. I was just trying to draw out something useful from a rather
useless long thread that wasted a lot of time.

What I am trying to understand is where exactly the color (or smell,
sound) information gets added to the code. Does the font developer add
this information to the glyphs and the rendering engine processes the
information correctly? The two glyphs look the same otherwise, so how
would the rendering engine know what to do without the attribute info?
Does a mechanism for attribute already exist when a glyph gets sent
from the font to the rendering engine?

I know that this Aztec writing system will probably never be encoded,
but I like to think in advance about possible solutions when a related
issue might pop up some day, even if I push into the back of my brain
then. Looking at the amount of time wasted already, a few thoughts
about the only usable issue in the thread shouldn't be a waste of time.

So, if it is a font issue, how would these attributes be stored with
the font? Or does it look up the descriptive attributes assigned to
each Unicode character? Meaning now that the attribute is defined in
the Unicode standard as part of the character description, thus a
Unicode issue after all? Hmm - full circle - chicken or egg?

Dave
--- [EMAIL PROTECTED] wrote:
> 
> On 07/05/2002 03:00:35 PM Marco Cimarosti wrote:
> 
> >David Possin wrote:
> >> But, if something it silently ignored, then somebody has
> discovered
> >> something that nobody wants to touch. I have observed this sevaral
> >> times now, the latest incident was in the Chromatic Font Research
> >> thread, with 2 cases:
> >>
> >> Aztec glyphs: [...] Silence.
> >
> >Funny. I interpreted that silence the opposite way: very positively.
> I
> >didn't expect any immediate action, and the absence of denials made
> me
> feel
> >the information I passed was not totally pointless.
> >
> >Anyway, even if the silence actually meant "Who cares?", it doesn't
> bother
> >me, because I think this is NOT an issue for Unicode...
> 
> I think Marco has got this right. Let's suppose Aztec writing gets
> deciphered and there are cases of the same shape with different
> colouring
> to mean different things. Let's further suppose that we determine
> that the
> difference in semantics isn't akin to the ways in which colouring of
> English text might conceivably be used (e.g. for emphasis) but is
> really
> fundamental. Let's also further suppose that, taking all things into
> consideration, we really do consider this text and come to the
> conclusion
> that the best solution is one that's purely text-based (i.e. no
> markup or
> other higher-level protocal). We're nowhere near having made all
> these
> conclusions, but let's just suppose. So, we identify two things that
> are
> minimally contrastive: a red-and-white-whatsit, and a
> blue-green-and-yellow-whatsit. They are two entities and each gets a
> codepoint. That's an encoding issue. How they get rendered isn't an
> encoding issue.
> 
> Of course, at that point, we'd be wanting to consider how to deal
> with
> chromatic issues in text rendering where chromaticity is inherent to
> the
> character and not a matter of user-discretion (for which formatting
> is
> appropriate). But we are not yet at the point of knowing that is even
> necessary. And since it would clearly not be a trivial problem to
> solve
> (it's not finding a way to do it that's hard -- it's the huge amount
> of
> secondary implications), I think the silence amounts to a reaction
> that we
> neither are ready to cross that bridge nor do we need to at this time
> -- in
> fact, it's not yet certain that we will ever need to -- and that in
> the
> mean time there are more immediate and real concerns to be dealt
> with.
> 
> 
> 
> - Peter
> 
> 
>
---
> Peter Constable
> 
> Non-Roman Script Initiative, SIL International
> 7500 W. Camp Wisdom Rd., Dallas, TX 75236, USA
> Tel: +1 972 708 7485
> E-mail: <[EMAIL PROTECTED]>
> 
> 
> 
> 


=
Dave Possin
Globalization Consultant
www.Welocalize.com
http://groups.yahoo.com/group/locales/

__
Do You Yahoo!?
Yahoo! Autos - Get free new car price quotes
http://autos.yahoo.com




RE: [unicode] Re: FW: Inappropriate Proposals FAQ

2002-07-17 Thread Peter_Constable


On 07/05/2002 03:00:35 PM Marco Cimarosti wrote:

>David Possin wrote:
>> But, if something it silently ignored, then somebody has discovered
>> something that nobody wants to touch. I have observed this sevaral
>> times now, the latest incident was in the Chromatic Font Research
>> thread, with 2 cases:
>>
>> Aztec glyphs: [...] Silence.
>
>Funny. I interpreted that silence the opposite way: very positively. I
>didn't expect any immediate action, and the absence of denials made me
feel
>the information I passed was not totally pointless.
>
>Anyway, even if the silence actually meant "Who cares?", it doesn't bother
>me, because I think this is NOT an issue for Unicode...

I think Marco has got this right. Let's suppose Aztec writing gets
deciphered and there are cases of the same shape with different colouring
to mean different things. Let's further suppose that we determine that the
difference in semantics isn't akin to the ways in which colouring of
English text might conceivably be used (e.g. for emphasis) but is really
fundamental. Let's also further suppose that, taking all things into
consideration, we really do consider this text and come to the conclusion
that the best solution is one that's purely text-based (i.e. no markup or
other higher-level protocal). We're nowhere near having made all these
conclusions, but let's just suppose. So, we identify two things that are
minimally contrastive: a red-and-white-whatsit, and a
blue-green-and-yellow-whatsit. They are two entities and each gets a
codepoint. That's an encoding issue. How they get rendered isn't an
encoding issue.

Of course, at that point, we'd be wanting to consider how to deal with
chromatic issues in text rendering where chromaticity is inherent to the
character and not a matter of user-discretion (for which formatting is
appropriate). But we are not yet at the point of knowing that is even
necessary. And since it would clearly not be a trivial problem to solve
(it's not finding a way to do it that's hard -- it's the huge amount of
secondary implications), I think the silence amounts to a reaction that we
neither are ready to cross that bridge nor do we need to at this time -- in
fact, it's not yet certain that we will ever need to -- and that in the
mean time there are more immediate and real concerns to be dealt with.



- Peter


---
Peter Constable

Non-Roman Script Initiative, SIL International
7500 W. Camp Wisdom Rd., Dallas, TX 75236, USA
Tel: +1 972 708 7485
E-mail: <[EMAIL PROTECTED]>







FW: Chromatic text. (follows from Re: [unicode] Re: FW: Inappropriate Proposals FAQ)

2002-07-11 Thread Suzanne M. Topping

> -Original Message-
> From: William Overington [mailto:[EMAIL PROTECTED]]
> Sent: Monday, July 08, 2002 10:20 AM

> To me, such a distinction means that people who are using 
> lower cost, more
> generally available software packages, might by such an 
> approach be able in
> the not too distant future to use files in a non-proprietary 
> portable format
> and get much better results than just using monochrome 
> traditional plain text.
 
I don't see how putting this sort of stuff in Unicode in any way speeds
up adoption into real-world use. Applications have to "learn" how to
deal with the text manipulation specification, whether it's provided as
markup or theoretically stored in Unicode. Bottom line is that it would
take a while to trickle down to the level you describe. Unicode isn't
the/a solution for bridging your digital divide.




RE: Chromatic text. (follows from Re: [unicode] Re: FW: Inappropriate Proposals FAQ)

2002-07-08 Thread Marco Cimarosti

William Overington wrote:
> Actually I was trying in the posting upon which you comment 
> to suggest that, even if people do not agree with me about
> having colour codes in a plain text file, they might
> perhaps consider as a separate issue the adding into regular
> Unicode of a zero width operator whose use would be 
> to indicate that a character, such as U+1362, should be
> decorated chromatically.

Come on, William!!

Adding such a "zero width operator" *is* having color in plain text!

And adding such "zero width operators" *is* inserting mark up in plain text!

> >I interpret your post as one more lengthy repetition of your 
> well-known
> >opinion: differences between "plain text" and "rich text" 
> should not exist:
> >they should be eliminated by incorporating the mark-up in 
> the encoding.
> 
> Actually, that is not my opinion.

No, I know. This is my explanation of my perception of your explanation of
your opinion. Now I am not sure what your perception of my explanation of my
perception of your explanation of your opinion might be.

Gentlemen, communication is such a difficult art!

> [...]
> Perhaps some sort of consensus over nomenclature for three 
> categories of
> text file could occur, namely plain text in the manner which 
> you like it,
> plain text in the manner in which I like it and markup.  
> Maybe plain text,
> enhanced text and markup would be suitable names.  How do 
> people feel about
> that please?

I would suggest "proletarian text", "middle-class text" and "capitalist
text", if I wasn't so scared that someone could take it seriously.

> It is unfortunately the case in discussions that when someone 
> disagrees with
> an idea that is put forward that he or she is more likely to 
> respond in
> public than if he or she agrees with an idea which is put 
> forward, or has
> simply read about the idea and just notes it as an 
> interesting possibility.
> This can have the effect that many people may agree with an 
> idea or at least
> not be against it yet make no comment, perhaps giving an 
> impression that an
> idea is not well received at large when in fact that is not 
> necessarily the
> case.

Yes, definitely a difficult art.

_ Marco




Re: Chromatic text. (follows from Re: [unicode] Re: FW: Inappropriate Proposals FAQ)

2002-07-08 Thread William Overington

Marco Cimarosti wrote as follows.

>Of course you can. But my feeling is that you already *did* suggest this,
>many and many times.

Actually I was trying in the posting upon which you comment to suggest that,
even if people do not agree with me about having colour codes in a plain
text file, they might perhaps consider as a separate issue the adding into
regular Unicode of a zero width operator whose use would be to indicate that
a character, such as U+1362, should be decorated chromatically.  This would
mean that a sequence U+1362 ZWJ ZWCDO could be used in documents, which
would give a chromatically decorated glyph with a chromatic font yet would
just give U+1362 as a monochrome character if the font did not recognize the
U+1362 ZWJ ZWCDO sequence.

>
>I interpret your post as one more lengthy repetition of your well-known
>opinion: differences between "plain text" and "rich text" should not exist:
>they should be eliminated by incorporating the mark-up in the encoding.
>

Actually, that is not my opinion.

My opinion is that splitting text files into just two categories, either
plain text or markup is not sufficient, but that there should perhaps be
more categories or, if there are but two categories that the dividing line
between them should be in a different place.  I tend to base the essential
dividing line upon whether the encoding of the file of code points is
meaningful if one tries to compute the effect of a code point upon the
system as simply the effect of that code point as it stands, without having
to have software recognize a character such as < and determine that a markup
bubble is being entered then to have to read in several more characters
within the markup bubble before taking any action as a result of the first
character in the sequence (that is, the < character) being read.  That
distinction means that each Unicode character is processed as it is received
within the main loop of the program, without the receiving of a < character
putting the processing into an inner loop within a markup bubble, within
which bubble ordinary Unicode character codes which are read have a
different meaning than in the Unicode specification.

To me, such a distinction means that people who are using lower cost, more
generally available software packages, might by such an approach be able in
the not too distant future to use files in a non-proprietary portable format
and get much better results than just using monochrome traditional plain
text.

Perhaps some sort of consensus over nomenclature for three categories of
text file could occur, namely plain text in the manner which you like it,
plain text in the manner in which I like it and markup.  Maybe plain text,
enhanced text and markup would be suitable names.  How do people feel about
that please?

It is unfortunately the case in discussions that when someone disagrees with
an idea that is put forward that he or she is more likely to respond in
public than if he or she agrees with an idea which is put forward, or has
simply read about the idea and just notes it as an interesting possibility.
This can have the effect that many people may agree with an idea or at least
not be against it yet make no comment, perhaps giving an impression that an
idea is not well received at large when in fact that is not necessarily the
case.

William Overington

8 July 2002











RE: Chromatic text. (follows from Re: [unicode] Re: FW: Inappropriate Proposals FAQ)

2002-07-08 Thread Marco Cimarosti

William Overington wrote:
> >The problem (if there is one!) is only for font technology.
> >
> >> Ethiopian writing: [...] "The capability to the same electronically
> >> would be well received. /Daniel."
> >
> >Same for this one: Unicode's task was to provide a code point for the
> >Ethiopic full stop, and they did. Whether the corresponding glyph is
> colored
> >or not is problem for fonts and word processors.
> 
> Well, may I please suggest that the issue is one for Unicode 
> as well as for font technology?
>
> [...]

Of course you can. But my feeling is that you already *did* suggest this,
many and many times.

I interpret your post as one more lengthy repetition of your well-known
opinion: differences between "plain text" and "rich text" should not exist:
they should be eliminated by incorporating the mark-up in the encoding.

I think that it is your right to repeat your opinions as many times as you
want. Nevertheless, I find that repeating opinions which are already
well-known to everybody is *useless* and *boring*.

_ Marco




Chromatic text. (follows from Re: [unicode] Re: FW: Inappropriate Proposals FAQ)

2002-07-06 Thread William Overington

>The problem (if there is one!) is only for font technology.
>
>> Ethiopian writing: [...] "The capability to the same electronically
>> would be well received. /Daniel."
>
>Same for this one: Unicode's task was to provide a code point for the
>Ethiopic full stop, and they did. Whether the corresponding glyph is
colored
>or not is problem for fonts and word processors.

Well, may I please suggest that the issue is one for Unicode as well as for
font technology?

Firstly, for the avoidance of doubt in the matter, whereas I am an advocate
for adding codes into Unicode for effects for organizing and controlling
data in ways which some people consider should be done only by markup
methods, I am hoping that, without that aspect of my research prejudicing
the matter, readers might consider the possibility of adding into regular
Unicode some operators for use in ZWJ sequences for requesting that a
chromatically decorated glyph of the 'operated upon' regular Unicode
character be produced if the font can provide it, yet otherwise that a
monochrome ordinary glyph be provided.

May I please refer to the following document.

http://www.users.globalnet.co.uk/~ngo/courtfor.htm

In that document I wrote as follows.

quote

Here are some codes for use in ZWJ sequences of the form U+ ZWJ U+F3DC
and U+ ZWJ U+F3DD so as to provide facilities to indicate to a chromatic
font that a colour decorated version of U+ is requested, where U+
represents any Unicode character where such usage would be meaningful. This
facility is provided in anticipation of the possibility of chromatic fonts
being introduced at some time in the future.

U+F3DC ZERO WIDTH DECORATION OPERATOR OF THE FIRST KIND
U+F3DD ZERO WIDTH DECORATION OPERATOR OF THE SECOND KIND

end quote

May I please refer to the following document.

http://www.users.globalnet.co.uk/~ngo/courtcol.htm

In that document I wrote as follows.

quote

U+F3E0 BLACK
U+F3E1 BROWN
U+F3E2 RED
U+F3E3 ORANGE
U+F3E4 YELLOW

end quote

So, it would be the case that in order to set some text in black one would
use U+F3E0 then the text and in order to set some text in red one would use
U+F3E2 then the text.

In order to set some text in black including a character U+1362 in black
with red flourishes one would use U+F3E0 then the text which precedes the
U+1362 character and then U+1362 ZWJ U+F3DC which should do the job
perfectly well as the chromatic font would be set up so that the decoration
of the first kind operator worked in black and red.

More generally, for other chromatic characters from other applications where
the colours are not specific, then the chromatic colours can be changed
before using the ZWJ sequence.

Quoting from the same document.

quote

Colour changing is by a specially devised method which will hopefully be
efficient in practice.

Upon receiving one of the 18 codes to change colour, the system presumes on
a temporary basis that the new colour is to become the foreground colour,
which is what it will usually be. However, the previous foreground colour is
stored. If a command to set one of the other four colours is received, then
the foreground colour is used for that purpose, with the foreground colour
being replaced by the previous foreground colour.

This means that only one code point is needed to change the foreground
colour and two code points are needed to change the contents of any of the
other colour registers.

The decoration colours are intended to be ready for the possible
introduction of chromatic fonts at some future date.

U+F3AC SET NEW BACKGROUND COLOUR
U+F3AD SET NEW FIRST DECORATION COLOUR
U+F3AE SET NEW SECOND DECORATION COLOUR
U+F3AF SET NEW THIRD DECORATION COLOUR

end quote

In using chromatic font technology I suggest that the specific colours could
either be built into the glyph or could just be foreground colour,
background colour, first decoration colour, second decoration colour, third
decoration colour, in abstract terms with the specific colours being
supplied by the rendering software.  This would enable some glyphs, such as
those for the Ethiopic manuscripts, to be specified as black and red within
the font, and for some glyphs, such as those for an ornament, to be
specified by the rendering software.

As regards the possibility of including such code points as I have mentioned
above in regular Unicode, well, there are various levels.

There is the issue of whether codes such as U+F3E2 RED above should be
promoted as some people feel that they are markup and should not be included
in regular Unicode.  Well, that is an issue and I quite accept that I am
currently in the minority over that issue, though I would ask that readers
might look into the matter of the use of such codes in DVB-MHP (Digital
Video Broadcasting - Multimedia Home Platform) broadcasting
of multimedia where text in Unicode files will be processed by Java programs
which have been broadcast to television sets, before taking a definitive
view on the matter, an

Re: [unicode] Re: FW: Inappropriate Proposals FAQ

2002-07-05 Thread Michael Everson

At 10:37 am -0700 2002-07-05, David Possin wrote:

>Aztec glyphs: Some of the glyphs are identical in shape and form, but a
>certain colored area changes the meaning if a different color is
>applied. When Michael Everson asked for proof, both Marco Cimarosti and
>I sent him links to websites that state this color issue. Silence.

What did you want me to say? Aztec hasn't been fully deciphered yet, 
it seems. I did mention Budge's use of a black line over rubricked 
Egyptian text. If we encountered a script in which colour was really 
intrinsic we might have to deal with it, but in the real world such a 
convention would be pretty unstable. How would you carve your name 
into a tree with a knife if you had no ink with you and D was black 
but d was blue?

>Ethiopian writing: Daniel Yacob described the usage of red dots,
>accents, and words in that writing system, nobody except WO followed up
>with the significance of Daniel's statements. Silence, even though he
>wrote "The capability to the same electronically would be well
>received.

Would markup not do?

>I see two valid possible proposals here to add a color attribute to a
>character. What will happen if a need for these characters is
>discovered, a consortium with the necessary background is formed, and
>the UTC receives an orderly proposal?

In Quark I can add colour attributes to a character for printing. We 
would consider an orderly proposal on its merits.
-- 
Michael Everson *** Everson Typography *** http://www.evertype.com




RE: [unicode] Re: FW: Inappropriate Proposals FAQ

2002-07-05 Thread Marco Cimarosti

David Possin wrote:
> But, if something it silently ignored, then somebody has discovered
> something that nobody wants to touch. I have observed this sevaral
> times now, the latest incident was in the Chromatic Font Research
> thread, with 2 cases:
> 
> Aztec glyphs: [...] Silence.

Funny. I interpreted that silence the opposite way: very positively. I
didn't expect any immediate action, and the absence of denials made me feel
the information I passed was not totally pointless.

Anyway, even if the silence actually meant "Who cares?", it doesn't bother
me, because I think this is NOT an issue for Unicode.

"Unicode encodes character, not glyphs!" Whether the glyphs representing
those characters are colored or not, the only problem for Unicode would be
having four-color printing for a future edition of the Unicode Standard...

The problem (if there is one!) is only for font technology.

> Ethiopian writing: [...] "The capability to the same electronically
> would be well received. /Daniel."

Same for this one: Unicode's task was to provide a code point for the
Ethiopic full stop, and they did. Whether the corresponding glyph is colored
or not is problem for fonts and word processors.


However, there has been one case when Unicode's silence disturbed me, and
this was when I (et al.) raised a real encoding problem, although certainly
a minor one. It had to do with encoding "repha" out of context ("repha" is
one of the contextual glyphs of letter RA in some Indic scripts).


_ Marco




Re: [unicode] Re: FW: Inappropriate Proposals FAQ

2002-07-05 Thread David Possin

Resending this email because for some reason my membership in the
Unicode list got deleted.
--- Rick McGowan <[EMAIL PROTECTED]> wrote:
> Suzanne T asked:
> 
> > Can people from the review committee give me some hard and fast
> > rules for when something is thrown out?
> 
> --snip-- 
> One rule of thumb that people can also use: if an off-the-cuff
> proposal  
> for a thing doesn't fly on the Unicode list, it is unlikely to fly in
> UTC.
> 
>   Rick
> 
I have noticed something else in this aspect. If an idea gets bashed on
the Unicode list it won't make it, and almost all the time I agree that
it shouldn't make it.

But, if something it silently ignored, then somebody has discovered
something that nobody wants to touch. I have observed this sevaral
times now, the latest incident was in the Chromatic Font Research
thread, with 2 cases:

Aztec glyphs: Some of the glyphs are identical in shape and form, but a
certain colored area changes the meaning if a different color is
applied. When Michael Everson asked for proof, both Marco Cimarosti and
I sent him links to websites that state this color issue. Silence.

Ethiopian writing: Daniel Yacob described the usage of red dots,
accents, and words in that writing system, nobody except WO followed up
with the significance of Daniel's statements. Silence, even though he
wrote "The capability to the same electronically would be well
received.
/Daniel."

I see two valid possible proposals here to add a color attribute to a
character. What will happen if a need for these characters is
discovered, a consortium with the necessary background is formed, and
the UTC receives an orderly proposal?

Between all the arguing and mile long emails nobody actually saw this
possibility, or wanted to see the valid issues for a proposal. I
believe it is necessary to invest some thought into what a color
implementation would mean for Unicode, not for a holly with red
berries, but for a real writing system. The silence after these valid
statements were made disturbs me. 

Dave











=
Dave Possin
Globalization Consultant
www.Welocalize.com
http://groups.yahoo.com/group/locales/

__
Do You Yahoo!?
Sign up for SBC Yahoo! Dial - First Month Free
http://sbc.yahoo.com




Re: FW: Inappropriate Proposals FAQ

2002-07-04 Thread Doug Ewell

Otto Stolz  wrote:

>> I noticed in particular that the characters are non-contiguous
>> within themselves.
>
> This is not enough to decide what constitutes a character. E. g. the
> cyrillic character "?" is non-contiguous; in the Latin script, when
> "f" and "l" are ligated, they still constitute two characters.

For that matter, the dot over lowercase "i" and "j" make those glyphs
non-contiguous as well.

-Doug Ewell
 Fullerton, California





Re: FW: Inappropriate Proposals FAQ

2002-07-04 Thread John H. Jenkins


On Thursday, July 4, 2002, at 09:07 AM, Otto Stolz wrote:

> Michael Everson wrote:
>> That, and the fact that it hasn't been deciphered.
>
>
> Which implies that you really cannot tell what constitutes a character,
> in that script, nor its writing-direction.
>
>

Actually, you can't even tell *that* it's a script, not for sure.  But if 
it *is* writing, then the nature of the characters seems fairly 
unambiguous as the various signs are self-contained and don't break down 
into smaller pieces.  It would appear to be a syllabary.  Also IIRC the 
writing direction has been deduced by determining the order in which the 
characters were stamped into the clay (as indicated by overlaps).

I should mention that the proposals for the encoding of the Phaistos disc 
are the only proposals made to the UTC and WG2 which contain the entire 
known corpus of writing with that script as a part of the proposal.  :-)

==
John H. Jenkins
[EMAIL PROTECTED]
[EMAIL PROTECTED]
http://homepage.mac.com/jenkins/





Re: FW: Inappropriate Proposals FAQ

2002-07-04 Thread Michael Everson

At 17:07 +0200 2002-07-04, Otto Stolz wrote:

>Michael Everson wrote:
>>That, and the fact that it hasn't been deciphered.
>
>Which implies that you really cannot tell what constitutes a character,
>in that script, nor its writing-direction.

Actually it appears that the writing direction is into the faces of 
the characters, and that the impression in the clay indicates that 
this was the likely direction of printing. And as far as the 
repertoire is concerned, it is small and each character is easily 
differentiated from the others.
-- 
Michael Everson *** Everson Typography *** http://www.evertype.com




Re: FW: Inappropriate Proposals FAQ

2002-07-04 Thread Otto Stolz

William Overington had written:
> the proposal for encoding of the Phaistos Disk Script into Unicode

> [...] was rejected.  Is this really for a purported reason that the

> script is only found on one item at present?

Michael Everson wrote:
> That, and the fact that it hasn't been deciphered.


Which implies that you really cannot tell what constitutes a character,
in that script, nor its writing-direction.

William Overington had written:
> I noticed in particular that the characters are non-contiguous within

> themselves.


This is not enough to decide what constitutes a character. E. g. the
cyrillic character "Ы" is non-contiguous; in the Latin script, when
"f" and "l" are ligated, they still constitute two characters.

Best wishes,
   Otto Stolz





Re: FW: Inappropriate Proposals FAQ

2002-07-04 Thread Michael Everson

At 05:22 +0100 2002-07-04, William Overington wrote:

>Has the Unicode Technical
>Committee in fact ever discussed the possibility of whether or not to encode
>chess diagrams in regular Unicode at all?

No, but I suspect that if they were to do so they would quickly come 
to the conclusion that markup was best for describing the chess board 
into which the chess characters were arranged.

>One script which the Unicode Technical Committee has turned down is the
>Phaistos Disk Script.  There is a document, which if I may say so is of high
>quality, about the proposal for encoding of the Phaistos Disk Script into
>Unicode, into Plane 1.  It was rejected.  Is this really for a purported
>reason that the script is only found on one item at present?

That, and the fact that it hasn't been deciphered.

>How about discussing the Phaistos Disk Script in this new FAQ document?

How about not? General aphorisms are better in this case.

>As for the script regarding the cipher in relation to a movie.  I feel that
>the characters are very elegant and I noticed in particular that the
>characters are non-contiguous within themselves.

The point made was that the writing system was cumbersome and 
inefficient qua writing system.
-- 
Michael Everson *** Everson Typography *** http://www.evertype.com




Re: FW: Inappropriate Proposals FAQ

2002-07-04 Thread William Overington

Marco Cimarosti wrote as follows.


>William Overington wrote:
>> As for the script regarding the cipher in relation to a
>> movie.  [...]
>
>Which one? Doug mentioned three movies: "Men in Black II", "Indiana Jones
>and the Temple of Doom", "Atlantis: The Lost Empire".
>
>_ Marco
>

The Men in Black II one to which Ken referred in the post to which I was
replying.

William Overington

4 July 2002






RE: FW: Inappropriate Proposals FAQ

2002-07-04 Thread Marco Cimarosti

William Overington wrote:
> As for the script regarding the cipher in relation to a 
> movie.  [...]

Which one? Doug mentioned three movies: "Men in Black II", "Indiana Jones
and the Temple of Doom", "Atlantis: The Lost Empire".

_ Marco




Re: FW: Inappropriate Proposals FAQ

2002-07-03 Thread William Overington

Ken Whistler wrote as follows.

>But if a "script", like the MIIB BurgerKing cipher mentioned today,
>or chess diagram notation, is missing from the Roadmap, there is probably
>a *good* reason for it not to be there, and people should think twice
>(and then again) before they start proposing it for encoding in Unicode.

Well, there may or may not be a good reason for something missing from the
Roadmap, one good reason possibly being that nobody has suggested something
before.

I make this point since you mention chess diagram notation.

I have recently published some Private Use Area code point allocations for
chess diagrams.

http://www.users.globalnet.co.uk/~ngo/chess.htm

http://www.users.globalnet.co.uk/~ngo/chess2.htm

http://www.users.globalnet.co.uk/~ngo/golden.htm

I am well aware that some people are not interested in my publishing of
Private Use Area allocations, but I am stating the matter here since you
mentioned chess diagram encoding and said that there is probably a good
reason for it not being in the Roadmap.  Has the Unicode Technical
Committee in fact ever discussed the possibility of whether or not to encode
chess diagrams in regular Unicode at all?  Just because something has not
arisen before and has never been discussed by members of the 1 dollars a
year for a vote group does not in any way make that matter questionable in
any way whatsoever.



One script which the Unicode Technical Committee has turned down is the
Phaistos Disk Script.  There is a document, which if I may say so is of high
quality, about the proposal for encoding of the Phaistos Disk Script into
Unicode, into Plane 1.  It was rejected.  Is this really for a purported
reason that the script is only found on one item at present?  I find that a
strange thing.  I feel that that proposal should be looked at again by the
committee.  How about discussing the Phaistos Disk Script in this new FAQ
document?



As for the script regarding the cipher in relation to a movie.  I feel that
the characters are very elegant and I noticed in particular that the
characters are non-contiguous within themselves.  Examining the gif file in
some detail I find that a lot of care has been taken in preparing those
characters and I wonder if anyone knows what are the design influences
underlying the design please.  Is the designer perhaps reading this list and
perhaps would like to comment please?

William Overington

4 July 2002













Re: FW: Inappropriate Proposals FAQ

2002-07-03 Thread Michael Everson

At 12:17 -0700 2002-07-03, Kenneth Whistler wrote:

>If a script is listed there in the Roadmap for the BMP or for Plane 1,
>then people can be assured that interested members of the encoding
>committees have *already* made a tentative determination that
>the script is suitable for encoding, although a proposal may not
>actually exist yet, and of course, there are no guarantees until the
>committees actually do the work on fully filled-out formal proposals.

Indeed, we might end up unifying Javanese and Balinese for instance. 
We don't know just now.

>But if a "script", like the MIIB BurgerKing cipher mentioned today,
>or chess diagram notation, is missing from the Roadmap, there is probably
>a *good* reason for it not to be there, and people should think twice
>(and then again) before they start proposing it for encoding in Unicode.

Well... a year ago we hadn't heard of N'Ko. Things turn up all the time.

>The "voice which shook the earth", from Chapter IV, verse 44 of
>LIBER LIBERI vel LAPIDIS LAZULI ADUMBRATIO KABBAL GYPTIORUM,
>one of the Holy Books of Thelema:
>
>http://www.nuit.org/thelema/Library/HolyBooks/LibVII.html

Wow. Or indeed UAOAU
-- 
Michael Everson *** Everson Typography *** http://www.evertype.com




Re: FW: Inappropriate Proposals FAQ

2002-07-03 Thread Kenneth Whistler

Suzanne,

> Can people from the review committee give me some hard and fast rules for
> when something is thrown out?

As Michael Everson indicated, the answer to this is probably not.

However, perhaps the most important thing for serious script
proposers to do, to see if what they are concerned about might be
acceptable, is to consult the Roadmap:

http://www.unicode.org/roadmaps/

If a script is listed there in the Roadmap for the BMP or for Plane 1,
then people can be assured that interested members of the encoding
committees have *already* made a tentative determination that
the script is suitable for encoding, although a proposal may not
actually exist yet, and of course, there are no guarantees until the
committees actually do the work on fully filled-out formal proposals.

But if a "script", like the MIIB BurgerKing cipher mentioned today,
or chess diagram notation, is missing from the Roadmap, there is probably 
a *good* reason for it not to be there, and people should think twice 
(and then again) before they start proposing it for encoding in Unicode.

--Ken

Another "missing" example:

The "voice which shook the earth", from Chapter IV, verse 44 of
LIBER LIBERI vel LAPIDIS LAZULI ADUMBRATIO KABBALÆ ÆGYPTIORUM,
one of the Holy Books of Thelema:

http://www.nuit.org/thelema/Library/HolyBooks/LibVII.html

Disclaimer: The UTC New Scripts committee does not discriminate
among script applicants on the basis of race, color, gender,
religion, sexual orientation, national or ethnic origin, age,
disability, or veteran status. However, if they are risible,
we reserve the right to laugh. ;-)





Re: FW: Inappropriate Proposals FAQ

2002-07-03 Thread John H. Jenkins


On Wednesday, July 3, 2002, at 11:57 AM, Asmus Freytag wrote:

> Klingon (or any of the Latin ciphers/ movie scripts)
>
>

I'd say Klingon *and* one of the Latin ciphers.  Klingon is almost worth a 
FAQ in itself.

==
John H. Jenkins
[EMAIL PROTECTED]
[EMAIL PROTECTED]
http://homepage.mac.com/jenkins/





Re: FW: Inappropriate Proposals FAQ

2002-07-03 Thread Michael Everson

At 12:59 -0400 2002-07-03, Suzanne M. Topping wrote:

>Can people from the review committee give me some hard and fast rules for
>when something is thrown out?

I suspect we cannot.
-- 
Michael Everson *** Everson Typography *** http://www.evertype.com




Re: FW: Inappropriate Proposals FAQ

2002-07-03 Thread Asmus Freytag

This looks like a lot of work and it looks like it duplicates as lot of the 
work in the "submitting new proposals" section of instructions on our 
website and in the standard.

We are getting a large number of *informal* suggestions for proposals that 
are more or less clearly inappropriate and spend some amount of time on the 
list dealing with them. However, neither UTC nor ISO/IEC JTC1/SC2/WG2 are 
receiving a large number of inappropriate *formal* proposals. In fact, both 
groups few proposals that don't have active support or involvement of 
people active in either or both bodies - ad those people don't need an FAQ.

I submit that, while an interesting excercise, such a FAQ is a solution in 
search of a problem. Having a FAQ will not do anything to keep enthusiasts 
from storming in with their latest brain child - since it's the nature of 
these informal suggestions that people never read the FAQ (any FAQ) before 
hitting the send button.  (OK, maybe some do).

At the same time, there's the risk that we maintain TWO sets of information 
on the same topic ("what's an acceptable proposal"), with all the 
maintenance issues and issues of which text is the binding one.

I suggest that we either not do a FAQ, or do a very simple one of a few 
(three?) proposals that have definitely failed (or would definitely fail) 
as illustrations, just to establish the idea that there are inappropriate 
proposals, and then firmly point to the *official* document that sets out 
the criteria for creating a *formal* proposal.

My favorite examples are
Klingon (or any of the Latin ciphers/ movie scripts)
Hexadecimal digits or 'Decimal separator'
Any proposal asking for the rearrangement/removal of characters

The decimal separator is a clear example of coding something that is 
already encoded using a different model and coding a character purely by 
function, which Unicode tends to be leery of, esp. if it can't be visually 
distinguished from existing characters.

I would definitely NOT like to see a detailed typology of proposals in a 
FAQ, with detailed script classifications etc.

A./




At 12:59 PM 7/3/02 -0400, Suzanne M. Topping wrote:
>I realized that I should probably turn an off-list discussion back to
>the list, as it's illustrating an area of difficulty. (See the bottom of
>this note for a partial discussion of what writing systems could/would
>be considered.)
>
>In the "appropriate use" FAQ entry, how the heck can we state what is
>and isn't a suitable writing system for inclusion? Fictional scripts in
>some cases would be considered, in other cases would not. Historical
>scripts would in some cases be considered, in other cases not. Can
>people from the review committee give me some hard and fast rules for
>when something is thrown out?
>
>Is there some way of listing or detailing the criteria in a way which
>potential readers could determine where theire script or character
>stands? It could perhaps be presented in a table or matrix form, so that
>people could look through the criteria and say "yep, it's fictional, yep
>people currently use it, no, there are no fonts yet" etc. Or maybe a
>decision tree would be better, where the criteria forks.
>
>I guess the first thing I need to collect is the criteria... Here are
>some starters to get the ball rolling:
>
>--Is this an entire script or additions to an existing script?
>
>--Is the script fictional?
>
>--Is the script in use? (as determined how???)
>
>--Does the character(s) already exist in some other part of the
>standard?
>
>--Is there a compelling reason for including a characther which would
>normally not be considered, due to legacy support issues?
>
>--Is the character a precomposed ligature which can be encoded using a
>sequence of existing character (possibly joined by ZWJ's)?
>
>--Is the character a precomposed "accented character" which can be
>composed using an
>existing character and one or more existing combining diacritics?
>
>--Is the character a clone of an existing character whose sole purpose
>is making a *logical* differentiation from some existing characters
>(e.g., hex digits looking
>identical to existing characters "0..9" and "A...F"; or a symbol for
>"meter"
>looking identical to Latin "m")?
>
>--Is the chracter a clone of an existing character whose sole purpose is
>making a
>*graphical* differentiation from some existing characters (e.g., a
>Serbian
>letter "t", disunified from Russian on the basis that italics looks
>different in the two languages)?
>
>--Is the character really a presentation glyph for a shape that can be
>obtained using regular characters in conjunction with ZWJ or ZWNJ?
>
>Again, additions, suggestions, or other help appreciated,
>
>Suzanne
>
>
>-Original Message-
>
>
>At 15:35 -0400 2002-07-02, Suzanne M. Topping wrote:
>
> >Apologies, I was sloppy with my phrasing; of course a script is
> >"written". What I really meant was that the script is in current use in
> >some sort of written form, as opposed

Re: FW: Inappropriate Proposals FAQ

2002-07-03 Thread Rick McGowan

Suzanne T asked:

> Can people from the review committee give me some hard and fast
> rules for when something is thrown out?

There's only one hard and fast rule that I know: when a majority of UTC  
members vote to NOT encode something.

I think the criteria that UTC representatives use to determine their votes  
would be along the lines of Suzanne's checklist. She might be able to come  
up with a list of questions such that some specific negative or positive  
answers would lead one to the conclusion that it is or isn't worth  
submitting a proposal for some specified thing.

One rule of thumb that people can also use: if an off-the-cuff proposal  
for a thing doesn't fly on the Unicode list, it is unlikely to fly in UTC.

Rick