Re: Variation Selection (Was Re: Unicode 3.2: BETA files updated)

2002-01-27 Thread Asmus Freytag

At 12:33 AM 1/27/02 -0800, Mark Davis \(jtcsv\) wrote:
I find it fairly pointless to say that a font supports the variation 
selection sequence U+03B8, U+FE00 if it does not provide a visual 
distinction from U+03B8; and such a distinction should be based on the 
entry description. Thus, of the following four fonts, only number 4 
correctly supports the sequence U+03B8, U+FE00. (Of course, any real 
font would have designs for the two glyphs that were a bit more harmonious!)

I couldn't view your examples, but guessing at what they might have been, 
I'll risk an answer.

The problem is that some of the distinctions for which variant selectors 
are used, are *needed only by a minority of users*. Some of the math 
variants (not PHI, or THETA, but the ones in StandardizedVariants) may only 
be needed by *some* math authors.

Any requirement that by supporting VS1, one must restrict the glyph range 
of the unmarked symbol forces either

a) the tail to wag the dog

or

b) many dogs to do without tail

Case a is where fonts assume the most restrictive glyph choice for the 
unmarked symbol, so that they can be used by all users. This is bad since 
the restricted glyph range for the unmarked symbol may be a somewhat 
unnatural one. Given the expense of creating a math font, it may not make 
sense to do one that is not usable by all math authors.

Case b is where fonts assert their choice of unmarked glyph and (because of 
the required contrast) choose not to support the VS1 form, since that's the 
form they want to use as the unmarked default. Given the expense of 
creating a math font, the sub-set of math authors may not have the 
werewithal to source a font for their needs, since most of the market can 
be covered by something that is 'good enough'.

By explicitly defining a way to restrict the glyph range (via coding 
another character, or another VS sequence) it becomes easier to support 
groups with related needs, but different requirements in the face of variants.

A./




Re: Additional Ethiopic characters?

2002-01-27 Thread Michael Everson

At 18:59 -0800 2002-01-26, John Hudson wrote:
What is the status of ISO/IEC JTC1/SC2/WG2 N1846, the Proposal to 
encode Ethiopic Extensions in the BMP...?

I see that (Eth.Ext.) is included in the BMP roadmap, but don't see 
any other information on the Unicode site.

Daniel Yacob was to get me samples of the characters in use, so we 
could update the proposal. That hasn't happened yet.
-- 
Michael Everson *** Everson Typography *** http://www.evertype.com




Re: POSITIVELY MUST READ! Bytext is here!

2002-01-27 Thread Michael Everson

At 17:54 -0500 2002-01-26, [EMAIL PROTECTED] wrote:

One of my favorite parts of Bytext was the section on Emoticons.  Certainly,
one thing that a serious competitor to Unicode must have is a rich set of
emoticons as single characters.  I've always felt the UTC was badly out of
touch with the user community by neglecting to encode TIRED-BORED FACE, QUESY
[sic] FACE, YUKKY FACE, and DROOLING FACE.

MISTER YUCK, actually, should be added to Unicode. Isn't it used on 
some poisonous cleaning products to warn children?

As far as Bytext goes, well, in my opinion, if the author had given 
as much energy to working on the Unicode standard as he did writing 
up that thesis of his, perhaps the end of his tweens might have been 
more fruitful.
-- 
Michael Everson *** Everson Typography *** http://www.evertype.com




Re: Additional Ethiopic characters?

2002-01-27 Thread Michael Everson

I suppose I should also say that Daniel and I are in touch and 
progress is being made, however slowly.
-- 
Michael Everson *** Everson Typography *** http://www.evertype.com




Re: POSITIVELY MUST READ! Bytext is here!

2002-01-27 Thread James Kass


Michael Everson wrote,


 MISTER YUCK, actually, should be added to Unicode. Isn't it used on
 some poisonous cleaning products to warn children?


If so, perhaps it is a glyph variant of skull and crossbones
at U+2620 (☠).

Best regards,

James Kass.





Re: POSITIVELY MUST READ! Bytext is here!

2002-01-27 Thread Michael Everson

At 06:56 -0800 2002-01-27, James Kass wrote:

   MISTER YUCK, actually, should be added to Unicode. Isn't it used on
   some poisonous cleaning products to warn children?

If so, perhaps it is a glyph variant of skull and crossbones
at U+2620 (’òÝ).

I don't think it is. Indeed it was introduced precisely because 
toddlers don't understand the skull and crossbones. They may both 
denote poison, but they are different things.
-- 
Michael Everson *** Everson Typography *** http://www.evertype.com




Re: Additional Ethiopic characters?

2002-01-27 Thread Daniel Yacob

 Daniel Yacob was to get me samples of the characters in use, so we 
 could update the proposal. That hasn't happened yet.


All good things come to those who wait./Hannibal

..and lots of good things are coming, however slowly ;)




Re: POSITIVELY MUST READ! Bytext is here!

2002-01-27 Thread James Kass


Michael Everson wrote,


 (MISTER YUCK)

 If so, perhaps it is a glyph variant of skull and crossbones
 at U+2620 ([Gratuitous UTF-8 Skull and Crossbones]).

 I don't think it is. Indeed it was introduced precisely because 
 toddlers don't understand the skull and crossbones. They may both 
 denote poison, but they are different things.

Neither do I, really.  It was a whimsical statement.  Should've used
a smiley.  

MORE WHIMSEY
Maybe emoticons should be considered as variants of the smiley,
and could be specifically provided for with variant selectors.
/MORE WHIMSEY

Seriously, the section in the bytext file about emoticons was 
most enjoyable.  Do people exchange information using emoticons?
MISTER YUCK:  graphic or symbol?

Best regards,

James Kass.







Re: POSITIVELY MUST READ! Bytext is here!

2002-01-27 Thread Patrick Andries



James Kass wrote:

Seriously, the section in the bytext file about emoticons was 
most enjoyable.  Do people exchange information using emoticons?
MISTER YUCK:  graphic or symbol?

Do people use exclamation marks to communicate ;-)

P. Andries






Re: POSITIVELY MUST READ! Bytext is here!

2002-01-27 Thread Michael Everson

At 08:41 -0800 2002-01-27, James Kass wrote:

Seriously, the section in the bytext file about emoticons was
most enjoyable.  Do people exchange information using emoticons?

Yes, but these are representable in plain text by Unicode now. ASCII!

MISTER YUCK:  graphic or symbol?

Symbol.
-- 
Michael Everson *** Everson Typography *** http://www.evertype.com




Re: Variation Selection (Was Re: Unicode 3.2: BETA files updated)

2002-01-27 Thread Mark Davis \(jtcsv\)

Sorry, I guess not all mailers handle embedded graphics in HTML messages. I
posted it so that you could see the graphics, on


http://www.macchiato.com/utc/variation_selection/variation_selection_followu
p.htm

It is *not* exactly the same. I added and rearranged the concrete examples
at the very end. I think those last examples of the font are key to this
issue: depending on which we say conformantly support the variation sequence
will help determine how we handle this issue.


I also posted my previous paper, although my thinking has changed a bit
since then, particularly on the 'tightness' of the description -- the
paragraph containing Suppose a glyph has a slash with an angle of 32° from
vertical.

 http://www.macchiato.com/utc/variation_selection/variation_selection.htm

Mark
—

Πόλλ’ ἠπίστατο ἔργα, κακῶς δ’ ἠπίστατο 
πάντα — Ὁμήρου Μαργίτῃ
[For transliteration, see http://oss.software.ibm.com/cgi-bin/icu/tr]

http://www.macchiato.com

- Original Message -
From: Asmus Freytag [EMAIL PROTECTED]
To: Mark Davis (jtcsv) [EMAIL PROTECTED]
Sent: Sunday, January 27, 2002 01:36
Subject: Re: Variation Selection (Was Re: Unicode 3.2: BETA files updated)


Now this message was clear as mud. Literally. As in totally opaque in the
pure visual sense: All the examples show as nice gray boxes in my mailer -
please make sure whatever you are sending is an attachment, and not in-line.

Thanks,
A./

(PS: They may well arrive in viewable condition when this reply gets back
to you..., but I can't read it)


—

Πόλλ’ ἠπίστατο ἔργα, κακῶς δ’ ἠπίστατο 
πάντα — Ὁμήρου Μαργίτῃ
[For transliteration, see http://oss.software.ibm.com/cgi-bin/icu/tr]

http://www.macchiato.com

- Original Message -
From: Asmus Freytag [EMAIL PROTECTED]
To: Mark Davis (jtcsv) [EMAIL PROTECTED]
Sent: Sunday, January 27, 2002 01:36
Subject: Re: Variation Selection (Was Re: Unicode 3.2: BETA files updated)


Now this message was clear as mud. Literally. As in totally opaque in the
pure visual sense: All the examples show as nice gray boxes in my mailer -
please make sure whatever you are sending is an attachment, and not in-line.

Thanks,
A./

(PS: They may well arrive in viewable condition when this reply gets back
to you..., but I can't read it)

At 12:33 AM 1/27/02 -0800, you wrote:


  The other possibility is to say that to be strictly Unicode-conformant,
  fonts should always use the reference glyph for unmarked characters
  (ignoring differences only of style). I think this is actually a better
 
  Boy, great minds to think alike. Mark Davis just proposed that in
  a paper to the UTC this week.

I would like to thank you for the compliment, but I must not have been
clear in my paper, since that is not really what I was proposing. Let me
try again, with a concrete case.

Since I don't have a variety of math fonts, I will use the example of
U+03B8 greek small letter theta. This character can be represented by the
following glyphs (collected on my system) and many more. Any of these are
acceptable, conformant representations of the theta, depending on the
design of the font.

1f908bf2.jpg

Now, there is already a presentation form for the open form of the
theta, at U+03D1 greek theta Symbol. A presentation form of a character
represents the same abstract character, but is restricted in format to a
subset of the possible glyphs that could represent the character, such as
the following:

1f908db5.jpg

But suppose there were no such character. In that case, we might add an
encoded variant, expressed as an entry in the StandardizedVariants.html
file in the UCD, looking something like:
Ref Glyph Character Sequence Alt Glyph Description of variant appearance
1f908e05.jpg 03B8, FE00 1f908e4b.jpg Open Theta, unconnected on the left
side

Now there are four key facts that we need to keep in mind:
* This does not, at all, prevent the character alone U+03B8 from
 having the same appearance as what is titled as the Alt Glyph. It is
 still a perfectly acceptable representation of that character in normal
text.
* What is titled Alt Glyph in this entry is also merely a
 representative, one of many possible glyphs that can represent the
 variant. Thus both of the glyphs are representative: they do not, and
 cannot, encompass all of the possible glyphs that can represent either
 the character alone, or the variant. (It would be a good idea for us to
 change the title of this column, since it may mislead some into thinking
 that that precise glyph must be used: it should be Variant Ref Glyph.)
* A key feature of the entry is the description; it provides the
 limitation on the set of glyphs that can be used to represent the
 sequence, if the sequence is supported by a font.
* If a font does not support the variation sequence given by this
 entry, that it is also perfectly acceptable to ignore the U+FE00, and
 thus render the sequence U+03B8, U+FE00 with precisely the same glyph
 as the single character U+03B8.
So the open 

Re: POSITIVELY MUST READ! Bytext is here!

2002-01-27 Thread Curtis Clark

At 06:56 AM 1/27/02, James Kass wrote:
If so, perhaps it is a glyph variant of skull and crossbones
at U+2620 (☠).

cheek contents=tongue amount=50%

I would argue against unification: the skull and crossed bones has 
additional meanings beyond poison. Although the vision of Disney's 
Pirates of the Caribbean flying the Yucky Roger is enticing, I can 
imagine the need for plain text use of U+2620 where the alternate glyph 
simply wouldn't do.

/cheek


-- 
Curtis Clark  http://www.csupomona.edu/~jcclark/
Mockingbird Font Works  http://www.mockfont.com/






Re: Variation Selection

2002-01-27 Thread Michael Everson

At 10:00 -0800 2002-01-27, Mark Davis \(jtcsv\) wrote:
Sorry, I guess not all mailers handle embedded graphics in HTML messages.

In the first place they do not all handle the, though I saw it.

In the second place, Mark, you must remember that not all of us live 
in the US where a local call is free and many of you have broadband. 
If you doubt this you should look at my monthly phone bill, of which 
an enormous percentage is internet connection time. Metered by the 
minute. Check out the IrelandOffline link on my website if you want 
to know more.

Unicode list policy is to NOT send attachments to the lists. Please 
post to your server and send the URLs.

Thank you.
-- 
Michael Everson *** Everson Typography *** http://www.evertype.com




Re: Variation Selection (Was Re: Unicode 3.2: BETA files updated)

2002-01-27 Thread Mark Davis \(jtcsv\)

It sounds like what you are saying, in concrete terms, is that Font #6
at the bottom of:

http://www.macchiato.com/utc/variation_selection/variation_selection_f
ollowup.htm

is conformant. If that is so, then we would have to have an additional
VS to select the closed form of the glyph. In that case, one could
only depend on a visual distinction based upon the description if the
font supported both of the VS sequences. I can see your point.

Mark
—

Πόλλ’ ἠπίστατο ἔργα, κακῶς δ’ ἠπίστατο 
πάντα — Ὁμήρου Μαργίτῃ
[For transliteration, see http://oss.software.ibm.com/cgi-bin/icu/tr]

http://www.macchiato.com

- Original Message -
From: Asmus Freytag [EMAIL PROTECTED]
To: Mark Davis (jtcsv) [EMAIL PROTECTED]; David Hopwood
[EMAIL PROTECTED]; [EMAIL PROTECTED]; Unicore
[EMAIL PROTECTED]; [EMAIL PROTECTED]
Sent: Sunday, January 27, 2002 01:49
Subject: Re: Variation Selection (Was Re: Unicode 3.2: BETA files
updated)


 At 12:33 AM 1/27/02 -0800, Mark Davis \(jtcsv\) wrote:
 I find it fairly pointless to say that a font supports the
variation
 selection sequence U+03B8, U+FE00 if it does not provide a
visual
 distinction from U+03B8; and such a distinction should be based
on the
 entry description. Thus, of the following four fonts, only number
4
 correctly supports the sequence U+03B8, U+FE00. (Of course, any
real
 font would have designs for the two glyphs that were a bit more
harmonious!)

 I couldn't view your examples, but guessing at what they might have
been,
 I'll risk an answer.

 The problem is that some of the distinctions for which variant
selectors
 are used, are *needed only by a minority of users*. Some of the math
 variants (not PHI, or THETA, but the ones in StandardizedVariants)
may only
 be needed by *some* math authors.

 Any requirement that by supporting VS1, one must restrict the glyph
range
 of the unmarked symbol forces either

 a) the tail to wag the dog

 or

 b) many dogs to do without tail

 Case a is where fonts assume the most restrictive glyph choice for
the
 unmarked symbol, so that they can be used by all users. This is bad
since
 the restricted glyph range for the unmarked symbol may be a somewhat
 unnatural one. Given the expense of creating a math font, it may not
make
 sense to do one that is not usable by all math authors.

 Case b is where fonts assert their choice of unmarked glyph and
(because of
 the required contrast) choose not to support the VS1 form, since
that's the
 form they want to use as the unmarked default. Given the expense of
 creating a math font, the sub-set of math authors may not have the
 werewithal to source a font for their needs, since most of the
market can
 be covered by something that is 'good enough'.

 By explicitly defining a way to restrict the glyph range (via coding
 another character, or another VS sequence) it becomes easier to
support
 groups with related needs, but different requirements in the face of
variants.

 A./







Re: POSITIVELY MUST READ! Bytext is here!

2002-01-27 Thread James Kass


Michael Everson wrote,

...  graphic or symbol?
 
 Symbol.
 -- 

Because it signifies something specific while a graphic would be
subject to random interpretations.

Curtis Clark wrote,
 I would argue against unification: the skull and crossed bones has 
 additional meanings beyond poison. Although the vision of Disney's 
 Pirates of the Caribbean flying the Yucky Roger is enticing, 
 ...

Quite enticing.  And the notion of making emoticons variants of the 
smiley because facial expressions are variations of the face is out, too, 
as the various glyphs each signify something completely different 
from smiley.

Patrick Andries wrote,

 Do people use exclamation marks to communicate ;-)

Yes!

Best regards,

James Kass.






Re: Variation Selection

2002-01-27 Thread $B$m!;!;!;!;(B $B$m!;!;!;(B

Unicode list policy is to NOT send attachments to the lists. Please
post to your server and send the URLs.

First, have we all servers?
Second, if it is a small attachment, the number of bytes transmitted to GO 
there may well be greater than the number to send the bloody thing. How 
about an arbitrary attachment limit of 1535 ($B8lO$9g$o$;!'$R$a$_$3(B) bytes?


$B"*!!$8$e$&$$$C$A$c$s!!"+(B
$B!!$@$s$;$$$i$7$5$`$h$&(B

_
$B4JC1$K=PMh$k3Z$7$$%"%k%P%`!#M'C#$KEO$9$N$b%a!<%k$G4JC1$K!*(B
http://photos.msn.co.jp/


Re: Variation Selection

2002-01-27 Thread Barry Caplan

At 10:29 PM 1/27/2002 -0500, you wrote:
In a message dated 2002-01-27
18:51:35 Pacific Standard Time, 
[EMAIL PROTECTED] writes:
 First, have we all servers?
No. Assuming we all do is no better than assuming we all have
broadband or 
T1 connections.
Yes, we do all have servers:
Yahoo is your friend - you can get an unlimited number of 6mb (I think,
maybe more) accounts for free. Store images in
briefcase.yahoo.com/yourid. Store any files in
briefcase.yahoo.com/yourid.
Also, this list is mirrored on a yahoo group. the group has storage space
too. I don't know ho the moderator of that group is, but maybe he/she can
assist.
In any of these cases, all that needs to be passed to the list is the
url.
Frankly, the issue of unexpected attachments in email is not the size for
me, but it does cause me security concerns. I would much rather decide
whether or not to download a file then to wake up one morning with a
virus or worse.


Best Regards,
Barry Caplan
[EMAIL PROTECTED]
www.i18n.com
- coming soon, preview available now
News | Tools | Process for Global Software
Team I18N



attachment size (was: Re: Variation Selection)

2002-01-27 Thread $B$m!;!;!;!;(B $B$m!;!;!;(B



From: [EMAIL PROTECTED]
To: [EMAIL PROTECTED]
CC: [EMAIL PROTECTED]
Subject: Re: Variation Selection
Date: Sun, 27 Jan 2002 22:29:02 EST

In a message dated 2002-01-27 18:51:35 Pacific Standard Time,
[EMAIL PROTECTED] writes:

  First, have we all servers?

No.  Assuming we all do is no better than assuming we all have broadband 
or
T1 connections.

  Second, if it is a small attachment, the number of bytes transmitted to 
GO
  there may well be greater than the number to send the bloody thing. How
  about an arbitrary attachment limit of 1535 
($B%JDE%'%RW$%D:d(B€$B!#c`TE%ec`TD%1%D!'c`(B
$BTE@d(B€$BTE^d(B€$BTEZd(B€$BTD%"(B)
bytes?

I just sent a 500-byte attachment to the Unicode list without any qualms.
Almost all messages are longer than 500 bytes, especially when you take
headers into account.  However, there is certainly such a thing as "too
long," and I have no idea where to draw the line and say "this attachment 
is
short enough, that one is too long."

-Doug Ewell
  Fullerton, California

I just said 1535 because I like the word "himemiko". Also, it is just under 
1 1/2 KB, and that attachment size is good for small black-and-white 
illustrations (for character placement, etc.) Some Swedish (?) person came 
up with an excellent use for some illustrations about a month ago, when 
discussing a Swedish ampersand and a few other characters on this list.

Also, Sarasvati's birth year should be an agreeable number for her to use 
as a file size limit. And since I never give a lady's age, remember, it is 
not necessarily a Gregorian year!




$B"*!!$8$e$&$$$C$A$c$s!!"+(B
$B!!$@$s$;$$$i$7$5$`$h$&(B


_
$B%a!<%k%5!<%S%9$O!"@$3&(B No.1 $B$N(B MSN Hotmail 
$B$G!*(Bhttp://www.hotmail.com/JA/


Re: Variation Selection

2002-01-27 Thread DougEwell2

In a message dated 2002-01-27 18:51:35 Pacific Standard Time, 
[EMAIL PROTECTED] writes:

 First, have we all servers?

No.  Assuming we all do is no better than assuming we all have broadband or 
T1 connections.

 Second, if it is a small attachment, the number of bytes transmitted to GO 
 there may well be greater than the number to send the bloody thing. How 
 about an arbitrary attachment limit of 1535 
(Œê˜C‡‚킹F‚Ђ߂݂±) 
bytes?

I just sent a 500-byte attachment to the Unicode list without any qualms.  
Almost all messages are longer than 500 bytes, especially when you take 
headers into account.  However, there is certainly such a thing as too 
long, and I have no idea where to draw the line and say this attachment is 
short enough, that one is too long.

-Doug Ewell
 Fullerton, California




Re: attachment size

2002-01-27 Thread Sarasvati

Dear Little Ones:

Michael E suggested:

 Unicode list policy is to NOT send attachments to the lists.

to which Doug E. replied:

 I just sent a 500-byte attachment to the Unicode list
 without any qualms.

The policy is rather to let the piranhas chew up people who send
huge attachments. Little attachment are not troublesome.

This list has a 52k limit on message body size, regardless
of content, and there is not a specific policy that I care
to enforce about little attachments or mail formats.

Use good judgement, please.

Little [EMAIL PROTECTED] at hotmail.com suggested using
or Sarasvati's birth year. However, in terms of C.E., that is
an inappropriately large negative number.

By the way, the other day on this list Michael K called
out my name, but I thought you all were reaching the correct
conclusion without interference, and I see the controversy
has blown over.

Cheery regards from your,
-- Sarasvati