Re: Biblical Hebrew

2003-06-26 Thread Kenneth Whistler
John Hudson wrote: > At 03:52 PM 6/26/2003, Rick McGowan wrote: > > >I'll weigh in to agree with Ken here. The solution of cloning a whole set > >of these things just to fix combining behavior is, to understate, not quite > >nice. > > No, but would be far from the not nicest thing in Unicode, an

Re: Biblical Hebrew (Was: Major Defect in Combining Classes of Tibetan Vowels)

2003-06-26 Thread Kenneth Whistler
John, > At 03:36 PM 6/26/2003, Kenneth Whistler wrote: > > >Why is making use of the existing behavior of existing characters > >a "groanable kludge", if it has the desired effect and makes > >the required distinctions in text? If there is not some > >rendering system or font lookup showstopper h

Re: Biblical Hebrew

2003-06-26 Thread Mark Davis
1. I agree with Ken about the current lack of precedent for Cfs before combining marks. Interestingly, that we do have a proposal to do just that, in http://www.unicode.org/review/pr-9.pdf However, note that the whole purpose of putting the Cf after the Ra is to separate it from the halant, so th

Re: Biblical Hebrew (Was: Major Defect in Combining Classes of Tibetan Vowels)

2003-06-26 Thread Kenneth Whistler
Michael wrote: > At 15:36 -0700 2003-06-26, Kenneth Whistler wrote: > > >I now like better the suggestions of RLM or WJ for this. > > ZZZT. Thank you for playing. > > RLM is for forcing the right behaviour for stops and parentheses and > question marks and so on. Introducing it between two

Re: Biblical Hebrew

2003-06-26 Thread Kenneth Whistler
Rick wrote: > > I now like better the suggestions of RLM or WJ for this. > > I'll have to disagree with Ken. I'm not so sure about either of these. I > don't think anyone has, in the past, considered what conforming or > non-conforming behavior would be for a RLM or WJ between two combining

Re: Biblical Hebrew

2003-06-26 Thread John Hudson
At 03:52 PM 6/26/2003, Rick McGowan wrote: I'll weigh in to agree with Ken here. The solution of cloning a whole set of these things just to fix combining behavior is, to understate, not quite nice. No, but would be far from the not nicest thing in Unicode, and there's a really good reason for it

Re: Biblical Hebrew (Was: Major Defect in Combining Classes of Tibetan Vowels)

2003-06-26 Thread John Hudson
At 03:36 PM 6/26/2003, Kenneth Whistler wrote: Why is making use of the existing behavior of existing characters a "groanable kludge", if it has the desired effect and makes the required distinctions in text? If there is not some rendering system or font lookup showstopper here, I'm inclined to th

Re: Biblical Hebrew

2003-06-26 Thread Rick McGowan
Ken wrote... > I now like better the suggestions of RLM or WJ for this. I'll have to disagree with Ken. I'm not so sure about either of these. I don't think anyone has, in the past, considered what conforming or non-conforming behavior would be for a RLM or WJ between two combining marks. T

Re: Biblical Hebrew (Was: Major Defect in Combining Classes of Tibetan Vowels)

2003-06-26 Thread Michael Everson
At 15:36 -0700 2003-06-26, Kenneth Whistler wrote: I now like better the suggestions of RLM or WJ for this. ZZZT. Thank you for playing. RLM is for forcing the right behaviour for stops and parentheses and question marks and so on. Introducing it between two combining characters in Hebrew t

Re: Yerushala(y)im - or Biblical Hebrew (was Major Defect in Combining Classes of Tibetan Vowels)

2003-06-26 Thread John Hudson
At 03:04 PM 6/26/2003, Kenneth Whistler wrote: > How about RLM? This already belongs, naturally, in the context of the Hebrew text handling, which is going to have to handle bidi controls. Ouch. RLM is not expected to fall between combining marks. Not only does this not render correctly, Uniscri

Re: Biblical Hebrew (Was: Major Defect in Combining Classes of Tibetan Vowels)

2003-06-26 Thread John Hudson
At 02:45 PM 6/26/2003, Mark Davis wrote: Another consequence is that it separates the sequence into two combining sequences, not one. Don't know if this is a serious problem, especially since we are concerned with a limited domain with non-modern usage, but I wanted to mention it. It is a serious

Re: Biblical Hebrew (Was: Major Defect in Combining Classes of Tibetan Vowels)

2003-06-26 Thread Kenneth Whistler
Peter responded: > Ken Whistler wrote on 06/25/2003 06:57:56 PM: > > > People could consider, for example, representation > > of the required sequence: > > > > > > > > as: > > > > > > So, we want to introduce yet *another* distinct semantic for ZWJ? Actually, no, I don't. That was just

Re: Yerushala(y)im - or Biblical Hebrew (was Major Defect in Combining Classes of Tibetan Vowels)

2003-06-26 Thread Kenneth Whistler
Jony took the words right out of my mouth: > How about RLM? > > Jony This already belongs, naturally, in the context of the Hebrew text handling, which is going to have to handle bidi controls. Another possibility to consider is U+2060 WORD JOINER, the version of the zero width non-breaking spa

Re: Biblical Hebrew (Was: Major Defect in Combining Classes of Tibetan Vowels)

2003-06-26 Thread Mark Davis
Another consequence is that it separates the sequence into two combining sequences, not one. Don't know if this is a serious problem, especially since we are concerned with a limited domain with non-modern usage, but I wanted to mention it. Mark __ http://www.macchi

RE: Major Defect in Combining Classes of Tibetan Vowels (Hebrew)

2003-06-26 Thread Jony Rosenne
That may be what you see. Myself, every time I look at it, I see an orphaned Hiriq without a consonant. It is normally placed in between the Lamed and the Mem, to make certain the point isn't missed (a pun). Jony > -Original Message- > From: [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED]

Yerushala(y)im - or Biblical Hebrew (was Major Defect in Combining Classes of Tibetan Vowels)

2003-06-26 Thread Jony Rosenne
How about RLM? Jony > -Original Message- > From: [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED] On Behalf Of John Hudson > Sent: Thursday, June 26, 2003 6:36 PM > To: Jony Rosenne > Cc: [EMAIL PROTECTED] > Subject: SPAM: RE: Major Defect in Combining Classes of > Tibetan Vowels (Hebrew)

Re: Revised N2586R

2003-06-26 Thread Michael Everson
At 13:23 -0700 2003-06-26, Kenneth Whistler wrote: Not only is the name likely to change (based on all the issues already discussed), but it is conceivable that WG2 could decide to approve it at some other code position instead. Indeed I will probably propose to move the character on general pr

Re: Biblical Hebrew (Was: Major Defect in Combining Classes of Tibetan Vowels)

2003-06-26 Thread Kenneth Whistler
Peter replied to Karljürgen: > Karljürgen Feuerherm wrote on 06/25/2003 08:31:41 PM: > > > I was going to suggest something very similar, a ZW-pseudo-consonant of > some > > kind, which would force each vowel to be associated with one consonant. > > An invisible *consonant* doesn't make sense b

Re: Nightmares

2003-06-26 Thread Michael Everson
At 14:32 -0400 2003-06-26, John Cowan wrote: If you are going to discriminate (invidiously) using a computerized database, using H for Handicapped (or G for Gimp) will do just as well. Are you going to complain about the various symbols of religion already encoded on the same grounds? I am prepari

Re: Biblical Hebrew (Was: Major Defect in Combining Classes of Tibetan Vowels)

2003-06-26 Thread John Hudson
At 10:09 AM 6/26/2003, [EMAIL PROTECTED] wrote: > The Meteg is a completely different issue. There is a small number of places > were the Meteg is placed differently. Since it does not behave the same as > the regular Meteg, and is thus visually distinguishable, it should be > possible to add a

Re: Revised N2586R

2003-06-26 Thread Kenneth Whistler
Doug, Peter, and Michael already provided good responses to this suggestion by William O, but here is a little further clarification. > Well, certainly authority would be needed, yet I am suggesting that where a > few characters added into an established block are accepted, which is what > is clai

Re: Question about Unicode Ranges in TrueType fonts

2003-06-26 Thread Philippe Verdy
On Thursday, June 26, 2003 8:16 PM, Elisha Berns <[EMAIL PROTECTED]> wrote: > It would appear from your answer that even after implementing the > algorithm to search the Unicode block coverage of a font, the actual > comparison "data", that is which blocks to compare and how many code > points, is

RE: Question about Unicode Ranges in TrueType fonts

2003-06-26 Thread Kenneth Whistler
Elisha Berns asked: > It would appear from your answer that even after implementing the > algorithm to search the Unicode block coverage of a font, the actual > comparison "data", that is which blocks to compare and how many code > points, is totally undefined. Is there any kind of standard for >

Re: Question about Unicode Ranges in TrueType fonts

2003-06-26 Thread John Cowan
Elisha Berns scripsit: > It's odd to think that the old way of using Charset identifiers in fonts > worked a lot more cleanly for finding fonts matching a language/language > group. I would think this kind of core issue would be addressed more > cleanly by the font standard. Actually it worked b

Re: WHEELCHAIR (was Revised N2586R)

2003-06-26 Thread Karljürgen Feuerherm
WHEELCHAIR SYMBOL at least has the virtue of being descriptive of the symbol rather than of the use and thus potentially more neutral all the way around. K - Original Message - From: "Michael Everson" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Thursday, June 26, 2003 2:13 PM Subject

Re: Nightmares

2003-06-26 Thread John Cowan
William Overington scripsit: > This issue has arisen because of my concern that a particular symbol has > been labelled as HANDICAPPED SIGN. I hope that the name will be changed to > WHEELCHAIR SYMBOL. If you are going to discriminate (invidiously) using a computerized database, using H for Hand

Re: Question about Unicode Ranges in TrueType fonts

2003-06-26 Thread Philippe Verdy
On Thursday, June 26, 2003 4:13 PM, Andrew C. West <[EMAIL PROTECTED]> wrote: > On Thu, 26 Jun 2003 14:26:13 +0200, "Philippe Verdy" wrote: > > > Isn't there a work-around with the following function (quote from > > Microsoft MSDN): > > (with the caveat that you first need to allocate and fill a

RE: Question about Unicode Ranges in TrueType fonts

2003-06-26 Thread Elisha Berns
Andrew West wrote: > By looping through the "ranges" array it is possible to determine exactly > which > characters in which Unicode blocks a given font covers (as long as your > sofware > has an array of Unicode blocks and their codepoint ranges). > As long as your software has an up-to-date li

Re: Revised N2586R

2003-06-26 Thread Michael Everson
At 12:09 -0500 2003-06-26, [EMAIL PROTECTED] wrote: The only meaning that the Standard implies is that the character encoded at codepoint x represents they symbol of a wheelchair. It does not imply *anything* about how its usage in juxtaposition with the name of a person should be interpreted. Ind

Re: Revised N2586R

2003-06-26 Thread Michael Everson
At 13:03 +0100 2003-06-26, William Overington wrote: Well, certainly authority would be needed, yet I am suggesting that where a few characters added into an established block are accepted, which is what is claimed for these characters, there should be a faster route than having to wait for bulk r

Major Defects in Subject Lines!

2003-06-26 Thread Rick McGowan
Wow... How on earth did the subject line "Major Defect in Combining Classes of Tibetan Vowels" turn into a discussion of Biblical Hebrew? At least, people, if you're going to transmogrify the discussion, please use a subject line such as "Biblical Hebrew" which someone already was wise enou

Re: Revised N2586R

2003-06-26 Thread Peter_Constable
William Overington wrote on 06/26/2003 07:03:12 AM: > yet I am suggesting that where a > few characters added into an established block are accepted, which is what > is claimed for these characters, there should be a faster route than having > to wait for bulk release in Unicode 4.1. Once both

Re: Revised N2586R

2003-06-26 Thread Peter_Constable
William Overington wrote on 06/26/2003 06:24:44 AM: > > the name is simply a unique identifier within the std. > > Well, the Standard is the authority for what is the meaning of the symbol > when found in a file of plain text. So if the symbol is in a plain text > file before or after the

RE: Major Defect in Combining Classes of Tibetan Vowels (Hebrew)

2003-06-26 Thread Peter_Constable
Jony Rosenne wrote on 06/26/2003 06:26:02 AM: > It may look, silly, but it is correct. What you see are letters according to > the writing tradition, which does not include a Yod, and vowels according to > the reading tradition which does. I understand that. My point was, you were talking about

RE: Major Defect in Combining Classes of Tibetan Vowels (Hebrew)

2003-06-26 Thread John Hudson
At 04:26 AM 6/26/2003, Jony Rosenne wrote: I don't think we need any new characters, ZERO WIDTH SPACE would do and it requires no new semantics. ZERO WIDTH SPACE would screw up search and sort algorithms, I think, because it is not a control character per se and may not be ignored as desired. I'

Re: Biblical Hebrew (Was: Major Defect in Combining Classes of Tibetan Vowels)

2003-06-26 Thread John Hudson
At 12:43 AM 6/26/2003, [EMAIL PROTECTED] wrote: > The problem of combinations of vowels with meteg could be > amenable to a similar approach. OR, one could propose just > one additional meteq/silluq character, to make it possible > to distinguish (in plain text) instances of left-side and > right-

Re: Revised N2586R

2003-06-26 Thread Doug Ewell
William Overington wrote: > Well, certainly authority would be needed, yet I am suggesting that > where a few characters added into an established block are accepted, > which is what is claimed for these characters, there should be a > faster route than having to wait for bulk release in Unicode

Re: Question about Unicode Ranges in TrueType fonts

2003-06-26 Thread Andrew C. West
On Thu, 26 Jun 2003 14:26:13 +0200, "Philippe Verdy" wrote: > Isn't there a work-around with the following function (quote from Microsoft > MSDN): > (with the caveat that you first need to allocate and fill a Unicode string for > the > codepoints you want to test, and this can be lengthy if one wa

Re: Question about Unicode Ranges in TrueType fonts

2003-06-26 Thread Philippe Verdy
On Thursday, June 26, 2003 2:26 PM, Philippe Verdy <[EMAIL PROTECTED]> wrote: I forgot also the probably better function from the Uniscribe library, which processes strings through a language-dependant shaping algorithm, and can determine appropriate glyph substitution, or use custom composite f

Re: Question about Unicode Ranges in TrueType fonts

2003-06-26 Thread Philippe Verdy
On Thursday, June 26, 2003 11:50 AM, Andrew C. West <[EMAIL PROTECTED]> wrote: > On Wed, 25 Jun 2003 21:58:28 -0700, "Elisha Berns" wrote: > > > Some weeks back there were a number of postings about software for > > viewing Unicode Ranges in TrueType fonts and I had a few questions > > about that

Re: Nightmares

2003-06-26 Thread William Overington
Tom Gewecke wrote as follows. > My personal idea of an Orwellian nightmare would to have a committee of "vigilant freedom protectors" evaluating the "political and social implications of encoding symbols" and passing judgement on whether particular characters should be encoded and what their names

Re: Revised N2586R

2003-06-26 Thread William Overington
Peter Constable wrote as follows. > the name is simply a unique identifier within the std. Well, the Standard is the authority for what is the meaning of the symbol when found in a file of plain text. So if the symbol is in a plain text file before or after the name of a person then the Sta

Re: Revised N2586R

2003-06-26 Thread William Overington
Michael Everson wrote as follows. >At 08:44 -0700 2003-06-25, Doug Ewell wrote: > >>If it's true that either the UTC or WG2 has formally approved the character, for a future version of Unicode or a future amendment to 10646, then I don't see any reason why font makers can't PRODUCE a font with a g

RE: Major Defect in Combining Classes of Tibetan Vowels (Hebrew)

2003-06-26 Thread Jony Rosenne
It may look, silly, but it is correct. What you see are letters according to the writing tradition, which does not include a Yod, and vowels according to the reading tradition which does. There are in the Bible other, more extreme cases. I don't think we need any new characters, ZERO WIDTH SPACE

Re: Question about Unicode Ranges in TrueType fonts

2003-06-26 Thread Andrew C. West
On Wed, 25 Jun 2003 21:58:28 -0700, "Elisha Berns" wrote: > Some weeks back there were a number of postings about software for > viewing Unicode Ranges in TrueType fonts and I had a few questions about > that. Most viewers listed seemed to only check the Unicode Range bits of > the fonts which can

IUC23 Unicode conference exhibitors' panel report

2003-06-26 Thread Tex Texin
Hi, For those of you that couldn't attend and were interested in the exhibitor's panel at the last Unicode conference, a brief summary is now online at: http://www.unicode.org/iuc/iuc23/showcase-report.html If you have any comments or feedback on the page, I would be glad to receive it off-list.

Re: Uquivalence of some Japanes charcaters in Unicode

2003-06-26 Thread Tex Texin
Sourav, Hi, your question is ambiguous to me. You seem to be referring to the fullwidth space and other "wide" or "fullwidth" characters. For the fullwidth space look at u+3000 ideographic space. Unicode has other fullwidth characters encoded. Look at the code charts... hth tex souravm wrote: >

24th Unicode Conference - Atlanta, GA - September 3-5, 2003

2003-06-26 Thread Tex Texin
Twenty-fourth Internationalization and Unicode Conference (IUC24) Unicode, Internationalization, the Web: Powering Global Business http://www.unicode.org/iuc/iuc24

Re: Biblical Hebrew (Was: Major Defect in Combining Classes of TibetanVowels)

2003-06-26 Thread Peter_Constable
Karljürgen Feuerherm wrote on 06/25/2003 08:31:41 PM: > I was going to suggest something very similar, a ZW-pseudo-consonant of some > kind, which would force each vowel to be associated with one consonant. An invisible *consonant* doesn't make sense because the problem involves more than just

Re: Major Defect in Combining Classes of Tibetan Vowels

2003-06-26 Thread Peter_Constable
John Hudson wrote on 06/25/2003 06:47:44 PM: > >This is not. The Unicode Standard makes no assumptions or claims > >about what the phonological or meaning equivalence of > >or is for Biblical Hebrew. > > But it does make assumptions about the canonical equivalence of the mark > orders and ,

Re: Biblical Hebrew (Was: Major Defect in Combining Classes of TibetanVowels)

2003-06-26 Thread Peter_Constable
Ken Whistler wrote on 06/25/2003 06:57:56 PM: > People could consider, for example, representation > of the required sequence: > > > > as: > > So, we want to introduce yet *another* distinct semantic for ZWJ? We've got one for Indic, another for Arabic, another for ligatures (similar to

Re: Major Defect in Combining Classes of Tibetan Vowels (Hebrew)

2003-06-26 Thread Peter_Constable
Jony Rosenne wrote on 06/26/2003 12:16:22 AM: > When, in the Bible, one sees two vowels on a given consonant, it isn't so. That's silly. When one sees two vowels on a given consonant in the Bible, it *is* so: the two vowels are written there. It may not correspond to actual phonology, ie what

Re: Major Defect in Combining Classes of Tibetan Vowels

2003-06-26 Thread Peter_Constable
Michael Everson wrote on 06/25/2003 04:36:20 PM: [ re Biblical Hebrew ] > Write it up with glyphs and minimal pairs and people will see the > problem, if any. Or propose some solution. (That isn't "add duplicate > characters".) The only solution that UTC is willing to consider I have already s

Re: Major Defect in Combining Classes of Tibetan Vowels

2003-06-26 Thread Peter_Constable
John Cowan wrote on 06/25/2003 03:15:21 PM: > I don't understand how the current implementation "breaks BH text". > At worst, normalization may put various combining marks in a non-traditional > order, but all alternative orders are canonically equivalent anyway, and > no (ordinary) Unicode proce

Re: Major Defect in Combining Classes of Tibetan Vowels

2003-06-26 Thread Peter_Constable
Ken Whistler wrote on 06/25/2003 05:29:59 PM: > > The point is that hiriq before patah is *not* > > canonically equivalent to patah before hiriq, > > This is true. > > > except in the erroneous > > assumption of the Unicode Standard: the order of vowels makes words sound > > different and m