Re: [hebrew] Re: Collation contractions and reordering, was: Hebrew composition model, with cantillation marks

2003-11-12 Thread Philippe Verdy
From: "Kent Karlsson" <[EMAIL PROTECTED]> > The problem is that to express the alt 2 as an ICU tailoring I need > an "anchor" at level 3, which is ignored at levels 1 and 2. I'm not > sure if I can use a punctuation character (ignored at levels 1-3) as > such an anchor, esp. not since punctuation c

Re: elided base character or obliterated character (was: Hebrew composition model, with cantillation marks)

2003-11-07 Thread Doug Ewell
Andrew C. West wrote: > And given that most CJK fonts aim to cover both Chinese and Japanese > characters, how would the square missing ideograph glyph and the > Japanese geta mark be differentiated ? By means of variant selectors ? In the Windows world at least, most fonts that include any CJK

Re: elided base character or obliterated character (was: Hebrew composition model, with cantillation marks)

2003-11-07 Thread Andrew C. West
On Thu, 6 Nov 2003 12:51:53 -0500, John Cowan wrote: > > IIRC we talked about this a year or so ago, and kicked around the idea that > the Chinese square could be treated as a glyph variant of U+3013 GETA MARK, > which looks quite different but symbolizes the same thing. I suspect that few Chines

Re: Hebrew composition model, with cantillation marks

2003-11-06 Thread Doug Ewell
Rick McGowan wrote: > Andrew, There isn't a CJK list. Oops, sorry, I thought there was. I spent an admittedly brief time looking for a "list of Unicode lists," as Rick mentioned, but couldn't find one. I'm sure it's on the Web site in an obvious place. -Doug Ewell Fullerton, California http

Re: CJK mailing list (was: Hebrew composition model, with cantillation marks)

2003-11-06 Thread Philippe Verdy
From: "Rick McGowan" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Thursday, November 06, 2003 6:43 PM Subject: Re: Hebrew composition model, with cantillation marks > Andrew, There isn't a CJK list. > Rick CJK normalization at least does not cause so man

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-11-06 Thread Dean Snyder
Philippe Verdy wrote at 10:15 PM on Wednesday, November 5, 2003: >If it's not in the written text, it is not implied by the writer. If this were true, based on the fact that writers wrote very few of them, we would be faced with the implication that there were very few vowels indeed in the old He

RE: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-11-06 Thread Peter Constable
> But we Hebrew "experts" want our proposals to be reviewed in advance by > UTC members and others who understand the broad scope of Unicode... There have been several such people subscribed to the Hebrew list. Rambling verbose discussions are making some of them leave however. Peter Peter Con

[offline] RE: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-11-06 Thread Peter Constable
age- > From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On > Behalf Of Doug Ewell > Sent: Wednesday, November 05, 2003 3:54 PM > To: Unicode Mailing List > Subject: Re: [hebrew] Re: Hebrew composition model, with cantillation > marks > > Gads, how I wish there were a Hebre

Re: elided base character or obliterated character (was: Hebrew composition model, with cantillation marks)

2003-11-06 Thread John Cowan
Andrew C. West scripsit: > The problem of how to represent missing/obliterated characters in Unicode when > transcribing manuscript/printed texts and inscriptions, etc. has always > perplexed me. IIRC we talked about this a year or so ago, and kicked around the idea that the Chinese square could

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-11-06 Thread Peter Kirk
I agree with you here, Doug. I am copying this to the Hebrew list in the hope that those on both lists will follow this kind of procedure. Or does anyone have strong objections? On 06/11/2003 08:30, Doug Ewell wrote: ... Peter Kirk responded to Michael a few messages later: Please keep the

Re: Hebrew composition model, with cantillation marks

2003-11-06 Thread Rick McGowan
Andrew, There isn't a CJK list. Rick > CJK list ? Now if only there was a list of Unicode lists ...

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-11-06 Thread Andrew C. West
On Thu, 6 Nov 2003 08:30:24 -0800, "Doug Ewell" wrote: > > I can't help thinking that other specialized lists, such as those > for bidi and CJK, were created to resolve this exact type of problem. CJK list ? Now if only there was a list of Unicode lists ...

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-11-06 Thread Doug Ewell
Michael Everson wrote: > At 15:53 -0800 2003-11-05, Doug Ewell wrote: >> Gads, how I wish there were a Hebrew-specific list where these >> protracted Hebrew-specific discussions could take place. > > There is. [EMAIL PROTECTED] I know. I was being facetious. Peter Kirk responded to Michael a

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-11-06 Thread Peter Kirk
On 06/11/2003 05:14, Michael Everson wrote: At 04:55 -0800 2003-11-06, Peter Kirk wrote: We need to work towards some real proposals for improving Hebrew support, not just chat. But who is going to know about these proposals and assess them if they are not on the Hebrew list, and if discussion

Re: elided base character or obliterated character (was: Hebrew composition model, with cantillation marks)

2003-11-06 Thread Andrew C. West
On Wed, 5 Nov 2003 12:24:00 +0100, "Philippe Verdy" wrote: > > The obliterated character needed for paleolitic studies, or to encode any > texts in which the character is not recognizable already exists: isn't it > the REPLACEMENT CHARACTER? > The problem of how to represent missing/obliterated

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-11-06 Thread Michael Everson
At 04:55 -0800 2003-11-06, Peter Kirk wrote: We need to work towards some real proposals for improving Hebrew support, not just chat. But who is going to know about these proposals and assess them if they are not on the Hebrew list, and if discussion of Hebrew is not allowed on the main list? P

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-11-06 Thread Peter Kirk
On 06/11/2003 02:42, Michael Everson wrote: There is. [EMAIL PROTECTED] I just unsubscribed from it because I just can't track the volume of what's being discussed there. Understandable, but sad. When new people join a discussion like that they often have a lot of questions which need answering

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-11-06 Thread Michael Everson
At 15:53 -0800 2003-11-05, Doug Ewell wrote: Gads, how I wish there were a Hebrew-specific list where these protracted Hebrew-specific discussions could take place. There is. [EMAIL PROTECTED] I just unsubscribed from it because I just can't track the volume of what's being discussed there. -- Mi

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-11-05 Thread Doug Ewell
Gads, how I wish there were a Hebrew-specific list where these protracted Hebrew-specific discussions could take place. -Doug Ewell Fullerton, California http://users.adelphia.net/~dewell/

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-11-05 Thread Philippe Verdy
From: "Jony Rosenne" <[EMAIL PROTECTED]> > Philippe Verdy wrote: > > > In fact, even spelling errors are often made on purpose, and > > wanted by their authors. What seems a spelling error at one > > time often happens to become a normal spelling for these > > words (look at the many abreviated fo

Re: elided base character or obliterated character (was: Hebrew composition model, with cantillation marks)

2003-11-05 Thread John Hudson
At 09:55 AM 11/5/2003, John Cowan wrote: > I think this is a typographical decision, so perhaps a glyph issue. Absolutely. > Personally, there is no way I'd let a rounded box with oblique hatches > anywhere near any scholarly work that I was typesetting. :) What glyph would you use for "indeciph

Re: elided base character or obliterated character (was: Hebrew composition model, with cantillation marks)

2003-11-05 Thread John Cowan
John Hudson scripsit: > I think this is a typographical decision, so perhaps a glyph issue. Absolutely. > Personally, there is no way I'd let a rounded box with oblique hatches > anywhere near any scholarly work that I was typesetting. :) What glyph would you use for "indecipherable character

Re: elided base character or obliterated character (was: Hebrew composition model, with cantillation marks)

2003-11-05 Thread John Hudson
At 03:24 AM 11/5/2003, Philippe Verdy wrote: The obliterated character needed for paleolitic studies, or to encode any texts in which the character is not recognizable already exists: isn't it the REPLACEMENT CHARACTER? Such base character should be rendered with a spacing glyph (for example a cir

Re: elided base character or obliterated character (was: Hebrew composition model, with cantillation marks)

2003-11-05 Thread Philippe Verdy
From: "John Hudson" <[EMAIL PROTECTED]> > At 07:20 PM 11/4/2003, Jony Rosenne wrote: > > >I was thinking we need a zero width non-breaking character. Maybe we mean > >the same. > > I think Peter and I were thinking of two characters: one spacing and one > zero-width, both non-breaking. Both would

Re: Collation contractions and reordering, was: Hebrew composition model, with cantillation marks

2003-11-04 Thread Markus Scherer
Peter Kirk wrote: On 03/11/2003 15:26, Markus Scherer wrote: I suggest you try it out - http://oss.software.ibm.com/cgi-bin/icu/lx/en_US/utf-8/?_=he&EXPLORE_CollationElements= ICU implements the UCA, including discontiguous contractions. Thank you, Markus. Unfortunately the results are barely u

RE: Collation contractions and reordering, was: Hebrew composition model, with cantillation marks

2003-11-04 Thread Kent Karlsson
[this should go back to the Hebrew list only...] > But your mention of ignoring non-blocking combining marks when > processing contractions made me look at the newly released > http://www.unicode.org/reports/tr10/. I noticed there for the first > time, maybe because they are there for the first

Re: Collation contractions and reordering, was: Hebrew composition model, with cantillation marks

2003-11-04 Thread Peter Kirk
On 03/11/2003 15:26, Markus Scherer wrote: I suggest you try it out - http://oss.software.ibm.com/cgi-bin/icu/lx/en_US/utf-8/?_=he&EXPLORE_CollationElements= ICU implements the UCA, including discontiguous contractions. markus Thank you, Markus. Unfortunately the results are barely usable bec

[hebrew] Re: variation selectors for combining characters (was: Hebrew composition model, with cantillation marks)

2003-11-04 Thread Philippe Verdy
From: "Philippe Verdy" <[EMAIL PROTECTED]> > All that can be done is to create a new variation selector for combining > characters. It could be created: > - either within a new generic set of variation selectors for combining > characters (noted CVSn here) to produce sequences like METEG>; >

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-11-04 Thread Philippe Verdy
From: "Peter Kirk" <[EMAIL PROTECTED]> > The only issue is the > medial meteg, which can be solved most simply by defining a new medial > meteg character (or variation selector wiith meteg) which always > combines with an adjacent hataf vowel. According to Chapter 15 of the Unicode Standard:

Re: Collation contractions and reordering, was: Hebrew composition model, with cantillation marks

2003-11-03 Thread Markus Scherer
I suggest you try it out - http://oss.software.ibm.com/cgi-bin/icu/lx/en_US/utf-8/?_=he&EXPLORE_CollationElements= ICU implements the UCA, including discontiguous contractions. markus Peter Kirk wrote: On 03/11/2003 07:01, Kent Karlsson wrote: However, the UCA does ignore differences between or

Collation contractions and reordering, was: Hebrew composition model, with cantillation marks

2003-11-03 Thread Peter Kirk
On 03/11/2003 07:01, Kent Karlsson wrote: ... However, the UCA does ignore differences between order of *"non-blocking"* (**different** non-zero combining classes) combining marks **when processing contractions**. ... Kent, thanks for the hint. For the last few weeks I have been complaining l

RE: Hebrew composition model, with cantillation marks

2003-11-03 Thread Kent Karlsson
[I'm not sure why this Hebrew thread has migrated back to the general Unicode list (from the Hebrew list); but my comments below aren't specific to Hebrew.] Jony Rosenne wrote: ... > > As they will share the same combining class 220, the > > canonical ordering will preserve their relative order

Re: Hebrew composition model, with cantillation marks

2003-11-03 Thread Peter Kirk
On 02/11/2003 21:55, Jony Rosenne wrote: I don't see any basis for saying "now generally considered misguided". Some people don't like them. Some of the reasons given were based on a misunderstanding. Jony Well, for example Ken Whistler wrote in http://www.unicode.org/faq/normalization.html:

RE: Hebrew composition model, with cantillation marks

2003-11-02 Thread Jony Rosenne
r 03, 2003 1:37 AM > To: Jony Rosenne > Cc: 'Philippe Verdy'; Unicode > Subject: Re: Hebrew composition model, with cantillation marks > > > Currently the only such sequences in Hebrew are sequences of > accents and > so of significance for collation only at th

Re: Hebrew composition model, with cantillation marks

2003-11-02 Thread Peter Kirk
On 02/11/2003 12:00, Jony Rosenne wrote: I meant the collation order as it applies to Hebrew, of course. Jony OK, so you mean not the collation algorithm but the default (DUCET) collation data for Hebrew. There are still question marks about the adequacy of this data, and there is a clear n

Re: Hebrew composition model, with cantillation marks

2003-11-02 Thread Peter Kirk
On 02/11/2003 08:37, Jony Rosenne wrote: As they will share the same combining class 220, the canonical ordering will preserve their relative order Although normalization preserves the order of combining marks of the same class, I think no meaning should be attached to it, for two reasons: T

RE: Hebrew composition model, with cantillation marks

2003-11-02 Thread Jony Rosenne
> > As they will share the same combining class 220, the > canonical ordering will preserve their relative order Although normalization preserves the order of combining marks of the same class, I think no meaning should be attached to it, for two reasons: The collation algorithm ignores such di

RE: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-10-31 Thread Peter Constable
Folks, we should not be cross-posting on these threads. The Hebrew list was created to get these long Hebrew threads off the Unicode list; I think they should stay there. Peter Peter Constable Globalization Infrastructure and Font Technologies Microsoft Windows Division

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-10-31 Thread Philippe Verdy
After careful analysis of the rendering versus canonical ordering problem of dagesh/rafe/varika after shin/sin dots, and before vowels, I may conclude that the sil.org proposal is completely not needed for rendering, as the existing encoding already complies with Biblical Hebrew with exactly the sa

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-10-31 Thread Peter Kirk
On 30/10/2003 21:15, Mark E. Shoulson wrote: Peter Kirk wrote: On 28/10/2003 18:49, Philippe Verdy wrote: I just finished an Excel speadsheet that shows the Hebrew composition model, and all the problems caused by the canonical order of Hebrew diacritics. In summary, most problems come from c

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-10-30 Thread Mark E. Shoulson
Peter Kirk wrote: On 28/10/2003 18:49, Philippe Verdy wrote: I just finished an Excel speadsheet that shows the Hebrew composition model, and all the problems caused by the canonical order of Hebrew diacritics. In summary, most problems come from consonnant modifiers which have a combining clas

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-10-29 Thread Philippe Verdy
From: "Peter Kirk" <[EMAIL PROTECTED]> > On 29/10/2003 10:46, John Hudson wrote: > > > While we're about it, we could propose a spacing, non-breaking ELIDED > > CHARACTER for use in ketiv/qere where combining marks need to be > > applied to empty space within a word. > > How would this differ from

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-10-29 Thread Peter Kirk
On 29/10/2003 10:46, John Hudson wrote: At 10:26 AM 10/29/2003, Philippe Verdy wrote: The problem I see here is that ZWJ is not intended to create ligatures between diacritics, only between clusters that would otherwise still be a single combining sequence. Normally CGJ would have fitted better

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-10-29 Thread Peter Kirk
On 29/10/2003 10:26, Philippe Verdy wrote: ... The problem I see here is that ZWJ is not intended to create ligatures between diacritics, only between clusters that would otherwise still be a single combining sequence. Normally CGJ would have fitted better there, but this conflicts with the inten

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-10-29 Thread John Hudson
At 10:26 AM 10/29/2003, Philippe Verdy wrote: The problem I see here is that ZWJ is not intended to create ligatures between diacritics, only between clusters that would otherwise still be a single combining sequence. Normally CGJ would have fitted better there, but this conflicts with the intent

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-10-29 Thread John Hudson
At 10:26 AM 10/29/2003, Philippe Verdy wrote: In the sil.org proposal, the medial meteg is missing, but not the right and left meteg, as they are encoded within the same class and their order is preserved when attached to a vowel. It is not missing, per se. It was presumed that the medial meteg wo

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-10-29 Thread Philippe Verdy
From: "John Hudson" <[EMAIL PROTECTED]> > At 08:17 AM 10/29/2003, Peter Kirk wrote: > > >Normally meteg is positioned below and to the left of any other low > >centred mark. Less frequently it is positioned to the right of a low > >centred mark. But it is always to the left of a low right mark i.e.

Re: Hebrew composition model, with cantillation marks

2003-10-29 Thread Philippe Verdy
> Normally meteg is positioned below and to the left of any other low > centred mark. Less frequently it is positioned to the right of a low > centred mark. But it is always to the left of a low right mark i.e. > yetiv or dehi. It can also be centred within a hataf vowel. In > http://www.qaya.org/a

Re: Hebrew composition model, with cantillation marks

2003-10-29 Thread Peter Kirk
Thank you, Philippe. I include the full text of your posting plus the attachment for the benefit of those on the Unicode Hebrew list who have missed out on this. Some of the issues here have already been discussed on that list. Also I wonder if you have seen http://scripts.sil.org/cms/sites/nrs

Re: Hebrew composition model, with cantillation marks

2003-10-28 Thread Philippe Verdy
I just finished an Excel speadsheet that shows the Hebrew composition model, and all the problems caused by the canonical order of Hebrew diacritics. In summary, most problems come from consonnant modifiers which have a combining class higher than vowels or vowel modifiers. If vowels had been ass

Hebrew composition model, with cantillation marks

2003-10-28 Thread Philippe Verdy
This is a separate issue (not strictly related to combining classes or order), related to the current content of the description of the Hebrew script in the Unicode reference (chapter 8.1) I see that the description includes the following text: [quote] When points and marks are located below the