Re: pronounciation character

2003-10-30 Thread Doug Ewell
Bhavin Sanghani wrote: > I m giving entity in my xml file with these decimal values (e.g. > ͞) and i convert it to html by applying css and xsl files. > Now if proper unicode font is selected then it shows proper chars > otherwise it shows question mark. > > So, following are some queries: >

Re: [hebrew] Re: Hebrew composition model, with cantillation marks

2003-10-30 Thread Mark E. Shoulson
Peter Kirk wrote: On 28/10/2003 18:49, Philippe Verdy wrote: I just finished an Excel speadsheet that shows the Hebrew composition model, and all the problems caused by the canonical order of Hebrew diacritics. In summary, most problems come from consonnant modifiers which have a combining clas

Re: [hebrew] Re: [OT]--history...Re: Re: meteg + hataf and text processing distinctions

2003-10-30 Thread John Cowan
John Hudson scripsit: > I can't comment on the precise history of the canonical combining classes > for Hebrew -- you'd need someone who was involved at the time --, but the > canonical combining classes are particular to Unicode, although other and > earlier character encoding standards may ha

pronounciation character

2003-10-30 Thread Bhavin Sanghani
Hi folks, I m facing one problem to show pronounciation diacritic chars in browser like double 'o' (decimal value 862)with overbar, reverse 'e' (decimal value 601), 'th' with underline (decimal values 863) etc...etc.. I m giving entity in my xml file with these decimal values (e.g. ͞) and

Re: UTF-9

2003-10-30 Thread John Cowan
Philippe Verdy scripsit: > Are there still now platforms where storage bytes are not octets but nonets? As others have written, the PDP-10/PDP-20 hardware is long obsolete. However, there are still emulators running on modern 32-bit and 64-bit hardware. That was the point of the remark about "the

RE: UTF-9

2003-10-30 Thread Addison Phillips [wM]
Pub date = 1 April 2003. I think that's the salient part. Addison P. Phillips Director, Globalization Architecture webMethods | Delivering Global Business Visibility http://www.webMethods.com Chair, W3C Internationalization (I18N) Working Group Chair, W3C-I18N-WG, Web Services Task Force http://w

Re: UTF-9

2003-10-30 Thread Kenneth Whistler
John Cowan wrote: > > http://panda.com/tops-20/utf9.txt > > > > Res ipsa loquitur. But apparently res ipsa non loquitur, because Phillipe continued: > > Are there still now platforms where storage bytes are not octets but nonets? > i.e. 9-bit based platforms? If so this proposal makes sense, bu

Re: UTF-9

2003-10-30 Thread Philippe Verdy
From: "Stefan Persson" <[EMAIL PROTECTED]> > Philippe Verdy wrote: > > Are there still now platforms where storage bytes are not octets but nonets? > > i.e. 9-bit based platforms? If so this proposal makes sense, but as a local > > optimization for these platforms. Problems will code if you want t

Re: UTF-9

2003-10-30 Thread Stefan Persson
Philippe Verdy wrote: Are there still now platforms where storage bytes are not octets but nonets? i.e. 9-bit based platforms? If so this proposal makes sense, but as a local optimization for these platforms. Problems will code if you want to interchange this data with the large majority of hosts t

Re: UTF-9

2003-10-30 Thread Philippe Verdy
From: "John Cowan" <[EMAIL PROTECTED]> > http://panda.com/tops-20/utf9.txt > > Res ipsa loquitur. Are there still now platforms where storage bytes are not octets but nonets? i.e. 9-bit based platforms? If so this proposal makes sense, but as a local optimization for these platforms. Problems wil

Re: Terminology verification

2003-10-30 Thread Philippe Verdy
From: "Lars Marius Garshol" <[EMAIL PROTECTED]> > Does this make sense? Is "code point" the right term, or should I say > "scalar value"? And what about "abstract character"? Are two equal > sequences of code points in NFC necessarily composed of the same > sequence of abstract characters? As Uni

Re: Merging combining classes, was: New contribution N2676

2003-10-30 Thread Jim Allan
Philppe Verdy posted: I do think the opposite: one can fold all commas below to cedillas by default, and, in a Romanian or Latvian context, fold all cedillas below to commas below. I see no difference. Folding either way will find all occurrences of cedilla or comma below. The direction of fold

Re: Terminology verification

2003-10-30 Thread Lars Marius Garshol
* Lars Marius Garshol | | The definition currently says: | | String | Strings are sequences of Unicode code points | conforming to Unicode Normalization Form C . * Kenneth Whistler | | I really think this is asking for trouble. A string data type should | be specified in terms of specific

UTF-9

2003-10-30 Thread John Cowan
http://panda.com/tops-20/utf9.txt Res ipsa loquitur. -- I marvel at the creature: so secret and John Cowan so sly as he is, to come sporting in the pool [EMAIL PROTECTED] before our very window. Does he think that http://www.reutershealth.com Men sleep without watch all night? --

Re: Terminology verification

2003-10-30 Thread Kenneth Whistler
Lars Marius Garshol asked: > I'm working on a specification for a data model and would like to > check that my definition of the string type makes sense. Well, language designers and data modelers may want to chime in with alternate opinions, but here is my two cents on this topic. > > The defi

Terminology verification

2003-10-30 Thread Lars Marius Garshol
I'm working on a specification for a data model and would like to check that my definition of the string type makes sense. The definition currently says: String Strings are sequences of Unicode code points conforming to Unicode Normalization Form C . Strings are equal if they consist

Re: Merging combining classes, was: New contribution N2676

2003-10-30 Thread Philippe Verdy
- Original Message - From: "Jim Allan" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Thursday, October 30, 2003 4:48 PM Subject: Re: Merging combining classes, was: New contribution N2676 > I offered a suggestion on cedilla and combining undercomma: > > > / It seems to me that Cedil

Re: Where is CEN TC304 ?

2003-10-30 Thread John Cowan
Philippe Verdy scripsit: > Can this link to CEN TC304 be updated ? Where can we find the results of its > (past?) activities? In the ISO10646 repertoires collection? The CEN Workshop Agreement defining MES-{1,2,3A,3B} can be found at http://www.evertype.com/standards/iso10646/pdf/cwa13873.pdf .

Re: Merging combining classes, was: New contribution N2676

2003-10-30 Thread Jim Allan
I offered a suggestion on cedilla and combining undercomma: / It seems to me that Cedilla/undercomma folding would be a useful / /addition to "Character Foldings" at http://www.unicode.org/reports/tr30. / and Philippe Verdy responded: Excellent idea, however it has to be tailored by language:

Where is CEN TC304 ?

2003-10-30 Thread Philippe Verdy
Note that I can find some references for the MES-* subsets in Google, but in various web sites, and I don't know which one is good and complete. For example I have these summary pages for MES-1 and MES-2: http://www.cl.cam.ac.uk/~mgk25/ucs/mes-1-rationale.html http://www.cl.cam.ac.uk/~mgk25/ucs/

Re: Merging combining classes, was: New contribution N2676

2003-10-30 Thread jon
> On 29/10/2003 15:07, John Cowan wrote: > > >Not necessarily. A process may check its input for normalization and > >reject it if it is not normalized, and XML consumers are encouraged > >(not required) to do so. > > > > > > > This looks to me like a clear breach of C9, at least of the derived

Where is CEN TC304 ?

2003-10-30 Thread Philippe Verdy
Just a little question: the unicode web site (example at: http://www.unicode.org/timesens/calendar.html) includes a link to what was supposed to be the information page for CEN TC304 (European Standards): http://www.stri.is/TC304/default.html This is no longer working, and even the Icelandic domai

Re: Hacek - Typing from a keyboard... Help!!!!

2003-10-30 Thread Toyin Ryan
Thanks Philippe Philippe Verdy wrote: > From: "Toyin Ryan" <[EMAIL PROTECTED]> > > > Thank you Philippe. The characters you detailed below only seem to work in > Word. > > They don't work in DBArtisan or netscape messenger or outlook. > > There are a lot of applications that can use and render th