Re: Indic LLA sorting

2013-10-20 Thread Anubhav Chattoraj
On 10/21/2013 05:17 AM, Richard Wordingham wrote: > I am surprised that this has not been corrected - the corresponding > codepoint, when it exists, comes in the alphabetical order you describe > in the Buddhist Indic scripts. Assuming the current collations are > wrong, please raise a ticket at h

Re: Dotted Circle plus Combining Mark as Text

2013-10-20 Thread Richard Wordingham
On Mon, 21 Oct 2013 01:41:12 +0200 Philippe Verdy wrote: > Is there a standard way in Unicode to specify that one wants to display > a combining > mark in isolation? Unicode recommends U+00A0 NO-BREAK SPACE as the base character for combining marks to be displayed in isolation. I believe the seq
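
As a minimal sketch of the recommendation quoted above (the choice of Python and the helper name `isolated` are assumptions made for illustration, not something taken from the thread), a combining mark can be given U+00A0 as its base character:

```python
import unicodedata

NBSP = "\u00A0"  # NO-BREAK SPACE, the base character named in the message above

def isolated(mark: str) -> str:
    """Return NBSP followed by the mark; `isolated` is a hypothetical helper."""
    # General categories Mn, Mc and Me cover the combining marks.
    if len(mark) != 1 or not unicodedata.category(mark).startswith("M"):
        raise ValueError("expected a single combining mark")
    return NBSP + mark

# U+0E31 THAI CHARACTER MAI HAN-AKAT is the vowel sign discussed in this thread.
sample = isolated("\u0E31")
print([f"U+{ord(c):04X}" for c in sample])
```

This only builds the character sequence. Whether the result is drawn without an extra dotted circle still depends on the font and the layout engine, which is the point under discussion in the thread.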

Re: Dotted Circle plus Combining Mark as Text

2013-10-20 Thread Richard Wordingham
On Mon, 21 Oct 2013 10:24:41 +1100 Andrew Cunningham wrote: > I suspect it is a font issue, rather than a renderer issue, but then > using a dotted circle is a convention used in the Unicode charts and > in the Unicode spec. The extra dotted circles are a layout engine issue at least in part. On

Re: Definition of Logical Order

2013-10-20 Thread Richard Wordingham
On Mon, 21 Oct 2013 00:33:58 +0530 Pravin Jain wrote: I've taken the liberty of replying to the list. > One observation for Indic scripts. > U+0933 normally comes after U+0939 in the dictionary; except for this, all > other code points are properly ordered. > Similarly in the Gujarati block > U+0AB3
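
The ordering described in the quoted observation can be sketched as a sort key. This is only an illustration under stated assumptions: the function name `dictionary_key` is hypothetical, and real collation would normally be handled by a tailoring of the Unicode Collation Algorithm rather than by a hand-written key.

```python
LLA = 0x0933  # DEVANAGARI LETTER LLA
HA = 0x0939   # DEVANAGARI LETTER HA

def dictionary_key(word: str):
    """Code point order, except that U+0933 sorts just after U+0939.

    `dictionary_key` is a hypothetical name used for this sketch only.
    """
    return [HA + 0.5 if ord(ch) == LLA else ord(ch) for ch in word]

letters = ["\u0933", "\u0939", "\u0932", "\u0935"]
ordered = sorted(letters, key=dictionary_key)
print([f"U+{ord(c):04X}" for c in ordered])
```

Plain code point order would put U+0933 between U+0932 and U+0935; the key above moves it to the position after U+0939 that the message describes.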

Re: Dotted Circle plus Combining Mark as Text

2013-10-20 Thread Philippe Verdy
Is there a standard way in Unicode to specify that one wants to display a combining mark in isolation? Which leading character should be used (that should not display by itself but would force the separation from the previous context) which can also act as valid leader in combining sequences, and wo

Re: Dotted Circle plus Combining Mark as Text

2013-10-20 Thread Andrew Cunningham
I suspect it is a font issue, rather than a renderer issue, but then using a dotted circle is a convention used in the Unicode charts and in the Unicode spec. It is not a combination I'd expect a font developer from SE Asia to necessarily support. Since publications in SE Asia have their own typog

Re: Dotted Circle plus Combining Mark as Text

2013-10-20 Thread Philippe Verdy
This could be enforced in OpenType by making this mapping of U+25CC within a dedicated feature. In fact the feature could instead map the combining marks directly to the glyph ID showing the combining mark with its suitable dotted glyph. The renderer would then look up this feature directly for the

Re: Dotted Circle plus Combining Mark as Text

2013-10-20 Thread Asmus Freytag
On 10/20/2013 3:45 PM, Philippe Verdy wrote: 2013/10/20 Asmus Freytag > Incidentally, the dotted circle shown in the Unicode Code charts is *not* 25CC, and if I were to implement a "show dotted circle" feature in a program I would not use 25CC for thi

Re: Dotted Circle plus Combining Mark as Text

2013-10-20 Thread Philippe Verdy
2013/10/20 Asmus Freytag > Incidentally, the dotted circle shown in the Unicode Code charts is *not* > 25CC, and if I were to implement a "show dotted circle" feature in a > program I would not use 25CC for this - that character has a standard glyph > of rather unsuitable metrics for the purpose,

Re: Dotted Circle plus Combining Mark as Text

2013-10-20 Thread Asmus Freytag
On 10/20/2013 1:47 AM, Jukka K. Korpela wrote: 2013-10-20 2:38, Richard Wordingham wrote: Is a sequence of a U+25CC DOTTED CIRCLE plus a combining mark plain text? Well, is hello plain text? The answer is that any string of characters may be considered as plain text and any string of charac

Re: Dotted Circle plus Combining Mark as Text

2013-10-20 Thread Philippe Verdy
OK for ZWSP, I overlooked it because of its name. In fact I want to include all characters that have some whitespace property (to investigate), so that renderers would add them automatically in their list of suitable base characters for which they should NOT insert any dotted circle as the defaul

Re: Dotted Circle plus Combining Mark as Text

2013-10-20 Thread Richard Wordingham
On Sun, 20 Oct 2013 17:17:55 +0200 Philippe Verdy wrote: > 2013/10/20 Richard Wordingham > Interesting, so the list of "place holders" to support increases, we > have: > - whitespaces (including SP, NBSP, NNBSP, ZWSP, ideographic...) > - arabic joiners > - U+25CC (possibly also other geometric

Re: Dotted Circle plus Combining Mark as Text

2013-10-20 Thread Philippe Verdy
2013/10/20 Richard Wordingham > > What it means is a different issue. U+25CC is a symbol that can be > > used in a variety of meanings. I don’t think it means anything > > specific to most people, unless a definition is given. U+0E31 is a > > Thai vowel sign, and I don’t think any meaning in gen

Re: Dotted Circle plus Combining Mark as Text

2013-10-20 Thread Richard Wordingham
On Sun, 20 Oct 2013 11:47:23 +0300 "Jukka K. Korpela" wrote: > 2013-10-20 2:38, Richard Wordingham wrote: > > Is a sequence of a U+25CC DOTTED CIRCLE plus a combining mark plain > > text? > The answer is that any string of > characters may be considered as plain text and any string of > chara

Re: Dotted Circle plus Combining Mark as Text

2013-10-20 Thread Philippe Verdy
2013/10/20 Jukka K. Korpela > 2013-10-20 11:47, Jukka K. Korpela wrote: > > What you could do in a web page is to put U+00A0 U+25CC in one element >> and U+0E31 in another and position the elements in the same place, set >> to have the same width and to be horizontally centered. >> > > Oops. I m

Re: Dotted Circle plus Combining Mark as Text

2013-10-20 Thread Philippe Verdy
2013/10/20 Jukka K. Korpela > 2013-10-20 2:38, Richard Wordingham wrote: > > Is a sequence of a U+25CC DOTTED CIRCLE plus a combining mark plain >> text? >> > > Well, is hello plain text? The answer is that any string of > characters may be considered as plain text and any string of characters m

Re: Dotted Circle plus Combining Mark as Text

2013-10-20 Thread Jukka K. Korpela
2013-10-20 11:47, Jukka K. Korpela wrote: What you could do in a web page is to put U+00A0 U+25CC in one element and U+0E31 in another and position the elements in the same place, set to have the same width and to be horizontally centered. Oops. I meant U+25CC and U+00A0 U+0E31. But I’m afra

Re: Definition of Logical Order

2013-10-20 Thread Richard Wordingham
On Sun, 20 Oct 2013 00:21:08 +0100 Richard Wordingham wrote: > Am I correct in believing there is no formal definition of 'logical > order' for Indic scripts? Thank you to Vinod Kumar for taking the trouble to reply to me (off-list). On re-reading, I see that 'logical order' is defined in TUS 6

Re: Dotted Circle plus Combining Mark as Text

2013-10-20 Thread Jukka K. Korpela
2013-10-20 2:38, Richard Wordingham wrote: Is a sequence of a U+25CC DOTTED CIRCLE plus a combining mark plain text? Well, is hello plain text? The answer is that any string of characters may be considered as plain text and any string of characters may be treated as rich text according to so