Re: Another UAX #29 bug: property tables need updating

2016-12-23 Thread Mark Davis ☕️
Also, under http://unicode.org/reports/tr29/#Conformance see the following.
The wording could be stronger: the CLDR customizations are strongly
recommended.

   - Some changes to rules and data are needed for best segmentation
   behavior of additional emoji zwj sequences [UTR51], prior to the
   eventual publication of Unicode 10.0. Such changes are planned for
   inclusion in CLDR Version 30 [CLDR].


Mark

On Thu, Dec 22, 2016 at 11:20 PM, Manish Goregaokar wrote:

> Will do, thanks!
> -Manish
>
>
> On Thu, Dec 22, 2016 at 11:16 AM, Ken Whistler wrote:
> > Manish,
> >
> >
> > On 12/22/2016 10:35 AM, Manish Goregaokar wrote:
> >>
> >> The property table should include all role and gender modifiers as GAZ.
> >>
> >> Could this be updated?
> >>
> >
> > Property values cannot be updated for *published* versions of the standard.
> > What you should do is submit your feedback as part of the public review for
> > UAX #29 for version 10.0 of the standard. See:
> >
> > http://www.unicode.org/review/pri341/
> >
> > If you submit your feedback to UAX #29 (and its associated data files)
> > according to the directions there, that will ensure that it gets properly
> > considered during the review of UAX #29 at the next UTC meeting scheduled at
> > the end of January.
> >
> > --Ken
> >
> > P.S. In general, any feedback on property values in the UCD needs to be
> > handled that way, to make sure they get appropriate consideration by the
> > UTC.
> >
>


Re: Another UAX #29 bug: property tables need updating

2016-12-22 Thread Manish Goregaokar
Will do, thanks!
-Manish


On Thu, Dec 22, 2016 at 11:16 AM, Ken Whistler  wrote:
> Manish,
>
>
> On 12/22/2016 10:35 AM, Manish Goregaokar wrote:
>>
>> The property table should include all role and gender modifiers as GAZ.
>>
>> Could this be updated?
>>
>
> Property values cannot be updated for *published* versions of the standard.
> What you should do is submit your feedback as part of the public review for
> UAX #29 for version 10.0 of the standard. See:
>
> http://www.unicode.org/review/pri341/
>
> If you submit your feedback to UAX #29 (and its associated data files)
> according to the directions there, that will ensure that it gets properly
> considered during the review of UAX #29 at the next UTC meeting scheduled at
> the end of January.
>
> --Ken
>
> P.S. In general, any feedback on property values in the UCD needs to be
> handled that way, to make sure they get appropriate consideration by the
> UTC.
>


Re: Another UAX #29 bug: property tables need updating

2016-12-22 Thread Ken Whistler

Manish,


On 12/22/2016 10:35 AM, Manish Goregaokar wrote:

> The property table should include all role and gender modifiers as GAZ.
>
> Could this be updated?



Property values cannot be updated for *published* versions of the 
standard. What you should do is submit your feedback as part of the 
public review for UAX #29 for version 10.0 of the standard. See:


http://www.unicode.org/review/pri341/

If you submit your feedback to UAX #29 (and its associated data files) 
according to the directions there, that will ensure that it gets 
properly considered during the review of UAX #29 at the next UTC meeting 
scheduled at the end of January.


--Ken

P.S. In general, any feedback on property values in the UCD needs to be
handled that way, to make sure they get appropriate consideration by the 
UTC.




Another UAX #29 bug: property tables need updating

2016-12-22 Thread Manish Goregaokar
The spec lists GraphemeBreakProperty.txt[1] and
WordBreakProperty.txt[2] as the normative source for grapheme and word
categorization respectively.

However, the spec also gives non-normative definitions of these
properties. In particular, it defines Glue_After_Zwj[3] as

> Emoji characters that do not break from a previous ZWJ in a defined emoji zwj 
> sequence, and are not listed as Emoji_Modifier_Base=Yes in emoji-data.txt. 
> See [UTR51].

Going through emoji-zwj-sequences.txt[4], there are a lot of emoji
characters that satisfy this property. The kiss/heart emojis are like
this, as well as every object emoji in the "Gendered Role, with
object" section. However, the property table lists only the kiss, heart,
and speech bubble emoji as GAZ.
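
As a rough illustration of the check described above, the following Python
sketch collects every code point that directly follows a ZWJ (U+200D) in a
list of sequences and drops those treated as Emoji_Modifier_Base. The
function name, the sample lines, the MODIFIER_BASES subset, and the
LISTED_GAZ set are all assumptions made for this example: the sample lines
are hand-written in a general "code points ; type" shape rather than copied
from emoji-zwj-sequences.txt, and LISTED_GAZ simply encodes the three emoji
named above. A real check would read the actual data files.

```python
ZWJ = 0x200D

def glue_after_zwj_candidates(lines, modifier_bases):
    """Collect code points that directly follow a ZWJ in any listed sequence,
    skipping those in modifier_bases (hypothetical helper, not a real API)."""
    candidates = set()
    for line in lines:
        data = line.split("#", 1)[0]            # drop any trailing comment
        field = data.split(";", 1)[0].strip()   # first field: the code points
        if not field:
            continue                            # blank or comment-only line
        cps = [int(tok, 16) for tok in field.split()]
        for prev, cur in zip(cps, cps[1:]):
            if prev == ZWJ and cur not in modifier_bases:
                candidates.add(cur)
    return candidates

# Hand-written sample lines (assumed layout, illustrative only).
SAMPLE_LINES = [
    "# sample data",
    "1F468 200D 2764 FE0F 200D 1F48B 200D 1F468 ; Emoji_ZWJ_Sequence  # kiss",
    "1F441 200D 1F5E8 ; Emoji_ZWJ_Sequence  # eye in speech bubble",
    "1F468 200D 1F33E ; Emoji_ZWJ_Sequence  # man farmer",
    "1F469 200D 2695 FE0F ; Emoji_ZWJ_Sequence  # woman health worker",
    "1F3C3 200D 2640 FE0F ; Emoji_ZWJ_Sequence  # woman running",
]

# Assumed subset of Emoji_Modifier_Base=Yes code points (man, woman).
MODIFIER_BASES = {0x1F468, 0x1F469}

# The three emoji the message says are listed as GAZ: heart, kiss, speech bubble.
LISTED_GAZ = {0x2764, 0x1F48B, 0x1F5E8}

candidates = glue_after_zwj_candidates(SAMPLE_LINES, MODIFIER_BASES)
missing = candidates - LISTED_GAZ
print([f"U+{cp:04X}" for cp in sorted(missing)])
```

With these samples, `missing` holds U+1F33E, U+2695, and U+2640: code
points that follow a ZWJ in a sequence but are not among the three listed
emoji.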

The property table should include all role and gender modifiers as GAZ.

Could this be updated?

 [1]: http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/GraphemeBreakProperty.txt
 [2]: http://www.unicode.org/Public/UCD/latest/ucd/auxiliary/WordBreakProperty.txt
 [3]: http://www.unicode.org/reports/tr29/proposed.html#Glue_After_Zwj
 [4]: http://unicode.org/Public/emoji/4.0/emoji-zwj-sequences.txt

Thanks,
-Manish