# New Ticket Created by  Alex Jakimenko 
# Please include the string:  [perl #127048]
# in the subject line of all future correspondence about this issue. 
# <URL: https://rt.perl.org/Ticket/Display.html?id=127048 >


This is a continuation of
https://rt.perl.org/Public/Bug/Display.html?id=127047

>From http://unicode.org/reports/tr51/#Emoji_ZWJ_Sequences:

“The U+200D ZERO WIDTH JOINER (ZWJ) can be used between the elements of a
sequence of characters to indicate that a single glyph should be presented
if available.”

“So to the user, these would behave like single emoji characters, even
though internally they are sequences.”

It sounds like we shouldn't cut these sequences in half when doing .substr
(which in turn means that these should be treated as one grapheme).

There is a chart of possible combinations here
http://www.unicode.org/emoji/charts/emoji-zwj-sequences.html, but I think
that any sequence with U+200D ZERO WIDTH JOINER should probably result in
one grapheme. As crazy as it sounds…

Reply via email to