On 10/23/19 11:27 PM, Andrew Barnert via Python-ideas wrote: > On Oct 23, 2019, at 18:59, Christopher Barker <python...@gmail.com> wrote: >> Since I'm doing this, the three that aren't are: >> >> U+180E MONGOLIAN VOWEL SEPARATOR >> U+200B ZERO WIDTH SPACE >> U+FEFF ZERO WIDTH NO-BREAK SPACE >> >> The Mongolian vowel separator makes some sense (not knowing Mongolian in the >> least). Though I wonder what the point of a zero-width space is if it's NOT >> going to be a separator? > It’s a Cf (formatting character), because it’s not used for spacing, it’s > used for controlling higher-level formatting like soft line breaks. Or, put > another way, it’s a bit more like a soft hyphen than it is like a space. It’s > a weird distinction, but not as weird as, say, U+2028 and U+2029, which are > also used for controlling formatting but literally have “separator” in their > name, so they ended up creating a special category for each one so they can > be Z but not Zs. > > Anyway, some of the answers the Unicode committee came up with are odd, but > they’re the right answers by definition. Plus, even if I had a time machine > and an unlimited life span, I’m pretty sure I wouldn’t want to participate in > those arguments.
My understanding was that the ZWNBS was provide a way to logically separate two code-points that would otherwise like to bind together for meaning. -- Richard Damon _______________________________________________ Python-ideas mailing list -- python-ideas@python.org To unsubscribe send an email to python-ideas-le...@python.org https://mail.python.org/mailman3/lists/python-ideas.python.org/ Message archived at https://mail.python.org/archives/list/python-ideas@python.org/message/JYNEFYFDWZQR5WUEAP6EJK52ZSHRT4HL/ Code of Conduct: http://python.org/psf/codeofconduct/