On Oct 23, 2019, at 18:59, Christopher Barker <python...@gmail.com> wrote: > > Since I'm doing this, the three that aren't are: > > U+180E MONGOLIAN VOWEL SEPARATOR > U+200B ZERO WIDTH SPACE > U+FEFF ZERO WIDTH NO-BREAK SPACE > > The Mongolian vowel separator makes some sense (not knowing Mongolian in the > least). Though I wonder what the point of a zero-width space is if it's NOT > going to be a separator?
It’s a Cf (formatting character), because it’s not used for spacing, it’s used for controlling higher-level formatting like soft line breaks. Or, put another way, it’s a bit more like a soft hyphen than it is like a space. It’s a weird distinction, but not as weird as, say, U+2028 and U+2029, which are also used for controlling formatting but literally have “separator” in their name, so they ended up creating a special category for each one so they can be Z but not Zs. Anyway, some of the answers the Unicode committee came up with are odd, but they’re the right answers by definition. Plus, even if I had a time machine and an unlimited life span, I’m pretty sure I wouldn’t want to participate in those arguments. _______________________________________________ Python-ideas mailing list -- python-ideas@python.org To unsubscribe send an email to python-ideas-le...@python.org https://mail.python.org/mailman3/lists/python-ideas.python.org/ Message archived at https://mail.python.org/archives/list/python-ideas@python.org/message/YNONA2X63SZSOVDGEELO3DJONSDXC7CY/ Code of Conduct: http://python.org/psf/codeofconduct/