Christian Heimes added the comment:

There is no ligature for "lff", just "ffl". Ligatures are treated as one char. 
I guess Python would have to grow a str.reverse() method to handle ligatures 
and combining chars correctly.

At work I ran into the issue with ligatures and combining chars multiple times 
in medieval and early modern age scripts. Eventually I started to normalize all 
incoming data to NFKC. That solves most of the issues.

s = b'ba\xef\xac\x84e'.decode('utf-8')
>>> print("".join(reversed(s)))
efflab
>>> print("".join(reversed(unicodedata.normalize("NFKC", s))))
elffab

----------
nosy: +christian.heimes

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue19819>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to