Aldo Cortesi <[EMAIL PROTECTED]> wrote: > [ ... ] There is no general way to detect homoglyphs and "convert them to >a normal form". Observe: > >import unicodedata >print repr(unicodedata.normalize("NFC", u"\u2160")) >print u"\u2160" >print "I"
FYI, those come out as two very clearly distinct glyphs in the default terminal font I have here. (The ROMAN NUMERAL ONE has no cross-bars, and is more likely to be confused with "|".) -- \S -- [EMAIL PROTECTED] -- http://www.chaos.org.uk/~sion/ "Frankly I have no feelings towards penguins one way or the other" -- Arthur C. Clarke her nu becomeþ se bera eadward ofdun hlæddre heafdes bæce bump bump bump
-- http://mail.python.org/mailman/listinfo/python-list