Hello,
I have an other problem making the normalization process binary
compatible with ICU.
Why does "30B9 3099" not combine to "30BA"?
Steps to reproduce:
wget http://doppelbauer.name/katakana.txt
uconv -f utf8 -t utf8 -x nfd <katakana.txt >ndf.txt
uconv -f utf8 -t utf8 -x nfc <ndf.txt >nfc.txt
diff katakana.txt nfc.txt
uconv -f utf8 -t utf8 -x nfd <katakana.txt >ndf.txt
uconv -f utf8 -t utf8 -x nfc <ndf.txt >nfc.txt
diff katakana.txt nfc.txt
Expected result: "katakana.txt" == "nfc.txt"
uconv v2.1 ICU 4.8.1.1
Thanks a lot
Markus
_______________________________________________ Unicode mailing list Unicode@unicode.org http://unicode.org/mailman/listinfo/unicode