Re: NFD -> NFC

2014-03-11 Thread Markus Scherer
Here is the demo using ICU4C:

http://demo.icu-project.org/icu-bin/nbrowser?t=%5Cu30B9%5Cu3099&s=&uv=0

markus
___
Unicode mailing list
Unicode@unicode.org
http://unicode.org/mailman/listinfo/unicode


Re: NFD -> NFC

2014-03-11 Thread Mark Davis ☕
Not sure about your exact case, but ICU's normalization does handle those
characters.

http://unicode.org/cldr/utility/transform.jsp?a=nfc%3Bhex&b=%5Cu30B9%5Cu3099

(That tool uses ICU for NFC).


Mark 

*— Il meglio è l’inimico del bene —*


On Tue, Mar 11, 2014 at 4:50 PM, Markus Doppelbauer wrote:

> Hello,
>
> I have an other problem making the normalization process binary
> compatible with ICU.
>  Why does "30B9 3099" not combine to "30BA"?
>
> Steps to reproduce:
>  wget http://doppelbauer.name/katakana.txt
> uconv -f utf8 -t utf8 -x nfd ndf.txt
> uconv -f utf8 -t utf8 -x nfc nfc.txt
> diff katakana.txt nfc.txt
>
>  Expected result: "katakana.txt" == "nfc.txt"
>
> uconv v2.1  ICU 4.8.1.1
>
> Thanks a lot
> Markus
>
>
>
> ___
> Unicode mailing list
> Unicode@unicode.org
> http://unicode.org/mailman/listinfo/unicode
>
>
___
Unicode mailing list
Unicode@unicode.org
http://unicode.org/mailman/listinfo/unicode


NFD -> NFC

2014-03-11 Thread Markus Doppelbauer
Hello,

 

I have an other problem making the normalization process binary

compatible with ICU.


Why does "30B9 3099" not combine to "30BA"?

 

Steps to reproduce:


wget http://doppelbauer.name/katakana.txt
uconv -f utf8 -t utf8 -x nfd ndf.txt
uconv -f utf8 -t utf8 -x nfc nfc.txt
diff katakana.txt nfc.txt

 


Expected result: "katakana.txt" == "nfc.txt"

 

uconv v2.1  ICU 4.8.1.1

 

Thanks a lot

Markus

 


 
___
Unicode mailing list
Unicode@unicode.org
http://unicode.org/mailman/listinfo/unicode