> The fullwidth comma character U+FF0C is properly handled in UTF-8
> encoding. However, this is not true for GBK since this encoding
> uses the punctuation tables of the GB 2312 character set. The same
> holds for Big5+ (which uses the Big5 punctuation tables).
>
> It should be straightforward to fix this for GBK (and Big5+) by
> providing complete punctuation tables in CJK.enc. Any volunteers?
> Just look at the postpunct and prepunct values for UTF-8 encoding
> and map those values to GBK code points...
I've now added punctuation tables for GBK in the git repository. (I'm
too lazy to do that for Big5+ -- are there still users of this
encoding?)
Please test.
Werner
_______________________________________________
Cjk maillist - [email protected]
https://lists.ffii.org/mailman/listinfo/cjk