So ............... I was looking at 
http://unicode.org/cldr/utility/regex.jsp?a=%5Cp%7Bscript%3DHan%7D&b=? and 
getting a cool looking Modified Regex Pattern. The last range ??-?? is CJK 
Compatibility Ideographs Supplement U+2F800-2FA1D.


[?-??-??-????-??-??-??-??-??-???-????-????-????-????-??]


So ....... then ....... I decided to copy/paste the above Modified Regex 
Pattern into Richard Ishida's Uniview http://r12a.github.io/uniview/


So ........ I then noticed that ?? U+2F800 was listed as ? U+4E3D  [CJK Unified 
Ideographs]


Thus the decomposition/compatibility mapping U+4E3D was being substituted for 
the original U+2F800.


I was using Safari on OS X Yosemite. I repeated the above with Chrome and 
Firefox and there was no problem, no substitution occurred. Thus it appears to 
be a copy/paste problem with Safari or code used by Safari.


I could have so easily missed this problem. I wonder if there are similar 
decomposition/compatibility mapping issues.


André Schappo

Reply via email to