Il 03/03/2014 07:13, Paul Eggert ha scritto:
Norihiro Tanaka wrote:
However I don't understand why the optimization isn't completed on
non-UTF8 locale only.  Can you explain it?

Sorry, no; there's a lot about that code I don't yet understand.

IIRC it's because a CSET matches any byte, while the corresponding MBCSET only matches that byte if it is a single-byte character. So for example, say "\x83A" is a two-byte character. The CSET "A" will match it but the corresponding MBCSET will not.

This can happen in the Shift-JIS encoding.

Paolo




Reply via email to