The source of the bug has been found. 
As for dash-0.5.4, in expand.c:240 there is a line
        rmescapes(p);
If one follows the macro, he or she will find that it just trashes chars 129 
and 136 ("\201\210" octal).
In UTF-8 representation of letters сшЁ (two bytes per a letter) the second byte 
is just from the set:
$ echo сшЁ|hexdump -b
0000000 321 201 321 210 320 201 012                                    
0000007
That's what causes the bug. The bug is UTF-8 specific, if KOI-8 was used for 
Cyrillics (as it is in Debian), there would be no such bug.

-- 
dash interpreter don't handle some unicode characters correctly
https://bugs.launchpad.net/bugs/422298
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to