On 2014-09-11 20:26:12 -0700, Paul Eggert wrote: > Vincent Lefevre wrote: > > >ypig% LC_ALL=C locale charmap > >ANSI_X3.4-1968 > > That may be what the 'locale' command says, but bytes with the top bit on > are considered to be valid single-byte characters. There are no encoding > errors. So, in that sense it's not strict ASCII.
Glibc regards it as ASCII: $ printf '\xe8' | LC_ALL=C iconv iconv: illegal input sequence at position 0 > >the current behavior breaks the sometimes used "grep ." solution > >to match non-empty lines. > > "grep ." matches lines containing one or more characters. Encoding errors > are not characters, at least, not as far as plain grep is concerned. I just mean that "grep ." is a method given by some people, that was working before UTF-8. -- Vincent Lefèvre <vinc...@vinc17.net> - Web: <https://www.vinc17.net/> 100% accessible validated (X)HTML - Blog: <https://www.vinc17.net/blog/> Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon) -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org