On 2014-09-11 20:26:12 -0700, Paul Eggert wrote:
> Vincent Lefevre wrote:
> 
> >ypig% LC_ALL=C locale charmap
> >ANSI_X3.4-1968
> 
> That may be what the 'locale' command says, but bytes with the top bit on
> are considered to be valid single-byte characters.  There are no encoding
> errors.  So, in that sense it's not strict ASCII.

Glibc regards it as ASCII:

$ printf '\xe8' | LC_ALL=C iconv
iconv: illegal input sequence at position 0

> >the current behavior breaks the sometimes used "grep ." solution
> >to match non-empty lines.
> 
> "grep ." matches lines containing one or more characters.  Encoding errors
> are not characters, at least, not as far as plain grep is concerned.

I just mean that "grep ." is a method given by some people, that
was working before UTF-8.

-- 
Vincent Lefèvre <vinc...@vinc17.net> - Web: <https://www.vinc17.net/>
100% accessible validated (X)HTML - Blog: <https://www.vinc17.net/blog/>
Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon)


-- 
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org

Reply via email to