Alex J. Dam wrote:

>   $ echo 'ABÇ' | tr [:upper:] [:lower:]
>   gives me
>   abÇ
>   (the last character is an uppercase cedilla)
>   I expecte its output to be:
>   abç
>
> Am I doing something wrong?

No, your expectations match what POSIX specifies.

> Is tr (version 2.1) broken?

Yes, and even the i18n patches from IBM
http://oss.software.ibm.com/developer/opensource/linux/patches/?patch_id=24
contain no fix for it.

> It happens with sed, too.

$ echo 'ABÇ' | sed -e 's,\(.*\),\L\1\E,'
abÇ

Yes this seems like a bug in GNU sed 4.0.3.

I'm CCing bug-coreutils and the sed maintainer, so the maintainers can do
something about it.

Bruno

--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/

Reply via email to