Hi Bruno,

Today at 17:24, Bruno Haible wrote:

> This will mess up users who have their LC_CTYPE set to a non-UTF-8 encoding.
> It is weird if a user, in an application, enters a new file name "Süß",
> and then in a terminal, the filename appears as "Süà " (wow, it even
> hangs my xterm!).

Oh, indeed.  But what about user deciding to change LC_CTYPE?  Or even
worse, what if administrator provides some dirs for the user in an
encoding different from the one user wants to use? 

Eg. imagine having a global "/Müsik" in ISO-8859-1, and user desires
to use UTF-8 or ISO-8859-5.  Now not only that it will be weird (and
possibly even hang your xterm!), you'd be in a mess if you try to fix it.

My point is that the filesystem encoding should be filesystem-wide
(not per-user), because that's the only way to warrant that it won't
break.  And in the sense of POSIX API, UTF-8 makes most sense as a
single, backwards compatible filesystem encoding (well, it wasn't
originally called "UTF-FS" for no reason :), which can work for
everybody.

> It is just as bad as those old Motif applications which assume that
> everything is ISO-8859-1. This makes these applications useless in UTF-8
> locales.

No, it's not.  UTF-8 can encode all characters, so you'd be able to
use whatever characters you wish, give or take a conversion step.
ISO-8859-1 limits you not only on "implementation details" step, but
also on features.

> In summary, I'd suggest
>   - that ALL application follow LC_ALL/LC_CTYPE/LANG, like POSIX specifies,
>   - that users switch to UTF-8 locale when they want.

That's not closer to ever solving the problem.  It's status quo.  I
think we should at least recommend improvements, if not require them
(and nobody suggested requiring them).

Basically, my recommendation was to set LC_CTYPE to UTF-8 on all new
systems.

Cheers,
Danilo

--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/

Reply via email to