Andrew Benton wrote:
Another application I'm keen on is the file sharing application gtk-gnutella. Filenames are an issue there as people on the network could be in any locale. There was a long thread on the gtk-gnutella-devel mailing list about language issues. I read it, but I don't pretend to understand it all. If you have time, could you have a look and see what you think? http://sourceforge.net/mailarchive/forum.php?thread_id=8318817&forum_id=5942

Exactly the same issue/bug. The only known working eDonkey client is
Windows-based crapware running under Wine (and I don't use any other
filesharing networks).

They should have looked how SAMBA solves this, when talking with
WinNT/2K/XP clients. UNICODE is used on the wire. Two configurations are
possible, depending on the "unix charset" parameter in the config file:

unix charset = (locale encoding): filenames are shown correctly by "ls",
in Midnight Commander, Mutt and all other console programs. But invalid
characters are being replaced with underscores, so after storing a file
a client can receive it with a changed name. But that's extremely rare
as Windows users can't type such characters anyway if they configured
the same default regional settings.

unix charset = utf-8: filenames on disk are garbage from the viewpoint
of console apps. But no riund-trip loss occurs.

Admins generally prefer either the first variant or (if the distro is RedHat or SuSE) use UTF-8 locales where this problem doesn't exist. But in UTF-8 locales the maximum path length in characters is too short for Russians.

Also, the following words are 100% correct for ru_RU.KOI8-R if one replaces "file manager" with "ls command":

So russian filenames will be more important to them than german filenames. What would be more convenient for them? Store all names in koi8-r, losing one foreign "รค", but still be able to correctly see all their downloaded russian files in their file manager? Or store all names in utf-8, technically not losing any information but losing the ability to read these filenames, especially their preferred russian filenames? I bet they won't choose the latter. Why not at least give the option?

That's why we set G_FILENAME_ENCODING="@locale" in BLFS. Not obeying to this variable in a glib2-based app is a bug.

Of course the problem doesn't exist in ru_RU.UTF-8, but other problems appear in that locale.

--
Alexander E. Patrakov

--
http://linuxfromscratch.org/mailman/listinfo/blfs-dev
FAQ: http://www.linuxfromscratch.org/blfs/faq.html
Unsubscribe: See the above information page

Reply via email to