On Sat, Feb 23, 2002 at 07:37:41PM +0000, Markus Kuhn wrote:
> You have two files
> 
>   Müller
>   Müllerin
> 
> in a directory, the first in NFD, the second in NFC. If you press M+TAB
> in a yet to be written UTF-8 aware version of bash, it will fail to
> expand to Müller, as the two strings differ after the first letter.
> Typing Mu+TAB will expand one, and typing Mü+TAB will expand the other,
> so there is a solution for experienced users.  A user interface
> inprovement would be to provide two control keys that allows to scroll
> through the list of files that are available in the current state of the
> TAB selection. I could also imagine bash doing a normalization, such
> that entering a prefix in one normalization will include the file name
> in the other one as well. There are lots of ways to implement this in a

But nothing with tab completion can fix the fact that I can't enter this
by typing "Müller", and I don't see a good fix at all for that.

This isn't quite the same as the homoglyph problem.  (Someone pointed
out that the c in "cat" could actually be a few different characters--but
I can tell from the context that it *should* be a regular Roman "c".  If
it's not, it's probably someone playing games deliberately.)  Either should
still be copy-and-pastable already, but it'd be nice if that fallback
could be avoided.

I suspect that, like homoglyphs, there is no good fix, but I think this
should be considered independently.  (Since it's still copy-and-pastable, 
it's not as much of an issue to me as unprinting characters and other
things that aren't.)

> convenient way, and the only real problem is to get the bash maintainers
> interested in UTF-8 at all ...

*Is* there any bash maintainance?

-- 
Glenn Maynard
--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/

Reply via email to