Markus Kuhn wrote:

> This property is in essence what makes the range notation in regular
> expressions useful, because by selecting a range of characters in some
> prefix of a string, you are guaranteed to select a consecutive sequence
> of strings in a sorted list. With multi-pass comparison algorithms like
> ISO, this property does not hold any more, and that is why many people
> (including myself) prefer to hold on to the old naive "C" strcmp()
> sorting order until we get a nicer single-pass sorting algorithm that is
> guaranteed to be a homomorphism over string concatenation.

Can't we have instead an alternative local available with single pass sorting
algorythm ?

Something like LC_COLLATE=en_GB.single_pass

where we would sort as :

ter
tes
t�r
t�s
tfs

instead of the multipass ISO sorting which would be :

ter
t�r
tes
t�s
tfs

-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/

Reply via email to