>...
> mysql> SELECT col2 FROM test ORDER BY col2 COLLATE utf8_swedish_ci;
> +------------+
> | col2       |
> +------------+
> | M(u-diaresis)ller  |
> | Muffler    |
> | MX Systems |
> | MySQL      |
> +------------+
> ...

> I expect M(u-diaeresis)ller to sort after MX Systems in the following:
> ...
> I have tried various UTF8 collations and, apart from utf8_bin, they all
> place M(u-diaresis)ller at the start.
> ...

Is this for Swedish language data?  I don't know Swedish so I don't
actually know where u-diaeresis is sorted in Swedish myself, but
according to the source code (in the file: strings/ctype-uca.c) the
u-diaeresis is sorted as an equivalent of "y" in utf8_swedish_ci.

The unicode codepoint for u-diaeresis is 0x00FC and the capital
U-diaeresis is 0x00DC.

I just tested this with 4.1.4 (from the bk tree) and it worked correctly
for me.  My keyboard isn't setup to enter u-diaeresis easily so I
entered it in hex.  Try this:

CREATE TABLE swedish (col char(20) COLLATE utf8_swedish_ci);

INSERT INTO swedish VALUES (CONVERT(_ucs2 0x004D00FC006C006C00650072
USING utf8)), ('MySQL'), ('Muffler'), ('MX Systems');

SELECT * FROM swedish ORDER BY col;
+------------+
| col        |
+------------+
| Muffler    |
| MX Systems |
| MÃller    |
| MySQL      |
+------------+
4 rows in set (0.00 sec)



-- 
MySQL General Mailing List
For list archives: http://lists.mysql.com/mysql
To unsubscribe:    http://lists.mysql.com/[EMAIL PROTECTED]

Reply via email to