On 8/4/06, Cory Nelson <[EMAIL PROTECTED]> wrote:
On 8/4/06, Trevor Talbot <[EMAIL PROTECTED]> wrote:
> On 8/4/06, Cory Nelson <[EMAIL PROTECTED]> wrote:
>
> > But, since you brought it up - I have no expectations of SQLite
> > integrating a full Unicode locale library, however it would be a great
> > improvement if it would respect the current locale and use wcs*
> > functions when available, or at least order by standard Unicode order
> > instead of completely mangling things on UTF-8 codes.
> What do you mean by "standard Unicode order" in this context?
Convert UTF-8 to UTF-16 (or both to UCS-4 if you want to be entirely correct) while sorting, to at least make them follow the same pattern.
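Ah, so Unicode codepoint order. Unfortunately, converting to UTF-16 won't get you that: UTF-8 and UTF-32/UCS-4 both sort naturally in codepoint order (UTF-8 because of its MSB-first format), but UTF-16 doesn't, because of the way surrogate pairs are constructed. Surrogates occupy U+D800..U+DFFF, so anything outside the BMP sorts ahead of U+E000..U+FFFF when you compare 16-bit units. UTF-16 is actually the oddball here :P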
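To make that concrete, here is a minimal Python sketch (not anything SQLite does internally; the helper name utf16_units and the two sample characters are my own picks for illustration). It sorts the same pair of strings by raw UTF-8 bytes, by UTF-32 bytes, and by UTF-16 code units, and compares each against plain codepoint order.

```python
# Illustrative sample values (assumptions, not from the thread):
# U+FF5E sits in the BMP above the surrogate range; U+10000 is the first
# code point that needs a surrogate pair in UTF-16.
a = "\uff5e"
b = "\U00010000"


def utf16_units(s):
    """Return the string as a list of 16-bit UTF-16 code units (hypothetical helper)."""
    data = s.encode("utf-16-be")
    return [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]


# Reference ordering: compare by code point values.
codepoint_order = sorted([b, a], key=lambda s: [ord(c) for c in s])

# Binary comparison of the encoded forms.
utf8_order = sorted([b, a], key=lambda s: s.encode("utf-8"))
utf32_order = sorted([b, a], key=lambda s: s.encode("utf-32-be"))
utf16_order = sorted([b, a], key=utf16_units)

print("utf-8 matches codepoint order: ", utf8_order == codepoint_order)
print("utf-32 matches codepoint order:", utf32_order == codepoint_order)
print("utf-16 matches codepoint order:", utf16_order == codepoint_order)
```

U+10000 encodes as the pair D800 DC00 in UTF-16, and D800 is less than FF5E, so the UTF-16 comparison puts the higher code point first. The UTF-8 lead bytes are EF and F0 respectively, which keeps them in codepoint order.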