10FFFF decimal value is 1114111. But, some chinese characters are greater than this value. Is it correct character(10FFFF) to replace with z?
Please correct me if I am doing wrong. On Tue, Feb 26, 2013 at 5:58 PM, Igor Tandetnik <i...@tandetnik.org> wrote: > On 2/26/2013 8:31 AM, Clemens Ladisch wrote: > >> Igor Tandetnik wrote:> On 2/26/2013 2:39 AM, dd wrote: >> >>> SELECT * FROM emp WHERE column_test BETWEEN "somedata/" AND >>>> "somedata/zzz" >>>> >>>> This database has unicode strings(chinese/japanese/...**etc strings). >>>> can >>>> you tell me which is the correct character to replace with z? >>>> >>> >>> U+FFFF, of course. >>> >> >> Unicode characters can have more than 16 bits, of course. >> > > ... but SQLite orders them with simple memcmp (absent a custom collation), > so 0xFFFF will still compare greater than any surrogate pair. > > If the database file uses UTF-8 encoding, and contains supplemental > characters, then yes, a UTF-8 representation of U+10FFFF would be prudent. > -- > Igor Tandetnik > > > ______________________________**_________________ > sqlite-users mailing list > sqlite-users@sqlite.org > http://sqlite.org:8080/cgi-**bin/mailman/listinfo/sqlite-**users<http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users> > _______________________________________________ sqlite-users mailing list sqlite-users@sqlite.org http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users