Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-10 Thread Ryan Chan
Hi Domas, On Thu, Jun 10, 2010 at 2:48 PM, Domas Mituzas wrote: > varbinary() otoh is varchar with 'binary' character set (if you define > default server charset to be binary, as we do on our 5.x boxes, all varchar > creation will be varbinary). > This back to my original question: http://svn

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-10 Thread Gerard Meijssen
Domas. Yes it is a technical pain in the arse.The question is one of primacy. Is it more important to provide service or are technical considerations of the most importance. Yes, we discussed this in the past and we did not agree then and we do not agree now. I wonder how our English language read

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-09 Thread Domas Mituzas
Hello, On Jun 8, 2010, at 11:22 PM, Gerard Meijssen wrote: > The difference is that is actually does sort according to the CLDR.. It > would be really nice if we did that. It does not, it sorts according to the partial UCA implementation. We have discussed CLDR in the past - it is a huge colle

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-08 Thread Gerard Meijssen
Hoi, The difference is that is actually does sort according to the CLDR.. It would be really nice if we did that. Thanks, GerardM On 8 June 2010 21:38, Aryeh Gregor > wrote: > On Tue, Jun 8, 2010 at 10:28 AM, Paul Houle wrote: > > As a person who has labored mightily to make sense of

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-08 Thread Aryeh Gregor
On Tue, Jun 8, 2010 at 10:28 AM, Paul Houle wrote: >     As a person who has labored mightily to make sense of dbpedia,  I > think that one reason why varbinary is preferable to varchar in many > applications in wikimedia is that varchar() string comparisons are case > insensitive and varbinary co

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-08 Thread Paul Houle
As a person who has labored mightily to make sense of dbpedia, I think that one reason why varbinary is preferable to varchar in many applications in wikimedia is that varchar() string comparisons are case insensitive and varbinary comparisons are case sensitive. There are 10,000 or s

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-08 Thread Ryan Chan
Hi, On Tue, Jun 8, 2010 at 2:35 AM, Aryeh Gregor wrote: > Yes, but the binary encoding doesn't use MySQL's UTF-8 support.  It > stores the UTF-8 as binary data, so it can store non-BMP characters > even though MySQL's utf8 encoding can't.  MySQL 5.5 is planned to add > a utf8mb4 encoding that wi

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-07 Thread Aryeh Gregor
On Mon, Jun 7, 2010 at 11:17 AM, Ryan Chan wrote: > It seems 5.0 and 5.1 support for UTF-8 are the same > > http://dev.mysql.com/doc/refman/5.0/en/charset-unicode.html > http://dev.mysql.com/doc/refman/5.1/en/charset-unicode.html Yes, but the binary encoding doesn't use MySQL's UTF-8 support. It

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-07 Thread Gerard Meijssen
Hoi, Do you mean to say the way MySQL supports Unicode is the same ie the way it is stored? Thanks, GerardM On 7 June 2010 17:17, Ryan Chan wrote: > Hi, > > On Mon, Jun 7, 2010 at 1:36 AM, Platonides wrote: > > No. The BMP problem is only present with the "MySQL 4.1/5.0 UTF-8" > option.

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-07 Thread Ryan Chan
Hi, On Mon, Jun 7, 2010 at 1:36 AM, Platonides wrote: > No. The BMP problem is only present with the "MySQL 4.1/5.0 UTF-8" option. It seems 5.0 and 5.1 support for UTF-8 are the same http://dev.mysql.com/doc/refman/5.0/en/charset-unicode.html http://dev.mysql.com/doc/refman/5.1/en/charset-unico

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-06 Thread Platonides
Ryan Chan wrote: > Hello, > > On Sun, Jun 6, 2010 at 11:12 PM, Platonides wrote: >> MediaWiki supports both ways. Wikipedia still uses the mysql 4 >> compatible options, and since mysql chars only support the bmp, it isn't >> likely to change. >> It all depends on what you choose on install. Curr

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-06 Thread Ryan Chan
Hello, On Sun, Jun 6, 2010 at 11:12 PM, Platonides wrote: > MediaWiki supports both ways. Wikipedia still uses the mysql 4 > compatible options, and since mysql chars only support the bmp, it isn't > likely to change. > It all depends on what you choose on install. Currently, the preselected > ch

Re: [Wikitech-l] varchar(255) binary in tables.sql

2010-06-06 Thread Platonides
Ryan Chan wrote: > Hello all, > > I remember in old days, UTF-8 string are stored as varbinary, are > there reason to change to varchar(255) binary? > > Also, what is the default server/connection/client character set settings now? > > Thanks. MediaWiki supports both ways. Wikipedia still uses

[Wikitech-l] varchar(255) binary in tables.sql

2010-06-05 Thread Ryan Chan
Hello all, I remember in old days, UTF-8 string are stored as varbinary, are there reason to change to varchar(255) binary? Also, what is the default server/connection/client character set settings now? Thanks. ___ Wikitech-l mailing list Wikitech-l@l