> -----Original Message-----
> From: Harald Fuchs [mailto:[EMAIL PROTECTED]
> Sent: 30 September 2004 12:16
> To: [EMAIL PROTECTED]
> Subject: Re: Indexing problem with UTF8 in 4.1.4?
>
> In article <[EMAIL PROTECTED]>,
> Kevin Cowley <[EMAIL PROTECTED]> writes:
>
> > If I knew why I wouldn't be asking. Now by our reckoning the key of the
> > fields is 343 bytes, encoding in UTF8 makes that key 343 bytes, not 1000,
> > since under utf8 each character is encoded in 8 bits.
>
> What makes you think so? Under UTF8 each character is encoded in 8,
> 16, 24, or 32 bits - it depends on the character. So a conservative
> guess is that each UTF8 character needs 4 bytes.

Because we're using a Latin character set, which is encodable in 8 bits per character, or 16 bits for the Greek and Russian Cyrillic character sets. However, as posted by Jeremy March, MySQL automatically reserves 3 bytes for each UTF8 character in the key, which explains the error we're seeing.
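
For anyone who wants to check the arithmetic, here is a minimal Python sketch. It prints how many bytes a few sample characters take in UTF-8 and then works out the key size when 3 bytes are budgeted per character. The sample characters, the function names, the reading of 343 as a character count, and the 1000-byte limit (taken from the figure quoted earlier in the thread) are assumptions for illustration, not MySQL output.

```python
# Sketch: UTF-8 byte widths for sample characters, plus the key-size
# arithmetic from the thread. The 3-bytes-per-character budget and the
# 1000-byte limit are assumptions taken from the discussion.

SAMPLES = {
    "Latin": "a",
    "Greek": "α",
    "Cyrillic": "я",
    "Euro sign": "€",
}

KEY_CHARS = 343         # key length from the thread, read as characters
KEY_LIMIT_BYTES = 1000  # limit implied by the thread (assumption)


def utf8_width(ch: str) -> int:
    """Number of bytes one character occupies when encoded as UTF-8."""
    return len(ch.encode("utf-8"))


def reserved_key_bytes(key_chars: int, bytes_per_char: int = 3) -> int:
    """Worst-case key size if every character is budgeted bytes_per_char bytes."""
    return key_chars * bytes_per_char


if __name__ == "__main__":
    for name, ch in SAMPLES.items():
        print(f"{name}: {utf8_width(ch)} byte(s)")
    needed = reserved_key_bytes(KEY_CHARS)
    print(f"reserved: {needed} bytes, over limit: {needed > KEY_LIMIT_BYTES}")
```

So even though plain Latin text takes 1 byte per character and Greek or Cyrillic takes 2, a key budgeted at 3 bytes per character comes to 343 x 3 = 1029 bytes, which is over 1000.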