Re: CONCAT() returns not correct character set
Hi,

Is there any difference between _binary'Binary' and
CONVERT('Binary' USING binary)?

mysql> SELECT CHARSET(CONCAT(_binary'Bianry',CONVERT('abc' USING latin1)));
+--------------------------------------------------------------+
| CHARSET(CONCAT(_binary'Bianry',CONVERT('abc' USING latin1))) |
+--------------------------------------------------------------+
| latin1                                                       |
+--------------------------------------------------------------+
1 row in set (0.00 sec)

mysql> SELECT CHARSET(CONCAT(CONVERT('Bianry' USING binary),CONVERT('abc' USING latin1)));
+-----------------------------------------------------------------------------+
| CHARSET(CONCAT(CONVERT('Bianry' USING binary),CONVERT('abc' USING latin1))) |
+-----------------------------------------------------------------------------+
| binary                                                                      |
+-----------------------------------------------------------------------------+
1 row in set (0.00 sec)

In Japan, we have to use many character sets:

  eucjpms, ujis   for Unix
  cp932, sjis     for Windows
  utf8            for Java and MySQL metadata

So it is a very complicated world.

> Hi there,
>
> I tried a few other queries:
>
> first, confirm that what you think is a binary is indeed a binary:
>
> mysql> SELECT CHARSET(_binary'Binary');
> +--------------------------+
> | CHARSET(_binary'Binary') |
> +--------------------------+
> | binary                   |
> +--------------------------+
> 1 row in set (0.00 sec)
>
> check the regular text (for completeness' sake)
>
> mysql> select charset ('binary');
> +--------------------+
> | charset ('binary') |
> +--------------------+
> | latin1             |
> +--------------------+
> 1 row in set (0.00 sec)
>
> OK, so concatenating the 2 should result in a binary according to
> the manual:
>
> mysql> SELECT CHARSET(CONCAT(_binary'Bianry',CONVERT('abc' USING latin1)));
> +--------------------------------------------------------------+
> | CHARSET(CONCAT(_binary'Bianry',CONVERT('abc' USING latin1))) |
> +--------------------------------------------------------------+
> | latin1                                                       |
> +--------------------------------------------------------------+
> 1 row in set (0.00 sec)
>
> Hrm, I got the same result you did. However,
>
> mysql> SELECT CHARSET(CONCAT(_binary'Binary','foo'));
> +------------------------------------------+
> | CHARSET(CONCAT((_binary'Binary'),'foo')) |
> +------------------------------------------+
> | binary                                   |
> +------------------------------------------+
> 1 row in set (0.00 sec)
>
> works just fine. Why are you converting the text to latin1? It's
> already there. That conversion seems to be messing things up.
>
> Perhaps someone can explain why the conversion messes things up --
> seems like a bug to me.
>
> -Sheeri
>
> On 2/26/06, Hirofumi Fujiwara [EMAIL PROTECTED] wrote:
> > Dear MySQL fans,
> >
> > I tested CONCAT() with binary strings and I got a strange result.
> >
> > The manual says:
> > http://dev.mysql.com/doc/refman/5.0/en/string-functions.html
> >
> >   If the arguments include any binary strings, the result is a
> >   binary string.
> >
> > But the following test says: binary + latin1 -> latin1 (not binary)
> >
> > mysql> SELECT CHARSET(CONCAT(_binary'Bianry',CONVERT('abc' USING latin1)));
> > +--------------------------------------------------------------+
> > | CHARSET(CONCAT(_binary'Bianry',CONVERT('abc' USING latin1))) |
> > +--------------------------------------------------------------+
> > | latin1                                                       |
> > +--------------------------------------------------------------+
> > 1 row in set (0.00 sec)
> >
> > Hirofumi Fujiwara (Tokyo JAPAN)
> > enjoy JAVA and Puzzle World
> > [EMAIL PROTECTED] http://www.pro.or.jp/~fuji/index-eng.html
> > [EMAIL PROTECTED] Puzzle Japan http://www.puzzle.jp/
> > My SUDOKU Probs http://www.pro.or.jp/~fuji/sudoku/problems/

--
MySQL General Mailing List
For list archives: http://lists.mysql.com/mysql
To unsubscribe: http://lists.mysql.com/[EMAIL PROTECTED]
CONCAT() returns not correct character set
Dear MySQL fans,

I tested CONCAT() with binary strings and got a strange result.

The manual says:
http://dev.mysql.com/doc/refman/5.0/en/string-functions.html

  If the arguments include any binary strings, the result is a
  binary string.

But the following test says: binary + latin1 -> latin1 (not binary)

mysql> SELECT CHARSET(CONCAT(_binary'Bianry',CONVERT('abc' USING latin1)));
+--------------------------------------------------------------+
| CHARSET(CONCAT(_binary'Bianry',CONVERT('abc' USING latin1))) |
+--------------------------------------------------------------+
| latin1                                                       |
+--------------------------------------------------------------+
1 row in set (0.00 sec)

Hirofumi Fujiwara (Tokyo JAPAN)
default characterset of function string parameter
Dear MySQL fans,

I am checking how MySQL 5.0 handles multibyte (Japanese) strings.

When I declare a CHAR parameter without a CHARACTER SET clause, the
parameter's character set is latin1, not the database character set.

17.2.1. CREATE PROCEDURE says:

  For character data types, if there is a CHARACTER SET clause in
  the declaration, the specified character set and its default
  collation are used. If there is no such clause, the database
  character set and collation are used. (These are given by the
  values of the character_set_database and collation_database
  system variables.)

Following is my test:

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
mysql> SET @@character_set_database=eucjpms;
Query OK, 0 rows affected (0.00 sec)

mysql> SHOW VARIABLES LIKE 'character\_set\_%';
+--------------------------+---------+
| Variable_name            | Value   |
+--------------------------+---------+
| character_set_client     | eucjpms |
| character_set_connection | eucjpms |
| character_set_database   | eucjpms |
| character_set_results    | eucjpms |
| character_set_server     | latin1  |
| character_set_system     | utf8    |
+--------------------------+---------+
6 rows in set (0.00 sec)

mysql> DELIMITER //
mysql> CREATE FUNCTION parametercharset( s CHAR(20) )
    -> RETURNS CHAR(50) CHARACTER SET binary
    -> DETERMINISTIC RETURN CONCAT( s, ':', CHARSET(s) );
    -> //
Query OK, 0 rows affected (0.00 sec)

mysql> DELIMITER ;
mysql> SELECT parametercharset('hello');
+---------------------------+
| parametercharset('hello') |
+---------------------------+
| hello:latin1              |
+---------------------------+
1 row in set (0.00 sec)

mysql>
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Hirofumi Fujiwara (Tokyo JAPAN)
Re: How to define utf8 function
Hi,

> Hello.
>
> This should be fixed in 5.0.18. See:
> http://bugs.mysql.com/bug.php?id=13909

17.2.1. CREATE PROCEDURE and CREATE FUNCTION
http://dev.mysql.com/doc/refman/5.0/en/create-procedure.html
says ...

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
As of MySQL 5.0.18, the server uses the data type of a routine
parameter or function return value as follows. These rules also
apply to local routine variables created with the DECLARE statement
(Section 17.2.9.1, “DECLARE Local Variables”).

* Assignments are checked for data type mismatches and overflow.
  Conversion and overflow problems result in warnings, or errors in
  strict mode.

* For character data types, if there is a CHARACTER SET clause in
  the declaration, the specified character set and its default
  collation are used. If there is no such clause, the database
  character set and collation are used. (These are given by the
  values of the character_set_database and collation_database
  system variables.)

* Only scalar values can be assigned to parameters or variables.
  For example, a statement such as SET x = (SELECT 1, 2) is invalid.

Before MySQL 5.0.18, parameters, return values, and local variables
are treated as items in expressions, and are subject to automatic
(silent) conversion and truncation. Stored functions ignore the
sql_mode setting.
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

I understood this explanation to mean that functions honor the
CHARACTER SET clause as of 5.0.18.

So I checked on version 5.0.18, but the situation is the same.

mysql> SELECT VERSION();
+---------------------+
| VERSION()           |
+---------------------+
| 5.0.18-standard-log |
+---------------------+
1 row in set (0.01 sec)

mysql> SET NAMES utf8;
Query OK, 0 rows affected (0.00 sec)

## I tried to make a function tokyo() which returns the string
## 'Tokyo' in Japanese. _utf8 X'E69DB1E4BAAC' means Tokyo in Japanese.

mysql> DELIMITER //
mysql> CREATE FUNCTION tokyo() RETURNS VARCHAR(20) CHARACTER SET utf8
    -> DETERMINISTIC RETURN _utf8 X'E69DB1E4BAAC';
    -> //
Query OK, 0 rows affected (0.00 sec)

mysql> DELIMITER ;
mysql> SELECT tokyo();
+---------+
| tokyo() |
+---------+
| ??      |
+---------+
1 row in set, 1 warning (0.00 sec)

mysql> SELECT HEX(tokyo());
+--------------+
| HEX(tokyo()) |
+--------------+
| 3F3F         |
+--------------+
1 row in set, 1 warning (0.00 sec)

mysql> SELECT CHARSET(tokyo());
+------------------+
| CHARSET(tokyo()) |
+------------------+
| binary           |
+------------------+
1 row in set (0.00 sec)

mysql> SHOW CREATE FUNCTION tokyo\G
*************************** 1. row ***************************
       Function: tokyo
       sql_mode:
Create Function: CREATE FUNCTION `tokyo`() RETURNS varchar(20)
DETERMINISTIC RETURN _utf8 X'E69DB1E4BAAC'
1 row in set (0.00 sec)

mysql>

The character set returned by the function is binary.
And SHOW CREATE FUNCTION dropped the CHARACTER SET utf8 part.

Why?

Hirofumi Fujiwara [EMAIL PROTECTED], [EMAIL PROTECTED]
Time Intermedia Corporation http://www.timedia.co.jp/
Corporate Strategy Department
Knowledge Engineering Center
26-27 Saka-machi Shinjuku-ku, Tokyo 160-0002 Japan
How to define utf8 function
Dear MySQL fans,

I want to make a function which returns a utf8 string.

mysql> CREATE FUNCTION hello() RETURNS CHAR(20) CHARACTER SET utf8
    -> RETURN 'japanese-string';
Query OK, 0 rows affected (0.02 sec)

mysql> SELECT hello();
+-----------------+
| hello()         |
+-----------------+
| japanese-string |
+-----------------+
1 row in set (0.00 sec)

mysql>

But the returned character set is binary.

mysql> SELECT CHARSET(hello());
+------------------+
| CHARSET(hello()) |
+------------------+
| binary           |
+------------------+
1 row in set (0.00 sec)

mysql>

I tried to make a function tokyo() which returns the string 'Tokyo'
in Japanese. _utf8 X'E69DB1E4BAAC' means Tokyo in Japanese.

mysql> CREATE FUNCTION tokyo() RETURNS CHAR(20) CHARACTER SET utf8
    -> RETURN _utf8 X'E69DB1E4BAAC';
Query OK, 0 rows affected (0.01 sec)

mysql> SELECT tokyo();
+---------+
| tokyo() |
+---------+
| ??      |
+---------+
1 row in set (0.03 sec)

mysql> SELECT hex(tokyo());
+--------------+
| hex(tokyo()) |
+--------------+
| 3F3F         |
+--------------+
1 row in set (0.01 sec)

mysql> SELECT CHARSET(tokyo());
+------------------+
| CHARSET(tokyo()) |
+------------------+
| binary           |
+------------------+
1 row in set (0.00 sec)

mysql>

But the character set the function returns is always binary.

I want to know how to return a utf8 string.

Hirofumi Fujiwara (Tokyo JAPAN)
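The byte values in this report can be checked outside MySQL. The following is a minimal Python sketch (Python is used here only for illustration and is not part of the original report; the variable names are made up) that decodes the hex literal from the function body and the 3F3F value that HEX(tokyo()) returned.

```python
# Decode the bytes used in the tokyo() function body as UTF-8.
tokyo_bytes = bytes.fromhex("E69DB1E4BAAC")
tokyo_text = tokyo_bytes.decode("utf-8")   # two kanji, three bytes each

# HEX(tokyo()) returned 3F3F, which is two ASCII question marks.
# That is consistent with each of the two kanji being replaced by '?'.
returned = bytes.fromhex("3F3F").decode("ascii")

print(tokyo_text, len(tokyo_text), len(tokyo_bytes))
print(returned)
```

So the literal itself is valid UTF-8 for a two-character string; the loss appears to happen when the function returns the value, not when the literal is written.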
Re: Can we run linux commands from inside the msql client
> hi..
> I searched the documentation and googled for some time, but didn't
> find anything related to this:
> can we execute shell commands inside the mysql client (like using !
> in Oracle)? If possible, please tell me how.
>
> sujay

  system (\!)  Execute a system shell command.

I think \! is what you want.

Hirofumi Fujiwara
Time Intermedia Corporation http://www.timedia.co.jp/
manual is garbled
Hi,

The Japanese edition of the MySQL online manual is garbled.

http://dev.mysql.com/doc/mysql/ja/index.html

I think all the manuals other than the English one are garbled.

What happened?

Hirofumi Fujiwara
Time Intermedia Corporation http://www.timedia.co.jp/
INSTALL-SOURCE is size 0
Hi,

I downloaded mysql-5.0.10-beta.tar.gz.

INSTALL-SOURCE is size 0.

drwxr-xr-x    2 fuji     user         4096  7月 23 04:50 BUILD
-rw-r--r--    1 fuji     user        19071  7月 23 04:42 COPYING
-rw-r--r--    1 fuji     user       195900  7月 23 04:42 ChangeLog
drwxr-xr-x    3 fuji     user         4096  7月 23 04:50 Docs
-rw-r--r--    1 fuji     user            0  7月 23 04:45 EXCEPTIONS-CLIENT
-rw-r--r--    1 fuji     user            0  7月 23 04:45 INSTALL-SOURCE
-rw-r--r--    1 fuji     user         3599  7月 23 04:42 Makefile.am
-rw-r--r--    1 fuji     user        28126  7月 23 04:43 Makefile.in
-rw-r--r--    1 fuji     user         1937  7月 23 04:42 README

What happened?

Hirofumi Fujiwara (Tokyo JAPAN)
I couldn't find cp932 and eucjpms
Hi,

> A new version of MySQL Community Edition 5.0.3-beta Open Source
> database management system has been released. This version now
> includes support for Stored Procedures, Triggers, Views and many
> other features. It is now available in source and binary form for a
> number of platforms from our download pages at
> http://dev.mysql.com/downloads/ and mirror sites.

I downloaded mysql-5.0.3-beta-win32.zip and installed it.
MySQL 5.0.3 works very well on my Windows XP (Japanese environment).

The manual says that MySQL supports cp932 and eucjpms.

10.11. Character Sets and Collations That MySQL Supports
http://dev.mysql.com/doc/mysql/en/charset-charsets.html

| cp932    | SJIS for Windows Japanese   | cp932_japanese_ci   |
| eucjpms  | UJIS for Windows Japanese   | eucjpms_japanese_ci |
+----------+-----------------------------+---------------------+

But I couldn't find cp932 or eucjpms.

mysql> select version();
+---------------+
| version()     |
+---------------+
| 5.0.3-beta-nt |
+---------------+
1 row in set (0.00 sec)

mysql> show character set;
+----------+-----------------------------+---------------------+--------+
| Charset  | Description                 | Default collation   | Maxlen |
+----------+-----------------------------+---------------------+--------+
| big5     | Big5 Traditional Chinese    | big5_chinese_ci     |      2 |
| dec8     | DEC West European           | dec8_swedish_ci     |      1 |
| cp850    | DOS West European           | cp850_general_ci    |      1 |
| hp8      | HP West European            | hp8_english_ci      |      1 |
| koi8r    | KOI8-R Relcom Russian       | koi8r_general_ci    |      1 |
| latin1   | ISO 8859-1 West European    | latin1_swedish_ci   |      1 |
| latin2   | ISO 8859-2 Central European | latin2_general_ci   |      1 |
| swe7     | 7bit Swedish                | swe7_swedish_ci     |      1 |
| ascii    | US ASCII                    | ascii_general_ci    |      1 |
| ujis     | EUC-JP Japanese             | ujis_japanese_ci    |      3 |
| sjis     | Shift-JIS Japanese          | sjis_japanese_ci    |      2 |
| hebrew   | ISO 8859-8 Hebrew           | hebrew_general_ci   |      1 |
| tis620   | TIS620 Thai                 | tis620_thai_ci      |      1 |
| euckr    | EUC-KR Korean               | euckr_korean_ci     |      2 |
| koi8u    | KOI8-U Ukrainian            | koi8u_general_ci    |      1 |
| gb2312   | GB2312 Simplified Chinese   | gb2312_chinese_ci   |      2 |
| greek    | ISO 8859-7 Greek            | greek_general_ci    |      1 |
| cp1250   | Windows Central European    | cp1250_general_ci   |      1 |
| gbk      | GBK Simplified Chinese      | gbk_chinese_ci      |      2 |
| latin5   | ISO 8859-9 Turkish          | latin5_turkish_ci   |      1 |
| armscii8 | ARMSCII-8 Armenian          | armscii8_general_ci |      1 |
| utf8     | UTF-8 Unicode               | utf8_general_ci     |      3 |
| ucs2     | UCS-2 Unicode               | ucs2_general_ci     |      2 |
| cp866    | DOS Russian                 | cp866_general_ci    |      1 |
| keybcs2  | DOS Kamenicky Czech-Slovak  | keybcs2_general_ci  |      1 |
| macce    | Mac Central European        | macce_general_ci    |      1 |
| macroman | Mac West European           | macroman_general_ci |      1 |
| cp852    | DOS Central European        | cp852_general_ci    |      1 |
| latin7   | ISO 8859-13 Baltic          | latin7_general_ci   |      1 |
| cp1251   | Windows Cyrillic            | cp1251_general_ci   |      1 |
| cp1256   | Windows Arabic              | cp1256_general_ci   |      1 |
| cp1257   | Windows Baltic              | cp1257_general_ci   |      1 |
| binary   | Binary pseudo charset       | binary              |      1 |
| geostd8  | GEOSTD8 Georgian            | geostd8_general_ci  |      1 |
+----------+-----------------------------+---------------------+--------+
34 rows in set (0.00 sec)

mysql>

Hirofumi Fujiwara
Time Intermedia Corporation http://www.timedia.co.jp/
What is 8-byte address
Hi,

I found the expression "8-byte network address" in the MySQL online
manual. I know IPv4 (4 bytes) and IPv6 (16 bytes).

Please let me know what an 8-byte network address is.

http://dev.mysql.com/doc/mysql/en/Miscellaneous_functions.html

INET_ATON(expr)
  Given the dotted-quad representation of a network address as a
  string, returns an integer that represents the numeric value of
  the address. Addresses may be 4- or 8-byte addresses.

INET_NTOA(expr)
  Given a numeric network address (4 or 8 byte), returns the
  dotted-quad representation of the address as a string.

Hirofumi Fujiwara (Tokyo JAPAN)
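For readers unfamiliar with the conversion the manual describes, here is a minimal Python sketch of the 4-byte case. It does not explain the "8-byte" wording; it only shows the arithmetic for a dotted quad. The function names mirror the SQL ones for readability, but they are local helpers written for this illustration, not MySQL code.

```python
def inet_aton(dotted: str) -> int:
    """Fold a dotted-quad string into one integer, most significant byte first."""
    value = 0
    for part in dotted.split("."):
        octet = int(part)
        if not 0 <= octet <= 255:
            raise ValueError(f"octet out of range: {part}")
        value = value * 256 + octet
    return value


def inet_ntoa(value: int) -> str:
    """Inverse conversion, for 4-byte addresses only."""
    if not 0 <= value < 2**32:
        raise ValueError("not a 4-byte address")
    return ".".join(str((value >> shift) & 0xFF) for shift in (24, 16, 8, 0))


# 192*256**3 + 168*256**2 + 1*256 + 10
number = inet_aton("192.168.1.10")
print(number)
print(inet_ntoa(number))
```

A four-part dotted quad never needs more than 32 bits, which is why the 8-byte remark in the manual is puzzling.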
Question on hex expression of strings
I got the following result when I tried to write strings as hex
literals:

  x'B4C1BBFA'    OK
  X'B4C1BBFA'    OK
  0xB4C1BBFA     OK
  0XB4C1BBFA     Error

I was checking how MySQL treats upper- and lower-case 'x'. '0X' didn't
work, and I am wondering whether this is defined by the MySQL
specification.

Could you let me know whether this is specified behavior or
unexpected behavior?

Hirofumi Fujiwara, Knowledge Engineering Center
Time Intermedia Corporation, http://www.timedia.co.jp/
SJIS character including 7 bit ASCII code
When using the SJIS character set, there is a problem when the second
byte of a character matches a 7-bit ASCII code. Characters whose
second byte is \ (\x5C) need particular caution.

The range of SJIS double-byte codes is

  [\x81-\x9F\xE0-\xFC][\x40-\x7E\x80-\xFC]

Please see my test result at
http://epx.timedia.co.jp/mysql/sjis-2ndbyte_html

The problem was solved when I specified sjis when launching the mysql
client:

  mysql --default-character-set=sjis

or in my.cnf:

  [mysql]
  default-character-set = sjis

Without this setting, MySQL didn't handle the characters correctly
even when I issued the following statements:

  set names sjis
  set character set sjis

This is the explanation in the English manual:

---
http://dev.mysql.com/doc/mysql/en/Character_sets.html
5.7.1 The Character Set Used for Data and Sorting

You can force the client to use specific character set as follows:

  [client]
  default-character-set=character-set-name

This is normally unnecessary, however.
---

But it seems that I have to set this option to use the SJIS character
set.

Hirofumi Fujiwara (Tokyo JAPAN)
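The second-byte problem described above can be reproduced without MySQL. The following minimal Python sketch uses Python's built-in shift_jis codec to find characters whose trail byte is 0x5C; the helper name and the sample string are made up for this illustration.

```python
def trail_byte_is_backslash(ch: str) -> bool:
    """True if ch is a double-byte Shift_JIS character whose second byte is 0x5C."""
    raw = ch.encode("shift_jis")
    return len(raw) == 2 and raw[1] == 0x5C


sample = "表示ソフト"   # ordinary Japanese text, chosen for this example
risky = [ch for ch in sample if trail_byte_is_backslash(ch)]

print(risky)
print(" ".join(f"{b:02X}" for b in "表".encode("shift_jis")))
```

Any component that scans the query byte by byte, without knowing the text is SJIS, would see the second byte of these characters as a backslash escape. That matches the observation that the client had to be started with the character set already set to sjis.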
Space in multi-byte character set
Here are the character codes of the full-width space (IDSP,
Ideographic Space) in sjis, ujis and utf8:

  sjis  81 40
  ujis  A1 A1
  utf8  E3 80 80

The string functions TRIM, LTRIM, and RTRIM don't recognize a
full-width space in a string. They don't trim the full-width space
and leave it in the string.

It seems that these functions don't treat the Japanese full-width
space as a space.

Hirofumi Fujiwara (Tokyo JAPAN)
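The byte values listed above can be confirmed with a short Python sketch, shown here only as an illustration. It also includes a small helper, `trim_spaces` (a hypothetical name, not a MySQL function), showing what trimming both kinds of space would look like on the application side.

```python
IDEOGRAPHIC_SPACE = "\u3000"

# Print the encoded form in each of the three character sets.
# Python codec names: shift_jis ~ sjis, euc_jp ~ ujis, utf-8 ~ utf8.
for codec in ("shift_jis", "euc_jp", "utf-8"):
    encoded = IDEOGRAPHIC_SPACE.encode(codec)
    print(codec, " ".join(f"{b:02X}" for b in encoded))


def trim_spaces(text: str) -> str:
    """Strip ASCII spaces and ideographic spaces from both ends."""
    return text.strip(" " + IDEOGRAPHIC_SPACE)


print(repr(trim_spaces("\u3000 東京 \u3000")))
```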
Re: MySQL and Unicode
> Hi,
>
> I have a question. I have read that since MySQL 4.1, the VARCHAR
> type can hold Unicode characters. But it seems that the C library
> API used to send queries to the server still uses 'const char*' as
> the type of the query. So, practically, if I have a Unicode string
> stored in C, how can I use it with the C API?
>
> Thanks.

I use Unicode (UTF-8) strings as follows:

  mysql_query( mysql, "SELECT include Japanese characters (KANJI)" );

I think this might answer your question.

Hirofumi Fujiwara (Tokyo JAPAN)
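One reason a plain byte string can carry UTF-8 text is that UTF-8 never produces a zero byte for any character other than U+0000, so a NUL-terminated `const char*` is not cut short. The following Python sketch illustrates that property; the query text is invented for the example, and UTF-16 is used only as a stand-in for a fixed two-byte encoding.

```python
query = "SELECT '東京'"

utf8_bytes = query.encode("utf-8")
two_byte_form = query.encode("utf-16-be")   # stand-in for a 2-byte encoding

# No NUL byte appears inside the UTF-8 form of the query.
print(0 in utf8_bytes)
# In the two-byte form every ASCII letter carries a 0x00 high byte.
print(0 in two_byte_form)
```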
Displaying string including Kanji characters
When I use a multi-byte encoding (especially UTF-8) with data that
includes Kanji (Japanese kanji characters) and execute SELECT from the
mysql client command, the vertical lines (|) of the output table
aren't aligned as straight lines and the output doesn't look like a
table.

When I use SJIS, the problem of misaligned lines doesn't occur. This
is because the display width and the number of bytes of the data
always match.

To display multi-byte text, not only the character count and the byte
count but also the display width is very important. The width of a
Kanji in EUC-JP, SJIS, and UTF-8 is double the width of an
alphanumeric character.

As far as I could see, the results of display-related functions such
as LPAD and RPAD don't seem to take the display width of Kanji into
account.

Hirofumi Fujiwara (Tokyo JAPAN)
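The three quantities mentioned above (character count, byte count, display width) can be compared with a minimal Python sketch. The `display_width` helper is an assumption made for this illustration: it counts two columns for characters that Unicode classifies as wide or fullwidth and one column otherwise, which is a common terminal convention rather than anything the mysql client is known to do.

```python
import unicodedata


def display_width(text: str) -> int:
    """Count terminal columns: wide and fullwidth characters take two."""
    return sum(
        2 if unicodedata.east_asian_width(ch) in ("W", "F") else 1
        for ch in text
    )


# Columns printed: text, characters, UTF-8 bytes, SJIS bytes, display width
for word in ("Tokyo", "東京"):
    print(
        word,
        len(word),
        len(word.encode("utf-8")),
        len(word.encode("shift_jis")),
        display_width(word),
    )
```

For the kanji sample the SJIS byte count equals the display width while the UTF-8 byte count does not, which is consistent with the alignment difference reported above.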
Re: Displaying string including Kanji characters
> When I use a multi-byte encoding (especially UTF-8) with data that
> includes Kanji (Japanese kanji characters) and execute SELECT from
> the mysql client command, the vertical lines (|) of the output table
> aren't aligned as straight lines and the output doesn't look like a
> table.
>
> When I use SJIS, the problem of misaligned lines doesn't occur. This
> is because the display width and the number of bytes of the data
> always match.

I captured this result as images and posted them at the following
URL. Please check them to see what I mean.

http://epx.timedia.co.jp/mysql/select_result_html

Hirofumi Fujiwara (Tokyo JAPAN)
REGEXP with UTF-8
I am testing the regular expression feature (REGEXP) of MySQL
4.1.1-alpha with UTF-8 characters (Japanese). So far it doesn't seem
to work.

LIKE handles each Japanese character properly as one character, but
REGEXP doesn't.

I'd like to know whether REGEXP handles UTF-8 characters.

Hirofumi Fujiwara (Tokyo JAPAN)
Re: REGEXP with UTF-8
From: Sergei Golubchik
> Hi!
>
> On Apr 06, Hirofumi Fujiwara wrote:
> > I am testing regular expression feature of MYSQL 4.1.1-alpha
> > (REGEXP) with UTF-8 characters (Japanese). So far it doesn't seem
> > to work.
> >
> > LIKE handles each Japanese character properly as one character,
> > but REGEXP doesn't.
> >
> > I'd like to know if REGEXP handles UTF-8 characters.
>
> No it does not :(
> (we need to change regexp library that we use, to have it fixed)

I wonder whether REGEXP also fails to handle the other multi-byte
character sets (big5, ujis, sjis, euckr, gb2312, gbk, utf8, ucs2),
which would mean it handles only single-byte character sets
(Maxlen=1).

Hirofumi Fujiwara (Tokyo JAPAN)
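The difference between character-aware and byte-wise matching can be illustrated with Python's `re` module. This is only an analogy for a byte-oriented regex engine; it says nothing about how MySQL's regexp library is implemented.

```python
import re

kanji = "東"
raw = kanji.encode("utf-8")   # three bytes: E6 9D B1

# Character-aware matching: "." consumes the whole kanji.
char_match = re.fullmatch(".", kanji) is not None
# Byte-wise matching: "." consumes one byte, so a single "." cannot cover it.
one_byte_match = re.fullmatch(b".", raw) is not None
# Three "." are needed to cover the three bytes of the same character.
three_byte_match = re.fullmatch(b"...", raw) is not None

print(char_match, one_byte_match, three_byte_match)
```

An engine that works this way would still behave correctly for single-byte character sets, which fits the Maxlen=1 guess above.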