- Original Message -
From: angie ahl [EMAIL PROTECTED]
To: [EMAIL PROTECTED]; [EMAIL PROTECTED]
Sent: Wednesday, December 01, 2004 9:26 AM
Subject: Unicode (utf8) and MySQL (with Perl)
Hi List.
Please excuse the cross posting but I've been scouring the archives
and no joy as yet.
I'm trying to get Perl and MySQL using utf8 happily and I've followed
several tutorials but am not getting the same results.
I've got a load of utf8 characters like so (perl):
my %uni = (
hebrew_alef = {
character = chr(0x05d0),
language = hebrew,
},
smiley = {
character = \x{263a},
language = none,
},
);
I am inserting them into MySQL using the dbi module DBD::MySQL
The tutorial said to insert the values like this:
INSERT INTO unitest (id, aword) VALUES ( smiley,
CONVERT(_utf8'\x{263a}' USING utf8) );
get the values back like this:
select aword from unitest where id = smiley;
then use perl to decode the returned value like so:
decode(utf8, $aword)
This doesn't work for me properly. However when I insert them like this:
INSERT INTO unitest (id, aword) VALUES ( $smiley, '\x{263a}' );
It seems to work for all but the hebrew_alef which is the character
chr(0x05d0)
So here's my questions:
Is chr(0x05d0) a unicode character?
Do we need to use CONVERT to insert data (it's seems to working better
without it here, but I don't understand CONVERT and the manual didn't
clear that up for me), or should we be inserting utf8 chars the
standard way.
I'm using mysql 4.1.7 and perl 5.8.1 on OS X 10.3
TIA I'm struggling now.
I can't help with very much of your question because I haven't really worked
with character sets in MySQL. However, I can tell you that 0x05d0 *is* the
Unicode value of aleph in Hebrew. You can see the entire Hebrew Unicode
character set at http://www.unicode.org/charts/PDF/U0590.pdf to confirm this
for yourself (and look up other Hebrew codes).
To see all of the Unicode charts, go to http://www.unicode.org/charts/.
By the way, I don't know if you've examined it already but there is a full
chapter in the MySQL manual on character sets which may answer some of your
questions. See http://dev.mysql.com/doc/mysql/en/Charset.html.
Rhino
--
MySQL General Mailing List
For list archives: http://lists.mysql.com/mysql
To unsubscribe:http://lists.mysql.com/[EMAIL PROTECTED]