----- Original Message -----
From: "angie ahl" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>; <[EMAIL PROTECTED]>
Sent: Wednesday, December 01, 2004 9:26 AM
Subject: Unicode (utf8) and MySQL (with Perl)
> Hi List.
>
> Please excuse the cross posting but I've been scouring the archives
> and no joy as yet.
>
> I'm trying to get Perl and MySQL using utf8 happily and I've followed
> several tutorials but am not getting the same results.
>
> I've got a load of utf8 characters like so (perl):
>
> my %uni = (
> hebrew_alef => {
> character => chr(0x05d0),
> language => "hebrew",
> },
> smiley => {
> character => "\x{263a}",
> language => "none",
> },
> );
>
> I am inserting them into MySQL using the dbi module DBD::MySQL
>
> The tutorial said to insert the values like this:
> INSERT INTO unitest (id, aword) VALUES ( "smiley",
> CONVERT(_utf8'\x{263a}' USING utf8) );
>
> get the values back like this:
> select aword from unitest where id = "smiley;"
>
> then use perl to decode the returned value like so:
>
> decode("utf8", $aword)
>
> This doesn't work for me properly. However when I insert them like this:
>
> INSERT INTO unitest (id, aword) VALUES ( "$smiley", '\x{263a}' );
>
> It seems to work for all but the hebrew_alef which is the character
chr(0x05d0)
>
> So here's my questions:
>
> Is chr(0x05d0) a unicode character?
>
> Do we need to use CONVERT to insert data (it's seems to working better
> without it here, but I don't understand CONVERT and the manual didn't
> clear that up for me), or should we be inserting utf8 chars the
> standard way.
>
> I'm using mysql 4.1.7 and perl 5.8.1 on OS X 10.3
>
> TIA I'm struggling now.
>
I can't help with very much of your question because I haven't really worked
with character sets in MySQL. However, I can tell you that 0x05d0 *is* the
Unicode value of aleph in Hebrew. You can see the entire Hebrew Unicode
character set at http://www.unicode.org/charts/PDF/U0590.pdf to confirm this
for yourself (and look up other Hebrew codes).
To see all of the Unicode charts, go to http://www.unicode.org/charts/.
By the way, I don't know if you've examined it already but there is a full
chapter in the MySQL manual on character sets which may answer some of your
questions. See http://dev.mysql.com/doc/mysql/en/Charset.html.
Rhino
--
MySQL General Mailing List
For list archives: http://lists.mysql.com/mysql
To unsubscribe: http://lists.mysql.com/[EMAIL PROTECTED]