Japanese AND Cyrillic based languages such as Romania is still messing up.

- "The Greek and Cyrillic looked ok to me" -

Unfortunately they are not, some characters are also being given the wrong
UTF-8 char code, resulting in the dreaded question marks :-(.

- "The web browser doesn't have a Japanese font" -

This is definitely not the problem. All character sets that are available to
me on Windows are installed, along with the language specific fonts.

Does anybody have experience in using libxml to insert UTF-8 data in 4.1.x?

Cheers,
Martin

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] 
Sent: 15 January 2005 22:09
To: Martin Gallagher
Cc: mysql@lists.mysql.com
Subject: RE: MySQL 4.1.8 and storing east characters

So its just the Japanese titles which aren't working?  The Greek and
Cyrillic looked ok to me.  If that's the case then the causes which occur to
me would be:

1. The web browser doesn't have a Japanese font (I don't know if I have one
or not so I can't check that)

2. Japanese characters are longer than 3 bytes long in utf8 (again I don't
know if that's the case or not).  If they are then according the following
link mysql only supports up to 3 byte long utf8 characters.

http://dev.mysql.com/doc/mysql/en/Charset-Unicode.html


> I am inserting data into mySQL via this script:
> http://www.feedsfarm.com/tmp.phps
> 
> I've set default-character-set=utf8
> 
> When I output the data to the browser (see:
> http://www.feedsfarm.com/tmp.php) it displays perfectly in the UTF-8 char
> set.
> 
> And I am certain libxml converts ANY encoding to UTF-8 when traversing and
> XML file thru DOM.
> 
> Cheers,
> - Martin
> 
> 
> -----Original Message-----
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] 
> Sent: 15 January 2005 20:47
> To: mysql@lists.mysql.com
> Subject: Re: MySQL 4.1.8 and storing east characters
> 
> How are you inserting the data into mysql?  LOAD DATA INFILE?  Be sure the
> client you are using to import the data is using the utf8 character set:
> 
> SET CHARACTER SET utf8 or --default-character-set=utf8
> 
> The best way to check whether the data was inserted into mysql correctly
is
> to use the mysql command line client and select the hex codes for the
> strings you inserted.  This will eliminate apache/php/web browser/terminal
> issues until you know you can insert the data properly into the database.

> 
> SELECT HEX(CONVERT(your_column USING ucs2)) FROM your_table;
> 
> I find its easier to check the codes using the ucs2 character set--That's
> the point of the convert() function.  Again make sure the client you are
> using to read the data is using the utf8 character set.
> 
> Have you verified that libxml is converting the data correctly?
> 
> regards,
> Jeremy March
> 
> -- 
> MySQL General Mailing List
> For list archives: http://lists.mysql.com/mysql
> To unsubscribe:
> http://lists.mysql.com/[EMAIL PROTECTED]
> 
> 
> 

-- 
MySQL General Mailing List
For list archives: http://lists.mysql.com/mysql
To unsubscribe:
http://lists.mysql.com/[EMAIL PROTECTED]




-- 
MySQL General Mailing List
For list archives: http://lists.mysql.com/mysql
To unsubscribe:    http://lists.mysql.com/[EMAIL PROTECTED]

Reply via email to