I just noticed that, in the link I sent you, the Japanese UTF-8 characters are indeed only 3 bytes long. So much for that idea.
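
For anyone who wants to confirm the byte lengths outside the database, here is a minimal sketch in Python. It is not part of Martin's script. The sample string and both helper names are made up for illustration. It prints the UTF-8 byte count of each character, plus the big-endian UTF-16 hex, which for characters in the Basic Multilingual Plane should match what the HEX(CONVERT(... USING ucs2)) query quoted below returns.

```python
# Hypothetical sample text; substitute a string taken from your own feed.
sample = "日本語"


def utf8_lengths(text):
    """Return (character, UTF-8 byte count) pairs for each character."""
    return [(ch, len(ch.encode("utf-8"))) for ch in text]


def ucs2_hex(text):
    """Return the big-endian UTF-16 bytes as uppercase hex.

    For characters in the Basic Multilingual Plane this is the same
    byte sequence as UCS-2.
    """
    return text.encode("utf-16-be").hex().upper()


if __name__ == "__main__":
    for ch, n in utf8_lengths(sample):
        print(ch, n)          # each of these three characters takes 3 bytes
    print(ucs2_hex(sample))   # 65E5672C8A9E
```

If the hex that comes back from MySQL differs from what this prints for the same string, the data was most likely altered somewhere between the client and the table, not in the browser.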
> I am inserting data into MySQL via this script:
> http://www.feedsfarm.com/tmp.phps
>
> I've set default-character-set=utf8
>
> When I output the data to the browser (see:
> http://www.feedsfarm.com/tmp.php) it displays perfectly in the UTF-8 char
> set.
>
> And I am certain libxml converts ANY encoding to UTF-8 when traversing an
> XML file through DOM.
>
> Cheers,
> - Martin
>
>
> -----Original Message-----
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
> Sent: 15 January 2005 20:47
> To: mysql@lists.mysql.com
> Subject: Re: MySQL 4.1.8 and storing east characters
>
> How are you inserting the data into mysql? LOAD DATA INFILE? Be sure the
> client you are using to import the data is using the utf8 character set:
>
> SET CHARACTER SET utf8 or --default-character-set=utf8
>
> The best way to check whether the data was inserted into mysql correctly is
> to use the mysql command line client and select the hex codes for the
> strings you inserted. This will eliminate apache/php/web browser/terminal
> issues until you know you can insert the data properly into the database.
>
> SELECT HEX(CONVERT(your_column USING ucs2)) FROM your_table;
>
> I find it's easier to check the codes using the ucs2 character set; that's
> the point of the CONVERT() function. Again, make sure the client you are
> using to read the data is using the utf8 character set.
>
> Have you verified that libxml is converting the data correctly?
>
> regards,
> Jeremy March
>
> --
> MySQL General Mailing List
> For list archives: http://lists.mysql.com/mysql
> To unsubscribe:
> http://lists.mysql.com/[EMAIL PROTECTED]