Sergiu Dumitriu wrote:
> Paul Libbrecht wrote:
>>
>> Le 14-mars-09 à 19:53, Vincent Massol a écrit :
>>
>>> We'd need to put huge disclaimers everywhere since this would cause
>>> problems with users (the majority?) who have MySQL or other dbs set in
>>> ISO8859-1, or their containers set a ISO8859-1 as the default, right?
>>
>> Actually no.
>> My experience is that there's no "encoding mismatch" even with a mysql 
>> which is set to iso-8859-1 while xwiki and jdbc is set to utf-8. The 
>> only mismatch we endured until we moved to a mysql that's parametrized 
>> for utf-8 is the storage inability for high-bytes characters (my test 
>> cases was lambda λ and leq  ≤) which simply got garbled after the 
>> persist and read cycle happened (may take one hour sometimes on xwiki).
> 
> Yes, the problem is that mysql silently transforms characters it can't 
> store using the current charset into '?'. And since the document is 
> still in the cache, this won't get noticed for a while.
> 
>>> Is mysql in UTF8 by default?
>>
>> It was in my MacOSX MySQL which I downloaded (from mysql.org) about a 
>> year ago.
>> Same for my colleague on Windows.
>> Same for our sysadmin with a Linux MySQL downloaded from there.
>> But it was not the case with the mysql coming from a 3-years old SuSE 
>> installation.
> 
> mysql set to utf8 on Gentoo.
> 
>>> Same question for tomcat or other JEE   containers.
>>
>> Many many many times the platform encoding is used rather than the 
>> containers' encoding. So that's MacRoman on mac! (enjoy!)
>> The re-encoding happens transparently though, there's no commonality 
>> between that back-end and front-end.
> 
> The only problem I've seen in practice is the PDF export, which uses new 
> String(bytes) and String.getBytes() without specifying the xwiki.cfg 
> encoding. I've opened http://jira.xwiki.org/jira/browse/XWIKI-3361 which 
> isn't hard to fix.

And now it's fixed, so there's no other problem for switching to utf8, 
except latin1 databases.

>> One thing that has bitten me more often than not is that Apache Httpd 
>> defaults to iso-8859-1 set explicitly on files with suffix html (so 
>> the mime-type is text/html;charset=iso-8859-1 instead of leaving to 
>> the meta head element the luxury of specifying that). I believe it 
>> should not affect normal XWiki though.
>>
>> paul
>>
>> PS: I think that a sane admin that does an update will diff xwiki.cfg 
>> and will see the difference!
> 
> Absolutely. Unfortunately sane admins are almost as rare as flying pigs. 
> Almost.


-- 
Sergiu Dumitriu
http://purl.org/net/sergiu/
_______________________________________________
devs mailing list
devs@xwiki.org
http://lists.xwiki.org/mailman/listinfo/devs

Reply via email to