ID:               33793
 Updated by:       [EMAIL PROTECTED]
 Reported By:      lars dot jensen at careercross dot com
-Status:           Open
+Status:           Assigned
 Bug Type:         mbstring related
 Operating System: FreeBSD 5.3
 PHP Version:      5.0.3
-Assigned To:      
+Assigned To:      moriyoshi
 New Comment:

Assigned to the maintainer.


Previous Comments:
------------------------------------------------------------------------

[2005-07-21 03:19:47] lars dot jensen at careercross dot com

Description:
------------
Writing a class to handle conversion of UTF-8 input into SJIS using
usual 
$body = mb_convert_encoding($body, "SJIS", "UTF-8");
function, usually works, but so far by testing, I identified three
kanji's which makes this function fail to convert these correctly and
thus causing mojibake

The three UTF-8 characters is identified as follows 
chr(227).chr(136).chr(177)
chr(227).chr(136).chr(178)
chr(227).chr(136).chr(185)

which in SJIS corresponds to
chr(135).chr(138)
chr(135).chr(139)
chr(135).chr(140)

Reproduce code:
---------------
I created a "quick'n'dirty" solution as follows, which surely isnt
optimal

$body = str_replace(chr(227).chr(136).chr(177), '#mojihack1#', $body);
$body = str_replace(chr(227).chr(136).chr(178), '#mojihack2#', $body);
$body = str_replace(chr(227).chr(136).chr(185), '#mojihack3#', $body);

$body = mb_convert_encoding($body, "SJIS", "UTF-8");

$body = str_replace('#mojihack1#', chr(135).chr(138), $body);
$body = str_replace('#mojihack2#', chr(135).chr(139), $body);
$body = str_replace('#mojihack3#', chr(135).chr(140), $body);




------------------------------------------------------------------------


-- 
Edit this bug report at http://bugs.php.net/?id=33793&edit=1

Reply via email to