ID: 33793 Updated by: [EMAIL PROTECTED] Reported By: lars dot jensen at careercross dot com -Status: Open +Status: Assigned Bug Type: mbstring related Operating System: FreeBSD 5.3 PHP Version: 5.0.3 -Assigned To: +Assigned To: moriyoshi New Comment:
Assigned to the maintainer. Previous Comments: ------------------------------------------------------------------------ [2005-07-21 03:19:47] lars dot jensen at careercross dot com Description: ------------ Writing a class to handle conversion of UTF-8 input into SJIS using usual $body = mb_convert_encoding($body, "SJIS", "UTF-8"); function, usually works, but so far by testing, I identified three kanji's which makes this function fail to convert these correctly and thus causing mojibake The three UTF-8 characters is identified as follows chr(227).chr(136).chr(177) chr(227).chr(136).chr(178) chr(227).chr(136).chr(185) which in SJIS corresponds to chr(135).chr(138) chr(135).chr(139) chr(135).chr(140) Reproduce code: --------------- I created a "quick'n'dirty" solution as follows, which surely isnt optimal $body = str_replace(chr(227).chr(136).chr(177), '#mojihack1#', $body); $body = str_replace(chr(227).chr(136).chr(178), '#mojihack2#', $body); $body = str_replace(chr(227).chr(136).chr(185), '#mojihack3#', $body); $body = mb_convert_encoding($body, "SJIS", "UTF-8"); $body = str_replace('#mojihack1#', chr(135).chr(138), $body); $body = str_replace('#mojihack2#', chr(135).chr(139), $body); $body = str_replace('#mojihack3#', chr(135).chr(140), $body); ------------------------------------------------------------------------ -- Edit this bug report at http://bugs.php.net/?id=33793&edit=1