From: lunter at interia dot pl Operating system: all PHP version: 6CVS-2009-01-12 (CVS) PHP Bug Type: Unicode Engine related Bug description: binary representation of unicode
Description: ------------ converting binary<->string without charset translating for view binary representation of unicode or generate unicode from valid binary consists unicode sequenses note that: unicode_encode/unicode_decode using charset translating, see Reproduce code Example 1: You have (binary)$b. It consists two bytes: 11001110 10110010 Its length in binary representation is two. It is also valid one-length UTF-8 char(946) (greek small letter beta) How to conver it ($b) into one-char UTF-8 string?? When we try $u=(string)$b, it gives two-char UTF-8 string. Example 2: You have (string)$u UTF-8 one-char string. It consists chr(946) (greek small letter beta) Now You have to see two bytes binary representation of this (11001110 10110010). There is no way to convert it without charset translation... Reproduce code: --------------- <? $s=chr(946); print(strlen($s)); print('<br>'); $b=unicode_encode($s,'iso-8859-1'); print(strlen($b)); ?> Expected result: ---------------- 1 (unicode 1 char) 2 (binary 2 bytes) [11001110 10110010] Actual result: -------------- 1 1 no way to converting binary<->string without charset translating in binary we have length = 1 but it is 2 bytes -- Edit bug report at http://bugs.php.net/?id=47076&edit=1 -- Try a CVS snapshot (PHP 5.2): http://bugs.php.net/fix.php?id=47076&r=trysnapshot52 Try a CVS snapshot (PHP 5.3): http://bugs.php.net/fix.php?id=47076&r=trysnapshot53 Try a CVS snapshot (PHP 6.0): http://bugs.php.net/fix.php?id=47076&r=trysnapshot60 Fixed in CVS: http://bugs.php.net/fix.php?id=47076&r=fixedcvs Fixed in CVS and need be documented: http://bugs.php.net/fix.php?id=47076&r=needdocs Fixed in release: http://bugs.php.net/fix.php?id=47076&r=alreadyfixed Need backtrace: http://bugs.php.net/fix.php?id=47076&r=needtrace Need Reproduce Script: http://bugs.php.net/fix.php?id=47076&r=needscript Try newer version: http://bugs.php.net/fix.php?id=47076&r=oldversion Not developer issue: http://bugs.php.net/fix.php?id=47076&r=support Expected behavior: http://bugs.php.net/fix.php?id=47076&r=notwrong Not enough info: http://bugs.php.net/fix.php?id=47076&r=notenoughinfo Submitted twice: http://bugs.php.net/fix.php?id=47076&r=submittedtwice register_globals: http://bugs.php.net/fix.php?id=47076&r=globals PHP 4 support discontinued: http://bugs.php.net/fix.php?id=47076&r=php4 Daylight Savings: http://bugs.php.net/fix.php?id=47076&r=dst IIS Stability: http://bugs.php.net/fix.php?id=47076&r=isapi Install GNU Sed: http://bugs.php.net/fix.php?id=47076&r=gnused Floating point limitations: http://bugs.php.net/fix.php?id=47076&r=float No Zend Extensions: http://bugs.php.net/fix.php?id=47076&r=nozend MySQL Configuration Error: http://bugs.php.net/fix.php?id=47076&r=mysqlcfg