ID:               47033
 User updated by:  lunter at interia dot pl
 Reported By:      lunter at interia dot pl
 Status:           Bogus
 Bug Type:         Unicode Engine related
 Operating System: all
 PHP Version:      6CVS-2009-01-08 (snap)
 New Comment:

Johannes, false.

Try this:

<?
 $s=chr(946);
 print(strlen($s));

 print('<br>');

 $b=unicode_encode($s,'iso-8859-1');

 print(strlen($b));
?>

unicode chr(946) in binary encoding has len = 2 bytes, not 1


Previous Comments:
------------------------------------------------------------------------

[2009-01-11 22:13:45] johan...@php.net

We have unicode_encode() / unicode_decode().

------------------------------------------------------------------------

[2009-01-08 07:42:45] lunter at interia dot pl

Description:
------------
converting binary<->string without charset translating


Two functions for converting binary<->string without translating
charset needed.
It is very usefull because binary data can consists utf-8 substring,
you need to convert to string type.
Also when you have to see binary representation of utf-8 string, or
operating on it as binaries.


Example 1:

You have (binary)$b. It consists two bytes: 11001110 10110010
Its length in binary representation is two.
It is also valid one-length UTF-8 char(946) (greek small letter beta)
How to conver it ($b) into one-char UTF-8 string??
When we try $u=(string)$b, it gives two-char UTF-8 string.


Example 2:

You have (string)$u UTF-8 one-char string. It consists chr(946) (greek
small letter beta)
Now You have to see two bytes binary representation of this (11001110
10110010).
There is no way to convert it without charset translation...




Reproduce code:
---------------
;;;;;;;;;;;;;;;;;;;;
; Unicode settings ;
;;;;;;;;;;;;;;;;;;;;

unicode.semantics = off
unicode.runtime_encoding = iso-8859-1
unicode.script_encoding = utf-8
unicode.output_encoding = utf-8
unicode.from_error_mode = U_INVALID_SUBSTITUTE
unicode.from_error_subst_char = 3f




Expected result:
----------------
way to converting binary<->string without charset translating



Actual result:
--------------
no way to converting binary<->string without charset translating





------------------------------------------------------------------------


-- 
Edit this bug report at http://bugs.php.net/?id=47033&edit=1

Reply via email to