ID:               47076
 User updated by:  lunter at interia dot pl
 Reported By:      lunter at interia dot pl
 Status:           Open
 Bug Type:         Unicode Engine related
 Operating System: all
 PHP Version:      6CVS-2009-01-12 (CVS)
 New Comment:

There is no way to calculate base64, sha1 from unicode string
(unicode.script_encoding = UTF-8 [or UTF-16LE,UTF-16BE]) starting from
$unicode=chr(946)

because we don't have method to convert mutli-byte character sets to
it's binary representation 

$unicode=chr(946)

When stript encoding UTF-8, chr(946)
base64($unicode) is zrI=
sha1($unicode) is 25b9b2c8a851851c7e0f1cff29a93a6aa6895f34

When stript encoding UTF-16LE, chr(946)
base64($unicode) is sgM=
sha1($unicode) is e84c936ce61a692fcc5a402b3b9b733592ba0b67

When stript encoding UTF-16BE, chr(946)
base64($unicode) is A7I=
sha1($unicode) is 2403f70ce33aeec4e21a519ffebb2864afc89fda


Previous Comments:
------------------------------------------------------------------------

[2009-01-12 13:39:02] lunter at interia dot pl

USE OLD PHP 5.x

// ---

valids values of UTF-16LE char(946) base64 / sha1

<?
 print('UTF-16LE char(946):<br>');
 print('base64: '.base64_encode(chr(178).chr(3)).'<br>');
 print('sha1: '.sha1(chr(178).chr(3)).'<br>');
?>

// ---

valids values of UTF-16BE char(946) base64 / sha1

<?
 print('UTF-16BE char(946):<br>');
 print('base64: '.base64_encode(chr(3).chr(178)).'<br>');
 print('sha1: '.sha1(chr(3).chr(178)).'<br>');
?>

------------------------------------------------------------------------

[2009-01-12 13:25:01] lunter at interia dot pl

USE OLD PHP 5.x

valids values of UTF-8 char(946) base64 / sha1

<?
 print('UTF-8 char(946):<br>');
 print('base64: '.base64_encode(chr(206).chr(178)).'<br>');
 print('sha1: '.sha1(chr(206).chr(178)).'<br>');
?>

------------------------------------------------------------------------

[2009-01-12 12:54:41] lunter at interia dot pl

All examples above in utf-8
Imagine that using utf-16, sha1 and base64 will be not the same.

------------------------------------------------------------------------

[2009-01-12 12:50:23] lunter at interia dot pl

Please imagine that unicode chr(946) in binary have two bytes [11001110
10110010].

------------------------------------------------------------------------

[2009-01-12 12:45:36] lunter at interia dot pl

Example 4:

<?
 print('You have to calculate sha1 of unicode chr(946)<br>');
 print('regular sha1 of unicode chr(946) (\uceb2) is:
25b9b2c8a851851c7e0f1cff29a93a6aa6895f34'.'<br><br>');

 $unicode=chr(946);

 print(sha1((binary)$unicode));
 print('<br>');
 print(sha1(unicode_encode($unicode,'iso-8859-1')));
 print('<br>');

// print(sha1(uni2bin($unicode))); //
25b9b2c8a851851c7e0f1cff29a93a6aa6895f34
?>

------------------------------------------------------------------------

The remainder of the comments for this report are too long. To view
the rest of the comments, please view the bug report online at
    http://bugs.php.net/47076

-- 
Edit this bug report at http://bugs.php.net/?id=47076&edit=1

Reply via email to