ID: 16069
Updated by: [EMAIL PROTECTED]
Reported By: [EMAIL PROTECTED]
Status: Analyzed
Bug Type: ICONV related
Operating System: win32, Linux
PHP Version: 4.1.2
New Comment:
Yes, I know I am an exception :) and most PHP users don't have to care
about this problem. Besides, it seems that glibc's iconv cannot handle
CP932 characters properly.
Previous Comments:
------------------------------------------------------------------------
[2002-03-14 20:56:46] [EMAIL PROTECTED]
I fixed it a while ago for systems that support iconv in libc.
(Recent Linux/glibc is one of them.)
For systems that use libiconv, the problem still exists.
(I didn't fix the problem with libiconv, since I don't use libiconv ;)
------------------------------------------------------------------------
[2002-03-14 09:40:57] [EMAIL PROTECTED]
Conversion between CP932 (a variant of the Shift_JIS charset) and any
Japanese charset other than CP932 unexpectedly fails when
transliteration mode is specified on the output encoding, as in
"EUC-JP//TRANSLIT", and the transliteration requires a larger buffer
than strlen(input_buf) * sizeof(ucs4_t).
Test script:
<?php
for ($i = 0; $i < 20; ++$i) {
    print $i . ":" . iconv("EUC-JP", "Shift_JIS",
        iconv("CP932", "EUC-JP//TRANSLIT",
            "abcd" . str_repeat("****", $i))) . "<BR>";
}
for ($i = 0; $i < 20; ++$i) {
    print $i . ":" . iconv("EUC-JP", "Shift_JIS",
        iconv("CP932", "EUC-JP//TRANSLIT",
            "abcd" . str_repeat("++++", $i))) . "<BR>";
}
?>
where "****" stands for ONE character described as "SQUARE MIRIBAARU"
(0x876D) and "++++" stands for ONE character described as "SQUARE AARU"
(0x8765) in
http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP932.TXT
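
The failure described above comes from sizing the output buffer once, up front. A minimal sketch in C of the usual alternative follows: let the buffer grow whenever iconv() reports E2BIG. This is not the code in PHP's iconv extension and not the fix that was committed. The helper name convert_growing and the doubling strategy are assumptions made for illustration, and the sketch assumes a POSIX iconv(3) such as the one in glibc.

```c
#include <errno.h>
#include <iconv.h>
#include <stdlib.h>

/* Hypothetical helper (not PHP's actual code): convert in_len bytes of `in`
 * from charset `from` to charset `to`. The output buffer starts small and is
 * doubled each time iconv() fails with E2BIG. Returns a malloc'd,
 * NUL-terminated buffer (caller frees) and stores its length in *out_len,
 * or returns NULL on any other error. */
char *convert_growing(const char *to, const char *from,
                      const char *in, size_t in_len, size_t *out_len)
{
    iconv_t cd = iconv_open(to, from);
    if (cd == (iconv_t)-1)
        return NULL;

    size_t cap = in_len + 1;      /* deliberately small first guess */
    char *out = malloc(cap);
    if (out == NULL) {
        iconv_close(cd);
        return NULL;
    }

    char *src = (char *)in;       /* iconv() takes a non-const pointer */
    size_t src_left = in_len;
    size_t used = 0;
    int flushing = 0;             /* 0: converting input, 1: flushing state */

    for (;;) {
        char *dst = out + used;
        size_t dst_left = cap - used - 1;   /* keep one byte for the NUL */
        size_t rc = flushing
            ? iconv(cd, NULL, NULL, &dst, &dst_left)
            : iconv(cd, &src, &src_left, &dst, &dst_left);
        used = (size_t)(dst - out);         /* keep whatever was written */

        if (rc != (size_t)-1) {
            if (flushing)
                break;                      /* input and shift state done */
            flushing = 1;
            continue;
        }
        if (errno != E2BIG) {               /* EILSEQ, EINVAL, ... */
            free(out);
            iconv_close(cd);
            return NULL;
        }
        cap *= 2;                           /* out of room: grow and retry */
        char *tmp = realloc(out, cap);
        if (tmp == NULL) {
            free(out);
            iconv_close(cd);
            return NULL;
        }
        out = tmp;
    }

    out[used] = '\0';
    *out_len = used;
    iconv_close(cd);
    return out;
}
```

With a loop like this, the initial size is only a guess, so a transliteration that expands the text costs an extra realloc() instead of a failed conversion. Whether a given iconv implementation accepts "CP932" or "//TRANSLIT" is a separate question that this sketch does not address.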
------------------------------------------------------------------------