ID: 48147 Updated by: lbarn...@php.net Reported By: kulakov74 at yandex dot ru -Status: Bogus +Status: Verified Bug Type: ICONV related Operating System: Linux PHP Version: 5.2.6
Previous Comments: ------------------------------------------------------------------------ [2009-05-06 18:18:07] kulakov74 at yandex dot ru No. The fact the script displays the notice "iconv(): Detected an illegal character ..." in both cases is not related to the fact whether the option is implemented: this is controlled by error_reporting(E_ALL). The option IGNORE only controls whether iconv will stop at the character or not. Also, the length of the resulting string is different (greater) with IGNORE, and while without it the string ends at exactly where the illegal character is, with IGNORE it ends at a random point where no such characters occur. Also, I did not mention - this is not the only file I converted, many others were converted correctly with the option, and their length only decreased a little. But there were 2 files which were truncated, 1 of them (the smaller) is used for the test case. Can you run the test with the latest PHP releases? Actually this is why I reported the bug. I tried it on other servers with PHP 4.3.3, 5.1.4, 5.1.6, 5.2.4 and 5.2.6 and yep! - I finally found one with 5.2.9 (built Feb 27 2009) and it displayed the same results everywhere. I repeat, the TRANSLIT option works fine, while it does the same and even more. ------------------------------------------------------------------------ [2009-05-06 14:38:39] j...@php.net It just means you're using glibc iconv implementation which does not have the IGNORE parameter implemented. ------------------------------------------------------------------------ [2009-05-06 05:13:10] kulakov74 at yandex dot ru Here goes the script. I'm not sure about the limit on external resources - I have the file to convert, so it is downloaded. <?php error_reporting(E_ALL); $Body1=file_get_contents("http://www.oppcharts.com/iconv.html"); echo(strlen($Body1)."\n"); $Body2=iconv('UTF-8', 'ISO-8859-1', $Body1); echo(strlen($Body2)."\n"); $Body2=iconv('UTF-8', 'ISO-8859-1//IGNORE', $Body1); echo(strlen($Body2)."\n"); ?> ------------------------------------------------------------------------ [2009-05-04 14:52:10] kulakov74 at yandex dot ru Description: ------------ iconv() without //IGNORE as known cuts the string at the first illegal character, but with //IGNORE it should not. Still, I get a truncated text, but not at the point where the character is. Sorry the actual PHP version is 5.2.6, but I cannot upgrade it. Just to let you know. Can you test that with the last version? Please download the file from http://www.oppcharts.com/iconv.html Reproduce code: --------------- $Body1=... //read the file echo(strlen($Body1)."\n"); $Body2=iconv('UTF-8', 'ISO-8859-1', $Body1); echo(strlen($Body2)."\n"); $Body2=iconv('UTF-8', 'ISO-8859-1//IGNORE', $Body1); echo(strlen($Body2)."\n"); Expected result: ---------------- 15323 Notice: iconv(): Detected an illegal character in input string in /home/doldon/html/tdnam/dev.php on line 18 3588 ----------------------------------- 15323 15321 - I can get this if I use //TRANSLIT or when I run the test on my home Windows PHP 4 Actual result: -------------- 15323 Notice: iconv(): Detected an illegal character in input string in /home/doldon/html/tdnam/dev.php on line 18 3588 ----------------------------------- 15323 Notice: iconv(): Detected an illegal character in input string in /home/doldon/html/tdnam/dev.php on line 18 8157 - THIS IS THE PROBLEM ------------------------------------------------------------------------ -- Edit this bug report at http://bugs.php.net/?id=48147&edit=1