ID:               42290
Comment by:       babson at gmail dot com
Reported By:      arysin at gmail dot com
Status:           No Feedback
Bug Type:         mbstring related
Operating System: *
PHP Version:      5.2CVS-2007-08-14
Assigned To:      hirokawa
New Comment:

I am using PHP version 5.2.9 and have the same problem. I tried the sample by arysin and got the same result as he did. What can be done?

Previous Comments:
------------------------------------------------------------------------

[2009-04-15 16:04:55] rvorojbit at gmail dot com

I am having exactly the same problem that was described in the previous post last year. Is there any workaround for this bug? I didn't find any on Google.

------------------------------------------------------------------------

[2008-05-03 07:38:12] admin at bg-history dot info

I have the same problem with UTF-8 encoding, using Cyrillic. While trying to implement "search highlight", neither "eregi_replace" nor "str_ireplace" matched the capital letter. For example:

$str="общи";
$newstr="Общи";
$bodytext = str_ireplace($str, "<span style=\"color: #FF0000\">".$str."</span>", $bodytext);
$bodytext2 = str_ireplace($newstr, "<span style=\"color: #FF0000\">".$newstr."</span>", $bodytext);

$bodytext contains the word "Общи". Although I used a case-insensitive replace, the word is highlighted only in $bodytext2.

I've searched a lot for a solution to this problem and found none.

P.S. Sorry for my English, hope it's understandable.

------------------------------------------------------------------------

[2007-09-12 01:00:00] php-bugs at lists dot php dot net

No feedback was provided for this bug for over a week, so it is being suspended automatically. If you are able to provide the information that was originally requested, please do so and change the status of the bug back to "Open".

------------------------------------------------------------------------

[2007-08-21 15:46:44] hirok...@php.net

arysin,

What kind of encoding are you using?
For UTF-8 and ISO-8859-1, 0x8a is assigned to Line Tab.

c.f.:
http://en.wikipedia.org/wiki/ISO_8859-1
http://en.wikipedia.org/wiki/UTF-8

In my understanding, 0x8a shouldn't be interpreted as the upper-case letter of 0x9a for ISO-8859-1/UTF-8.
If you are using CP1252 (Windows-1252), it is understandable, but CP1252 is not supported yet in the Oniguruma library (multibyte regex engine of mbstring).

http://en.wikipedia.org/wiki/Windows-1252

------------------------------------------------------------------------

[2007-08-19 20:05:04] j...@php.net

I'm using the bundled PCRE library. I don't remember what the version is.

------------------------------------------------------------------------

The remainder of the comments for this report are too long. To view the rest of the comments, please view the bug report online at
    http://bugs.php.net/42290
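
Regarding the workaround asked about in the 2008 and 2009 comments: one possible approach for the Cyrillic highlighting case, offered only as a sketch and not as a fix for the mbstring issue itself, is to use preg_replace() with the "i" and "u" pattern modifiers instead of str_ireplace(). The helper name highlight_utf8() is hypothetical, and the sketch assumes that both the script file and the text are valid UTF-8 and that PCRE was built with Unicode support.

```php
<?php
// Hypothetical helper (not part of PHP): wraps every case-insensitive
// occurrence of $needle in $haystack with a highlighting <span>.
// Assumption: $needle and $haystack are valid UTF-8 strings.
function highlight_utf8($needle, $haystack)
{
    // preg_quote() escapes regex metacharacters in the search term.
    // "u" makes PCRE treat pattern and subject as UTF-8; combined with
    // "i", matching is then case-insensitive for non-ASCII letters too.
    $pattern = '/' . preg_quote($needle, '/') . '/iu';

    // $0 re-inserts the text exactly as it was matched, so the
    // original capitalisation in the body text is preserved.
    $result = preg_replace($pattern, '<span style="color: #FF0000">$0</span>', $haystack);

    // preg_replace() returns null on error (e.g. invalid UTF-8 input);
    // in that case fall back to the unmodified text.
    return $result === null ? $haystack : $result;
}

$bodytext = 'Общи условия';
echo highlight_utf8('общи', $bodytext), "\n";
// prints: <span style="color: #FF0000">Общи</span> условия
```

Unlike the str_ireplace() calls in the earlier comment, this inserts the matched text ($0) and not the search term, so "Общи" keeps its capital letter in the highlighted output.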