Edit report at http://bugs.php.net/bug.php?id=42290&edit=1
ID: 42290
Comment by: bubalula at gmail dot com
Reported by: arysin at gmail dot com
Summary: mb_eregi_replace() is not case-insensitive with multibyte pattern
Status: No Feedback
Type: Bug
Package: mbstring related
Operating System: *
PHP Version: 5.2CVS-2007-08-14
Assigned To: hirokawa
Block user comment: N
New Comment:
I have the same problem in version 5.2.12.
I don't know why this bug isn't taken seriously, as it creates big
problems for those of us working with non-Latin languages.
Previous Comments:
------------------------------------------------------------------------
[2009-09-30 13:12:10] babson at gmail dot com
I am using PHP version 5.2.9 and have the same problem.
I tried the sample by arysin and got the same result as he did.
What can be done?
------------------------------------------------------------------------
[2009-04-15 16:04:55] rvorojbit at gmail dot com
I am also having exactly the same problem that was described in the
previous post last year! Is there any workaround for this bug? I
couldn't find any on Google.
------------------------------------------------------------------------
[2008-05-03 07:38:12] admin at bg-history dot info
I got the same problem with UTF-8 encoding, using Cyrillic.
While trying to implement search highlighting, neither the
eregi_replace() nor the str_ireplace() function matched the capital
letter.
For example:
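$str = "общи";
$newstr = "Общи";
// Lowercase search term: the capitalised word in $bodytext is not matched.
$bodytext = str_ireplace($str,
    "<span style=\"color: #FF0000\">" . $str . "</span>", $bodytext);
// Capitalised search term: identical bytes, so the word is highlighted.
$bodytext2 = str_ireplace($newstr,
    "<span style=\"color: #FF0000\">" . $newstr . "</span>", $bodytext);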
$bodytext contains the word "Общи". Although I used a case-insensitive
replace, the word is highlighted only in $bodytext2.
I've searched a lot for a solution to this problem and found none.
P.S. Sorry for my English, hope it's understandable.
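For the highlighting case above, one possible workaround is to use the
PCRE functions instead. This is only a sketch, not something confirmed
in this report: it assumes the text is UTF-8, that PCRE was built with
Unicode support, and a PHP version with scalar type hints (7.0 or
later). The function name highlight_ci() and the sample text are
hypothetical.

```php
<?php
// Hypothetical helper (not from the report): wrap every case-insensitive
// occurrence of $needle in $haystack in a highlighting <span>.
function highlight_ci(string $haystack, string $needle): string
{
    // preg_quote() escapes regex metacharacters in the search term.
    // "i" = case-insensitive, "u" = treat pattern and subject as UTF-8.
    $pattern = '/' . preg_quote($needle, '/') . '/iu';

    // $0 re-inserts the matched text, so its original capitalisation is kept.
    return preg_replace($pattern, '<span style="color: #FF0000">$0</span>', $haystack);
}

$bodytext = 'Общи условия'; // sample text, assumed to be UTF-8
echo highlight_ci($bodytext, 'общи'), "\n";
```

Unlike the str_ireplace() calls above, this re-inserts the matched text
rather than the search term, so "Общи" keeps its capital letter in the
output.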
------------------------------------------------------------------------
[2007-09-12 01:00:00] php-bugs at lists dot php dot net
No feedback was provided for this bug for over a week, so it is
being suspended automatically. If you are able to provide the
information that was originally requested, please do so and change
the status of the bug back to "Open".
------------------------------------------------------------------------
[2007-08-21 15:46:44] [email protected]
arysin,
What encoding are you using?
For ISO-8859-1 and UTF-8 (code point U+008A), 0x8a is assigned to the
Line Tabulation Set control character, not to a letter.
cf.: http://en.wikipedia.org/wiki/ISO_8859-1
http://en.wikipedia.org/wiki/UTF-8
As I understand it, 0x8a should not be interpreted as the
uppercase form of 0x9a in ISO-8859-1/UTF-8.
If you are using CP1252 (Windows-1252), that behaviour is
understandable, but CP1252 is not yet supported by the Oniguruma
library (the multibyte regex engine of mbstring).
http://en.wikipedia.org/wiki/Windows-1252
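To see how the two bytes discussed above differ between encodings, the
following sketch converts them to UTF-8 with mb_convert_encoding() and
checks whether they form a case pair. It assumes an mbstring build that
knows the 'Windows-1252' encoding name; the variable names are
illustrative only.

```php
<?php
// The two bytes from the discussion, interpreted under both encodings.
$upper = "\x8a";
$lower = "\x9a";

// Read as ISO-8859-1, 0x8a maps to U+008A (a C1 control character).
$latin1Upper = mb_convert_encoding($upper, 'UTF-8', 'ISO-8859-1');
// Read as Windows-1252, 0x8a and 0x9a map to U+0160 and U+0161.
$cp1252Upper = mb_convert_encoding($upper, 'UTF-8', 'Windows-1252');
$cp1252Lower = mb_convert_encoding($lower, 'UTF-8', 'Windows-1252');

// bin2hex() shows the resulting UTF-8 byte sequences.
echo bin2hex($latin1Upper), "\n";
echo bin2hex($cp1252Upper), "\n";
echo bin2hex($cp1252Lower), "\n";

// Only the Windows-1252 reading yields an upper/lower case pair.
var_dump(mb_strtolower($cp1252Upper, 'UTF-8') === $cp1252Lower);
var_dump(mb_strtolower($latin1Upper, 'UTF-8') === $latin1Upper);
```

If the data really is CP1252, converting it to UTF-8 first and running
the case-insensitive match on the UTF-8 text may sidestep the missing
CP1252 support, though that is not verified in this report.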
------------------------------------------------------------------------
The remainder of the comments for this report are too long. To view
the rest of the comments, please view the bug report online at
http://bugs.php.net/bug.php?id=42290