Edit report at http://bugs.php.net/bug.php?id=42290&edit=1

 ID:                 42290
 Comment by:         bubalula at gmail dot com
 Reported by:        arysin at gmail dot com
 Summary:            mb_eregi_replace() is not case-insensitive with
                     multibyte pattern
 Status:             No Feedback
 Type:               Bug
 Package:            mbstring related
 Operating System:   *
 PHP Version:        5.2CVS-2007-08-14
 Assigned To:        hirokawa
 Block user comment: N

 New Comment:

I have the same problem in version 5.2.12.

I don't know why this bug isn't taken seriously as it creates big
problems for us working with non latin languages.


Previous Comments:
------------------------------------------------------------------------
[2009-09-30 13:12:10] babson at gmail dot com

I am using PHP version 5.2.9 and have the same problem.

I tried sample by arysin and got the same result as he did.



What can be done?

------------------------------------------------------------------------
[2009-04-15 16:04:55] rvorojbit at gmail dot com

I am also having the exact same problem now as was described in the
previous post last year!!! Is there any workaround for this bug? I
didn't find any in google...

------------------------------------------------------------------------
[2008-05-03 07:38:12] admin at bg-history dot info

I got the same problem with UTF-8 encoding, using Cyrillic.



While trying to make "search highlight" neither "eregi_replace", nor
"str-ireplace" functions actually "got" the capital letter...



for example:



$str="общи";



$newstr="Общи";



$bodytext = str_ireplace($str, "<span style=\"color:
#FF0000\">".$str."</span>", $bodytext);



$bodytext2 = str_ireplace($newstr, "<span style=\"color:
#FF0000\">".$newstr."</span>", $bodytext);



in $bodytext there is a word "&#1054;&#1073;&#1097;&#1080;". Although I
used case insensitive replace, only in $bodytext2 the word is
highlighted.



I've searched a lot for an issue, that solves that problem, and found
none. 



P.S. Sorry for my English, hope it's understandable.

------------------------------------------------------------------------
[2007-09-12 01:00:00] php-bugs at lists dot php dot net

No feedback was provided for this bug for over a week, so it is
being suspended automatically. If you are able to provide the
information that was originally requested, please do so and change
the status of the bug back to "Open".

------------------------------------------------------------------------
[2007-08-21 15:46:44] hirok...@php.net

arysin,



What kind of encoding you are using ?



For UTF-8 and ISO-8859-1, 0x8a is assigned to Line Tab.



  c.f.: http://en.wikipedia.org/wiki/ISO_8859-1

       http://en.wikipedia.org/wiki/UTF-8



In my understanding, 0x8a shouldn't be interpreted as

upper letter of 0x9a for ISO-8859-1/UTF-8.



If you are using CP1252 (Windows-1252), it is understandable,

but, CP1252 is not supported yet in the Oniguruma library

(multibyte regex engine of mbstring).

http://en.wikipedia.org/wiki/Windows-1252





------------------------------------------------------------------------


The remainder of the comments for this report are too long. To view
the rest of the comments, please view the bug report online at

    http://bugs.php.net/bug.php?id=42290


-- 
Edit this bug report at http://bugs.php.net/bug.php?id=42290&edit=1

Reply via email to