Edit report at https://bugs.php.net/bug.php?id=60380&edit=1
ID: 60380 Comment by: ni...@php.net Reported by: evrinoma at gmail dot com Summary: preg_split and old version split Status: Open Type: Bug Package: *Programming Data Structures Operating System: Linux CenOS5 CentOS6 PHP Version: 5.3.8 Block user comment: N Private report: N New Comment: You are dealing with two levels of unescaping here. The first is the usual for string, the second is additionally done by PCRE. E.g. if you write "\\\\" the actually resulting string will just be "\\" (because \\ is unescaped to \). This is then passed to PCRE, which itself again does unescaping, so the \\ is turned into \. That's why you need four backslashes to produce one literal backslash in a regular expression. In your particular case three work too, but only because the following character does not form an escape sequence. Previous Comments: ------------------------------------------------------------------------ [2012-08-30 19:24:38] cjbf at cam dot ac dot uk Three \s are needed to match one \. Could that at least be documented if it's not wrong? $text='AA\\nB and B\r';// contains \n and \r print_r(preg_split('/\\\n|\\\r/', $text)); gives Array ( [0] => AA [1] => B and B [2] => ) ------------------------------------------------------------------------ [2012-08-30 19:02:43] cjbf at cam dot ac dot uk Put simply, preg_split treats "\\" wrongly. It ought to search for an actual "\" but never matches it. ------------------------------------------------------------------------ [2011-11-25 07:15:00] evrinoma at gmail dot com Description: ------------ --splite------- array[0]=0123456789-- array[1]=qwertyuiop-- array[2]=9876543210-- array[3]=poiuytrewq-- array[4]=0123456789-- array[5]=asdfghjkl-- --preg_splite-- array[0]=0123456789\qwertyuiop-- <<<<<<<<< ??? array[1]=9876543210-- array[2]=poiuytrewq-- array[3]=0123456789-- array[4]=asdfghjkl-- array[5]=-- --------------- Test script: --------------- <HTML> <HEAD> <TITLE> BUG </TITLE> </HEAD> <BODY> <? printf(" --splite------- <BR>\n"); $date="0123456789\\qwertyuiop 9876543210.poiuytrewq:0123456789 asdfghjkl"; $var = split( "[-\\. :]",$date ); printf(" array[0]=%s-- <BR>\n",$var[0]); printf(" array[1]=%s-- <BR>\n",$var[1]); printf(" array[2]=%s-- <BR>\n",$var[2]); printf(" array[3]=%s-- <BR>\n",$var[3]); printf(" array[4]=%s-- <BR>\n",$var[4]); printf(" array[5]=%s-- <BR>\n",$var[5]); printf(" --preg_splite-- <BR>\n"); $var = preg_split( "#[-\\. :]#",$date); printf(" array[0]=%s-- <BR>\n",$var[0]); printf(" array[1]=%s-- <BR>\n",$var[1]); printf(" array[2]=%s-- <BR>\n",$var[2]); printf(" array[3]=%s-- <BR>\n",$var[3]); printf(" array[4]=%s-- <BR>\n",$var[4]); printf(" array[5]=%s-- <BR>\n",$var[5]); printf(" --------------- <BR>\n"); ?> </BODY> </HTML> ------------------------------------------------------------------------ -- Edit this bug report at https://bugs.php.net/bug.php?id=60380&edit=1