 ID:               47221
 Updated by:       sjo...@php.net
 Reported By:      sgnutzmann at yahoo dot de
-Status:           Open
+Status:           Verified
 Bug Type:         Arrays related
 Operating System: win32 only - Windows XP
 PHP Version:      5.2.8
 New Comment:

I could reproduce this. The code below times the built-in array_diff
against fast_array_diff, a PHP function I wrote, on the same two arrays.

<?php
$a = $b = array();
for ($i = 0; $i < 10000; $i++) {
    $a[] = "s" . ($i * 102121 % 433061);
    $b[] = "s" . ($i * 102121 % 433003);
}

$start = microtime(true);
$res1 = array_diff($a, $b);
echo "Built-in array_diff duration: ".(microtime(true) - $start)."\n";

include('http://www.gissen.nl/files/fast_array_diff.php');

$start = microtime(true);
$res2 = fast_array_diff($a, $b);
echo "Fast_array_diff duration: ".(microtime(true) - $start)."\n";

sort($res1);
sort($res2);
assert($res1 == $res2);
?>

Output:
Built-in array_diff duration: 11.8710849285
Fast_array_diff duration: 0.254959106445


Previous Comments:
------------------------------------------------------------------------

[2009-02-08 13:03:18] j...@php.net

Please don't send me more spam. (never saw that and any direct mails to
me will be deleted anyway)

------------------------------------------------------------------------

[2009-01-27 13:55:35] sgnutzmann at yahoo dot de

I just tried the latest version 'php-5.2-win32-VC6-x86-latest.msi'
(2009-Jan-27 12:00:00). This version has the same problem as PHP 5.2.8
(no return from array_diff() within 5 minutes, infinite loop?).

I sent my test dataset 'TestData.txt' to j...@php.net.

------------------------------------------------------------------------

[2009-01-27 12:09:33] sgnutzmann at yahoo dot de

PHP 5.2.6 has the same problem as PHP 5.2.8

------------------------------------------------------------------------

[2009-01-27 10:32:48] sgnutzmann at yahoo dot de

Complete test script (size of generated test file 5,865 KB)

<?php
$handle = fopen('TestData.txt','rb');

// size of first array
$buffer = fgets($handle, 256);
$buffer = str_replace("\r",'',$buffer);
$buffer = str_replace("\n",'',$buffer);
$count = (int) $buffer;
echo 'Size of first array: '.$count."\r\n";

// elements of first array
$idSales = array();
for ( $i = 0; $i < $count; $i++ )
{
    $buffer = fgets($handle, 256);
    $buffer = str_replace("\r",'',$buffer);
    $buffer = str_replace("\n",'',$buffer);
    $idSales[] = $buffer;
} // for ( $i = 0; $i < $count; $i++ )

// size of second array
$buffer = fgets($handle, 256);
$buffer = str_replace("\r",'',$buffer);
$buffer = str_replace("\n",'',$buffer);
$count = (int) $buffer;
echo 'Size of second array: '.$count."\r\n";

// elements of second array
$idInv = array();
for ( $i = 0; $i < $count; $i++ )
{
    $buffer = fgets($handle, 256);
    $buffer = str_replace("\r",'',$buffer);
    $buffer = str_replace("\n",'',$buffer);
    $idInv[] = $buffer;
} // for ( $i = 0; $i < $count; $i++ )

fclose($handle);

echo "Start of array_diff\r\n";
$unknown = array_diff ( $idSales, $idInv );
echo 'Number of unknown identifier '.count($unknown)."\r\n";
?>

First lines of test file:
76906
#00/1109
#00/1162
#00/1163
#00/1335
#00/1337

Result, if I use PHP 5.2.4:
Size of first array: 76906
Size of second array: 433959
Start of array_diff
Number of unknown identifier 17826

No result from array_diff, if I use PHP 5.2.8 (without any extension)

------------------------------------------------------------------------

[2009-01-26 17:58:51] sgnutzmann at yahoo dot de

Description:
------------
I use the function array_diff() to compare two sorted string-arrays
with numerical keys (array sizes
are 76,906 and 433,959; every array element is a string of fewer than
20 characters). With PHP 5.2.4 the function returns very quickly (in
just a few seconds); with PHP 5.2.8 I killed PHP.exe after 30 minutes(!)
without a result.

PHP.INI:
memory_limit = 1536M
extension=php_pdo.dll
extension=php_zip.dll
extension=php_pdo_odbc.dll

Reproduce code:
---------------
// $Sales and $Inv read previously from file system
$idSales = array();
foreach ( $Sales as $i => $data )
    $idSales[$i] = '#'.$data[2];
array_multisort ($idSales, $Sales);

$idInv = array();
foreach ( $Inv as $i => $data )
    $idInv[$i] = '#'.$data[1];
array_multisort ($idInv, $Inv);

echo "Start array_diff\n";
$unknown = array_diff ( $idSales, $idInv );
echo "End array_diff\n";

Expected result:
----------------
see description

Actual result:
--------------
no result in 30 minutes

------------------------------------------------------------------------

-- 
Edit this bug report at http://bugs.php.net/?id=47221&edit=1
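
The new comment above compares the built-in array_diff with a faster userland function, but the source of fast_array_diff is only referenced by URL and is not shown in this report. As an illustration only, here is a minimal sketch of one common way to compute the same difference with a hash lookup. The name hash_array_diff is hypothetical and this is not claimed to be how fast_array_diff works. It assumes scalar values and, like array_diff, compares them by their string form.

```php
<?php
// Hypothetical helper for illustration; NOT the fast_array_diff() from the report.
// Returns the entries of $a whose string form does not occur in $b,
// keeping the original keys of $a (as array_diff does).
function hash_array_diff(array $a, array $b)
{
    // Build a lookup table keyed by the string form of every value in $b.
    $lookup = array();
    foreach ($b as $value) {
        $lookup[(string) $value] = true;
    }

    // Keep each entry of $a that is absent from the lookup table.
    $result = array();
    foreach ($a as $key => $value) {
        if (!isset($lookup[(string) $value])) {
            $result[$key] = $value;
        }
    }
    return $result;
}

// Small usage example with made-up data.
$a = array("s1", "s2", "s3", "s2");
$b = array("s2", "s4");
print_r(hash_array_diff($a, $b));
```

Each value of $b is stored once and each value of $a is checked once, so the work grows roughly with the combined size of the two arrays rather than depending on any sorting step. Whether this matches the built-in function for non-scalar values has not been checked here.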