From: arturm at union dot com dot pl
Operating system: Windows
PHP version: 5.1.6
PHP Bug Type: DOM XML related
Bug description: Wrong charset used in loadHTML()

Description:
------------
If you load HTML using DOMDocument::loadHTML(), the wrong charset is used for any non-US-ASCII characters that appear in the source before the charset declaration in the meta tag.

Reproduce code:
---------------
<?php
header("Content-type: text/plain; charset=UTF-8");
$doc = new DOMDocument();
$doc->loadHTML('<title>ą</title>'
    . '<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">'
    . '<p>ąęółść</p>');
echo $doc->encoding;
echo $doc->textContent;
?>

Expected result:
----------------
UTF-8ąęółść

Actual result:
--------------
UTF-8ąąęółść

-- 
Edit bug report at http://bugs.php.net/?id=39269&edit=1
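
A possible workaround, sketched on the assumption that only text preceding the charset declaration is misread: move the meta tag ahead of any non-US-ASCII characters. The variable names ($html, $title, $para) are illustrative and not part of the original report, and the script file itself is assumed to be saved as UTF-8.

```php
<?php
// Workaround sketch (an assumption, not a confirmed fix): declare the
// charset first, so the parser already knows the encoding when it
// reaches the non-ASCII text in <title> and <p>.
$html = '<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">'
      . '<title>ą</title>'
      . '<p>ąęółść</p>';

$doc = new DOMDocument();
$doc->loadHTML($html);

// Read each element separately so the two strings can be checked
// independently of one another.
$title = $doc->getElementsByTagName('title')->item(0)->textContent;
$para  = $doc->getElementsByTagName('p')->item(0)->textContent;

echo $title, "\n", $para, "\n";
```

If both strings come back unchanged with this ordering, that would support the description above: the declared charset is honoured only from the meta tag onwards.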