ID:               43347
User updated by:  pschmandra at hotmail dot com
Reported By:      pschmandra at hotmail dot com
Status:           Open
Bug Type:         XML related
Operating System: Red Hat Linux Enterprise 5
PHP Version:      5.1.6
New Comment:

List of Traditional Chinese RSS Feeds provided by the BBC that make
xml_parse() function error out.

newsrss.bbc.co.uk/rss/chinese/trad/news/rss.xml
newsrss.bbc.co.uk/rss/chinese/trad/world/rss.xml
newsrss.bbc.co.uk/rss/chinese/trad/china_news/rss.xml
newsrss.bbc.co.uk/rss/chinese/trad/taiwan_hk/rss.xml
newsrss.bbc.co.uk/rss/chinese/trad/uk/rss.xml
newsrss.bbc.co.uk/rss/chinese/trad/learn_english/rss.xml
newsrss.bbc.co.uk/rss/chinese/trad/business/rss.xml
newsrss.bbc.co.uk/rss/chinese/trad/sci/tech/rss.xml
newsrss.bbc.co.uk/rss/chinese/trad/press/rss.xml


Previous Comments:
------------------------------------------------------------------------

[2007-11-21 14:54:59] pschmandra at hotmail dot com

Sorry, every time I try to send the link to the BBC feeds or I put in
XML I get "Please do not SPAM our bug system".

------------------------------------------------------------------------

[2007-11-21 06:00:10] [EMAIL PROTECTED]

Please show an example of your XML

------------------------------------------------------------------------

[2007-11-20 18:49:11] pschmandra at hotmail dot com

All supported encodings listed below cause the xml_parse() function to
error out parsing any Chinese Traditional RSS Feeds published by the BBC
using PHP5.1.6 with a UTF-8 default_charset.

xml_parser_create()
xml_parser_create('')
xml_parser_create('UTF-8')
xml_parser_create('ISO-8859-1')
xml_parser_create('US-ASCII')

------------------------------------------------------------------------

[2007-11-20 17:56:22] [EMAIL PROTECTED]

Please also read
http://php.net/manual/en/function.xml-parser-create.php carefully,
especially:

***
The optional encoding specifies the character encoding for the
input/output in PHP 4. Starting from PHP 5, the input encoding is
automatically detected, so that the encoding parameter specifies only
the output encoding. In PHP 4, the default output encoding is the same
as the input charset.

If empty string is passed, the parser attempts to identify which
encoding the document is encoded in by looking at the heading 3 or 4
bytes. In PHP 5.0.0 and 5.0.1, the default output charset is
ISO-8859-1, while in PHP 5.0.2 and upper is UTF-8. The supported
encodings are ISO-8859-1, UTF-8 and US-ASCII.
***

------------------------------------------------------------------------

[2007-11-20 16:49:39] [EMAIL PROTECTED]

What was the full configure line used to configure PHP in this case?

And I don't see any crash there, just a normal error for passing data
with encoding which isn't supported by the xml library in use..

------------------------------------------------------------------------

The remainder of the comments for this report are too long. To view
the rest of the comments, please view the bug report online at
    http://bugs.php.net/43347

-- 
Edit this bug report at http://bugs.php.net/?id=43347&edit=1