ID: 22108 Comment by: kamor at worldonline dot fr Reported By: bugzilla at jellycan dot com Status: Assigned Bug Type: Feature/Change Request Operating System: Any PHP Version: All (as of the current implementation) Assigned To: moriyoshi New Comment:
Lazy solution for doctype detection under IE: put the doctype declaration on the first line of the main script (before all include() ) And let's hope that the following buggy chars will not interfer with my beautiful pages ;) Previous Comments: ------------------------------------------------------------------------ [2003-07-29 01:15:23] yyasarr at hotmail dot com php really doesn't ignore the utf-8 BOM This is A BUG ! but can be solved easily SOLVE METHOD: 1. Open any simple text editor. Such as NC (Norton Commander). 2. F4 for editing. 3. Delete first 3 (three) or 2 (two) bytes, characters [depends on file type ] file types: ----------- UTF-8: EF BB BF // 3(three) bytes Unicode big endian : FE FF // 2 (two) bytes Unicode: FF FE // 2 (two) bytes 4. Check the result. i am using this method. if u know better one, write it. __________________ Yashar Alekberzade ------------------------------------------------------------------------ [2003-07-19 09:01:50] ipa at assis dot lt Simple Example: I have multilanguage system where my text strings ar written as huge array in one file. It has been saved in UTF-8 with simple Notepad, which allows to save this format. when i include() this file to access text strings, it outputs that three bytes, so IE have displayed visible breakline with that HTML. Result: cannot write multilanguage PHP scripts even with notepad. Have to use iconv() what i dificult, because not every hoster have compiled PHP with it. ------------------------------------------------------------------------ [2003-07-03 15:05:35] jaanus at heeringson dot com One thing i forgot to mention in my previous post is that multiple BOM's also inhibit Internet Explorer 6 from rendering pages in standards-compliance mode, which is a big issue when developing by w3c standards. ------------------------------------------------------------------------ [2003-07-03 14:59:44] jaanus at heeringson dot com There are some consequenses that are not mentioned in the above comments. More than one BOM renders a perfectly valid XHTM file non-valid. Apparently the XHTML specification allows BOM characters, but only ONE, not multiple which is what you get if you include other utf-8 files (include, require). This in turn results in the possibility that the xml declaration in not read since it is not found. This is the case with the w3c validator. ------------------------------------------------------------------------ [2003-06-04 03:11:51] [EMAIL PROTECTED] That script appears to be written in UTF-16. As for UTF-16, it could actually be a parser problem as well, but this report addresses the issue related to UTF-8. ------------------------------------------------------------------------ The remainder of the comments for this report are too long. To view the rest of the comments, please view the bug report online at http://bugs.php.net/22108 -- Edit this bug report at http://bugs.php.net/?id=22108&edit=1