From:             soapergem at gmail dot com
Operating system: Windows XP
PHP version:      5.3.0
PHP Bug Type:     Filesystem function related
Bug description:  fgets reads the UTF-8 Byte Order Mark literally

Description:
------------
When text files are saved with UTF-8 encoding, a few characters are saved
at the front called the "Byte Order Mark" (read more about it on
Wikipedia). They are supposed to remain hidden and just be used as
meta-data to indicate that the file is saved with UTF-8 formatting. Their
hex values are EF BB BF, which is represented in ASCII by "".

The trouble is that when you read in a UTF-8 text file with either fgets
or fgetcsv, PHP misinterprets the BOM as literal text and includes it with
all the other text.

Reproduce code:
---------------
<?php

if ( $fp = fopen('ut8_text_file.txt') ) {

    echo fgets($fp);
    fclose($fp);

}

?>

Expected result:
----------------
Whatever text is saved on the first line of the text file.

Actual result:
--------------
Whatever text is saved on the first line of the text file.

-- 
Edit bug report at http://bugs.php.net/?id=49350&edit=1
-- 
Try a snapshot (PHP 5.2):            
http://bugs.php.net/fix.php?id=49350&r=trysnapshot52
Try a snapshot (PHP 5.3):            
http://bugs.php.net/fix.php?id=49350&r=trysnapshot53
Try a snapshot (PHP 6.0):            
http://bugs.php.net/fix.php?id=49350&r=trysnapshot60
Fixed in SVN:                        
http://bugs.php.net/fix.php?id=49350&r=fixed
Fixed in SVN and need be documented: 
http://bugs.php.net/fix.php?id=49350&r=needdocs
Fixed in release:                    
http://bugs.php.net/fix.php?id=49350&r=alreadyfixed
Need backtrace:                      
http://bugs.php.net/fix.php?id=49350&r=needtrace
Need Reproduce Script:               
http://bugs.php.net/fix.php?id=49350&r=needscript
Try newer version:                   
http://bugs.php.net/fix.php?id=49350&r=oldversion
Not developer issue:                 
http://bugs.php.net/fix.php?id=49350&r=support
Expected behavior:                   
http://bugs.php.net/fix.php?id=49350&r=notwrong
Not enough info:                     
http://bugs.php.net/fix.php?id=49350&r=notenoughinfo
Submitted twice:                     
http://bugs.php.net/fix.php?id=49350&r=submittedtwice
register_globals:                    
http://bugs.php.net/fix.php?id=49350&r=globals
PHP 4 support discontinued:          http://bugs.php.net/fix.php?id=49350&r=php4
Daylight Savings:                    http://bugs.php.net/fix.php?id=49350&r=dst
IIS Stability:                       
http://bugs.php.net/fix.php?id=49350&r=isapi
Install GNU Sed:                     
http://bugs.php.net/fix.php?id=49350&r=gnused
Floating point limitations:          
http://bugs.php.net/fix.php?id=49350&r=float
No Zend Extensions:                  
http://bugs.php.net/fix.php?id=49350&r=nozend
MySQL Configuration Error:           
http://bugs.php.net/fix.php?id=49350&r=mysqlcfg

Reply via email to