On 25.07.12 08:09, Ulrich Eckhardt wrote:

Am 24.07.2012 17:01, schrieb cpppw...@gmail.com:
     reader = codecs.getreader(encoding)
     lines  =  []
     with open(filename, 'rb') as f:
         lines  = reader(f, 'strict').readlines(keepends=False)

where encoding == 'utf-16-be'
Everything works fine, except that lines[0] is equal to
codecs.BOM_UTF16_BE
Is this behaviour correct, that the BOM is still present?

Yes, assuming the first line only contains that BOM. Technically it's a
space character, and why should those be removed?

If the first "character" in the file is a BOM the file encoding is probably not utf-16-be but utf-16.

Servus,
   Walter

--
http://mail.python.org/mailman/listinfo/python-list

Reply via email to