[issue9241] SAXParseError on unicode (Japanese) file

2010-07-13 Thread Amaury Forgeot d'Arc
Amaury Forgeot d'Arc added the comment: Your file contains the byte \x1a == EOF. You should not open it in text mode, but in binary mode, otherwise it's truncated. import xml.sax xml.sax.parse(open("ff1a.xml", 'rb'), xml.sax.ContentHandler()) works on all versions I tried. -- nosy: +

[issue9241] SAXParseError on unicode (Japanese) file

2010-07-13 Thread Gianfranco
New submission from Gianfranco : When parsing a UTF-16 little-endian encoded XML file containing some japanese characters, the xml.sax.parse function raises a SAXParseException exception saying "no element found". Problem arises with/on: Python 2.5.2/Windows XP Pro SP3 32 bit Python 2.6.4/Wind