Hellmut Weber wrote:

> Hi,
> I'm new to this list.
> 
> I'm developing a small program that uses an XML document. So far it has
> been easy going, but when I parse an XML document that contains the euro
> symbol ('€'), I get an error:
> 
> UnicodeEncodeError: 'charmap' codec can't encode character u'\xa4' in
> position 11834: character maps to <undefined>
> 
> the relevant piece of code is:
> 
> from xml.dom.minidom import Document, parse, parseString
> ...
> doc = parse(inFIleName)

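The contents of the file must be encoded in the encoding declared in the
XML header, or in UTF-8 if the header declares none.

From the above I think you have a latin1-based document: u'\xa4' is the
generic currency sign in latin1, while byte 0xA4 is the euro sign in
ISO-8859-15. Does the encoding header match the encoding the file was
actually saved in?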
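
To illustrate the mismatch, here is a minimal sketch (Python 3 syntax, not the
poster's actual code). The sample document, the helper parse_text and the
variable names are made up for illustration. It parses the same bytes twice,
once with a latin1 header and once with an ISO-8859-15 header, and shows that
byte 0xA4 only comes out as the euro sign in the second case.

```python
from xml.dom.minidom import parseString

# Hypothetical sample body: byte 0xA4 is the euro sign in ISO-8859-15,
# but the generic currency sign (U+00A4) in ISO-8859-1 (latin1).
BODY = b"<price>10 \xa4</price>"


def parse_text(xml_bytes):
    """Parse an XML byte string and return the root element's text."""
    doc = parseString(xml_bytes)
    return doc.documentElement.firstChild.data


# Same bytes, two different encoding declarations in the XML header.
as_latin1 = parse_text(b'<?xml version="1.0" encoding="iso-8859-1"?>' + BODY)
as_latin9 = parse_text(b'<?xml version="1.0" encoding="iso-8859-15"?>' + BODY)

# ascii() avoids a second encoding problem when printing to a terminal
# that cannot display the characters.
print(ascii(as_latin1))
print(ascii(as_latin9))
```

If the header says latin1 but the file was saved as ISO-8859-15, the parser
hands back U+00A4 instead of the euro sign, which would match the character
named in the error message.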


> 
> [EMAIL PROTECTED] usexml $ locale
> [EMAIL PROTECTED]
> LC_CTYPE="[EMAIL PROTECTED]"
> LC_NUMERIC="[EMAIL PROTECTED]"
> LC_TIME="[EMAIL PROTECTED]"
> LC_COLLATE="[EMAIL PROTECTED]"
> LC_MONETARY="[EMAIL PROTECTED]"
> LC_MESSAGES="[EMAIL PROTECTED]"
> LC_PAPER="[EMAIL PROTECTED]"
> LC_NAME="[EMAIL PROTECTED]"
> LC_ADDRESS="[EMAIL PROTECTED]"
> LC_TELEPHONE="[EMAIL PROTECTED]"
> LC_MEASUREMENT="[EMAIL PROTECTED]"
> LC_IDENTIFICATION="[EMAIL PROTECTED]"
> [EMAIL PROTECTED]

The locale is irrelevant for parsing: the parser takes the encoding from
the XML header, not from the environment.

Diez