Re: [Zope3-dev] zope.tal.xmlparser.XMLParser() dislikes unicode

Bernd Dorn Sun, 14 Jan 2007 01:48:28 -0800


On 13.01.2007, at 18:49, Andreas Jung wrote:

Hi,

the XMLParser.parseString() method raises an exception:

  File "/opt/python-2.4.4/lib/python2.4/unittest.py", line 260, in run
    testMethod()
  File "/Users/ajung_data/sandboxes/Zope/Zope/lib/python/zope/tal/tests/test_xmlparser.py", line 127, in test_xx
    self._run_check(xml, ())
  File "/Users/ajung_data/sandboxes/Zope/Zope/lib/python/zope/tal/tests/test_xmlparser.py", line 106, in _run_check
    parser.parseString(source)
  File "/Users/ajung_data/sandboxes/Zope/Zope/lib/python/zope/tal/xmlparser.py", line 77, in parseString
    self.parser.Parse(s, 1)
UnicodeEncodeError: 'ascii' codec can't encode characters in position 43-48: ordinal not in range(128)

if the string to be parsed is a unicode string and contains some non-ASCII characters. The following snippet from a private unit test (test_xmlparsers.py) shows the error.

    def test_xx(self):
        xml = unicode('<?xml version="1.0" encoding="utf-8"?><foo>üöä</foo>', 'iso-8859-15')
        self._run_check(xml, ())

I am not sure whether this behavior is intentional. Is the XMLParser supposed to deal with unicode strings, or will it only accept a standard Python string? A workaround inside parseString() would be to check for unicode and convert the string on-the-fly to a Python string with utf-8 encoding. This is possibly a limitation of the underlying Expat parser... any recommendation on how to deal with this issue?
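
The workaround described above (check for unicode, encode to UTF-8, then hand the bytes to Expat) could look roughly like the sketch below. It is only an illustration, written for Python 3, where `str` plays the role of Python 2's `unicode`. The function name `parse_xml` is hypothetical and not part of zope.tal. It assumes that passing an encoding to `ParserCreate()` is acceptable, which makes Expat ignore whatever encoding the XML declaration names.

```python
import xml.parsers.expat


def parse_xml(source):
    """Parse `source` with Expat and return its character data.

    Hypothetical helper, not zope.tal code. Text input is encoded to
    UTF-8 before parsing; byte strings are passed through unchanged.
    """
    if isinstance(source, str):
        # Text input: encode it ourselves and tell Expat the bytes are
        # UTF-8, overriding any encoding named in the XML declaration.
        data = source.encode("utf-8")
        parser = xml.parsers.expat.ParserCreate("utf-8")
    else:
        # Byte input: let Expat honour the declared encoding.
        data = source
        parser = xml.parsers.expat.ParserCreate()

    chunks = []
    # Expat may deliver character data in several pieces, so collect them.
    parser.CharacterDataHandler = chunks.append
    parser.Parse(data, True)
    return "".join(chunks)


text = parse_xml('<?xml version="1.0" encoding="utf-8"?><foo>üöä</foo>')
```

Byte strings keep their current behavior with this approach, so existing callers that already pass encoded XML would not be affected.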

IMHO it should only accept strings, because the value should be an XML string and therefore always has to be encoded in 'utf-8' or in the encoding specified in the processing instruction.


Bernd

Andreas




_______________________________________________
Zope3-dev mailing list
Zope3-dev@zope.org
Unsub: http://mail.zope.org/mailman/options/zope3-dev/zope-mailinglist%40mopa.at
