Łukasz wrote:
I have a problem with my XML parser (created with libraries from
xml.sax package). When parser finds a invalid character (in CDATA
section) for example �, throws an exception SAXParseException.
Is there any way to just ignore this kind of problem. Maybe there is a
way to set up parser in less strict mode?
I know that I can catch this exception and determine if this is this
kind of problem and then ignore this, but I am asking about any global
setting.
The parser from libxml2 that lxml provides has a recovery option, i.e. it
can keep parsing regardless of errors and will drop the broken content.
However, it is *always* better to fix the input, if you get any hand on it.
Broken XML is *not* XML at all. If you can't fix the source, you can never
be sure that the data you received is in any way complete or even usable.
Stefan
--
http://mail.python.org/mailman/listinfo/python-list