On Sun, Mar 02, 2008 at 08:29:32PM +0100, Hans Martin wrote:
> Hi,
> 
> I assume, this would break any XML standard, but still...
> 
> I like to parse XML with all entity references ignored,
> i.e. defined or undefined entities neither substituted
> nor removed. So far, with libxml2 2.6.31 and Python 2.4
> it works only for the defined ones.
> 
> Is there a clean way to do this? Currently, I quote the
> ampersands of my input and unquote later. This works
> well, but feels hacky. Is there a better way?

  Predefined entities (lt, gt, amp, quot and apos) are always
substitued by XML parsers, I'm afraid what you're trying to do
is raw text processing, not text processing and better done 
with text processing tools and not XML ones.

Daniel

-- 
Red Hat Virtualization group http://redhat.com/virtualization/
Daniel Veillard      | virtualization library  http://libvirt.org/
[EMAIL PROTECTED]  | libxml GNOME XML XSLT toolkit  http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine  http://rpmfind.net/
_______________________________________________
xml mailing list, project page  http://xmlsoft.org/
[email protected]
http://mail.gnome.org/mailman/listinfo/xml

Reply via email to