On 6/7/06, Robin Haswell <[EMAIL PROTECTED]> wrote:
Hey there

I'm trying to use PyKDE to grab a DOM from HTML source for a
privoxy-like project. I was wondering if anyone could get me started on
this? I'm using PyKDE because I'm struggling to find an HTML parser I
can use which won't choke on the majority of HTML out there.
BeautifulSoup is unsuitable and I shudder at building PyXPCOM. Any help
would be very appreciated :-)

Have you tried xml.parsers.sgmlop? it's a newer parser lib that might
also work for your purposes. There is an sgml parser and an XML
parser, supporting the same interface.

_______________________________________________
PyKDE mailing list    PyKDE@mats.imk.fraunhofer.de
http://mats.imk.fraunhofer.de/mailman/listinfo/pykde

Reply via email to