Re: Regular Expressions

[EMAIL PROTECTED] Mon, 12 Feb 2007 02:26:06 -0800

HTML: htmllib and HTMLParser (both in the Python library),
BeautifulSoup (again GIYF)


XML: xml.* in the Python library. ElementTree (recommended) is
included in Python 2.5; use xml.etree.cElementTree.


The source of HTMLParser and xmllib use regular expressions for
parsing out the data. htmllib calls sgmllib at the begining of it's
code--sgmllib starts off with a bunch of regular expressions used to
parse data. So the only real difference there I see is that someone
saved me the work of writing them ;0). I haven't looked at the source
for Beautiful Soup, though I have the sneaking suspicion that most
processing of html/xml is all based on regex's.

-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Regular Expressions

Reply via email to