John J. Lee wrote: > Mark Harrison <[EMAIL PROTECTED]> writes: > > > Ahh, it's BeautifulSoup... > > Strictly that's not THE DOM, just A document object model. The DOM > proper is a standardised interface, which BeautifulSoup does not > implement. You could build a DOM using BeautifulSoup, though.
For a certain value of standardised, libxml2dom provides "the DOM" for HTML: import urllib, libxml2dom f = urllib.urlopen("http://www.python.org") s = f.read(); f.close() d = libxml2dom.parseString(s, html=1) print "There are", len(d.xpath("//table")), "tables in the document." See http://www.python.org/pypi/libxml2dom for more information. Paul -- http://mail.python.org/mailman/listinfo/python-list