[EMAIL PROTECTED] wrote:
>[...]
> Most of the data is in tables created by
> MS Front Page, and it puts out very buggy HTML (unclosed tables, anyone?) so
> HTML::TableExtract crashes.
Have you emailed the author of HTML::TableExtract? This sounds like a
problem with his module. (I don't know if he's on this list. I know
that at some point he wasn't, and I have recommended that he join.)
Alternatively, you might try using HTML::TreeBuilder to build a full
syntax tree, and then traverse it, looking for the elements you want.
(That reminds me that one of these days, I should make a method for
HTML::Element that will turn a whole tree into the corresponding tree
of XML elements, so that people can benefit from whatever XPath-like
geegaws are implemented for XML trees, since there's no point in
reimplementing all of the XML tools for HTML::Element trees.)
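
Here is a rough sketch of the TreeBuilder approach, in case it helps.
It assumes you want the text of every cell in every table; the helper
name extract_tables and the sample markup are made up for
illustration, and nested tables get no special treatment (rows of an
inner table also show up under the outer one). TreeBuilder closes any
still-open elements when it reaches the end of the input, so an
unclosed table should not stop the parse.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;

# Hypothetical helper: returns a reference to a list of tables,
# each table a list of rows, each row a list of cell strings.
sub extract_tables {
    my ($html) = @_;

    my $tree = HTML::TreeBuilder->new;
    $tree->parse($html);
    $tree->eof;    # end of input: any open elements get closed here

    my @tables;
    for my $table ($tree->look_down(_tag => 'table')) {
        my @rows;
        for my $tr ($table->look_down(_tag => 'tr')) {
            my @cells;
            # Direct children of the row that are <td> or <th> elements
            for my $cell (grep { ref $_ and $_->tag =~ /^t[dh]$/ }
                          $tr->content_list) {
                my $text = $cell->as_text;
                $text =~ s/^\s+|\s+$//g;    # trim surrounding whitespace
                push @cells, $text;
            }
            push @rows, \@cells;
        }
        push @tables, \@rows;
    }

    $tree->delete;    # free the tree explicitly when done with it
    return \@tables;
}

# Made-up sample with no closing tags at all
my $tables = extract_tables('<table><tr><td>A<td>B<tr><td>C<td>D');
for my $row (@{ $tables->[0] || [] }) {
    print join(' | ', @$row), "\n";
}
```

If you only care about one particular table, pass more criteria to
look_down (an attribute value, say) instead of walking all of them.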
--
Sean M. Burke [EMAIL PROTECTED] http://www.netadventure.net/~sburke/