[EMAIL PROTECTED] wrote:
>[...]
> Most of the data is in tables created by
> MS Front Page, and it puts out very buggy HTML (unclosed tables, anyone?) so
> HTML::TableExtract crashes.

Have you emailed the author of HTML::TableExtract?  This sounds like a
problem with his module.  (I don't know if he's on this list.  I know
that at some point he wasn't, and I have recommended that he join.)

Alternatively, you might try using HTML::TreeBuilder to build a full
syntax tree, and then traversing it, looking for the elements you want.
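
A minimal sketch of that approach, assuming the goal is just to pull the
cell text out of every table. The sample HTML and the extract_tables
helper are made up for illustration; only the HTML::TreeBuilder and
HTML::Element calls (new_from_content, look_down, tag, as_trimmed_text,
delete) are real. The sample deliberately leaves a row and the table
itself unclosed, since TreeBuilder tries to repair that kind of markup
while parsing.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;

# Hypothetical sample input: one <tr> and the <table> are never closed.
my $html = <<'HTML';
<html><body>
<table>
<tr><td>Name</td><td>Qty</td></tr>
<tr><td>Widget</td><td>3</td>
<tr><td>Gadget</td><td>5</td></tr>
</body></html>
HTML

# Hypothetical helper: returns one array ref per table, each holding
# one array ref of trimmed cell texts per row.
sub extract_tables {
    my ($content) = @_;
    my $tree = HTML::TreeBuilder->new_from_content($content);
    my @tables;
    for my $table ($tree->look_down(_tag => 'table')) {
        my @rows;
        # Caveat: look_down searches all descendants, so rows of any
        # nested tables are also collected under the outer table.
        for my $tr ($table->look_down(_tag => 'tr')) {
            my @cells = map { $_->as_trimmed_text }
                        $tr->look_down(sub { $_[0]->tag =~ /^t[dh]$/ });
            push @rows, \@cells;
        }
        push @tables, \@rows;
    }
    $tree->delete;    # release the tree once the text has been copied out
    return @tables;
}

for my $table (extract_tables($html)) {
    print join("\t", @$_), "\n" for @$table;
}
```

How well this copes with real Front Page output depends on how badly
the markup is broken, so treat it as a starting point rather than a fix.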


(That reminds me that one of these days, I should make a method for
HTML::Element that will turn a whole tree into the corresponding tree
of XML elements, so that people can benefit from whatever XPath-like
geegaws are implemented for XML trees, since there's no point in
reimplementing all of the XML tools for HTML::Element trees.)

-- 
Sean M. Burke  [EMAIL PROTECTED]  http://www.netadventure.net/~sburke/
