: [EMAIL PROTECTED]
Subject: Re: What would take care of this?...
Daniel == Daniel Falkenberg [EMAIL PROTECTED] writes:
Daniel Would I now have to go ahead and use HTML::parser or
Daniel something of similar nature to extract headings?
Yeah, go with HTML::TokeParser.
Daniel !DOCTYPE
Sorry, that was not meant to escape as HTML mail... here we are in ASCII, I
hope:
At 11:11 AM 2/25/02 +1030, Daniel Falkenberg wrote:
I went ahead and started using HTML::TokeParser and I have read the
HTML::TokeParser manpage. I am sure my coding is correct but for some
reason the script
I cut too much out of my post:
At 11:11 AM 2/25/02 +1030, Daniel Falkenberg wrote:
$inputSite = URL OF CHOICE;
$tree = HTML::TreeBuilder-new;
$address = http://; . $inputSite;
$request = HTTP::Request-new('GET', $address);
$response = $ua-request($request);
my $found = 0;
Daniel == Daniel Falkenberg [EMAIL PROTECTED] writes:
Daniel Would I now have to go ahead and use HTML::parser or
Daniel something of similar nature to extract headings?
Yeah, go with HTML::TokeParser.
Daniel !DOCTYPE HTML PUBLIC -//IETF//DTD HTML//EN
Daniel HTMLHEADTITLEGet
--- Daniel Falkenberg [EMAIL PROTECTED] wrote:
Hey All,
Just wondering how I would go about extracting all
the data from heading 1 (h1) in the following HTML
code. I figured I could have used HTML::TableExtract
but then I realized ( :) ) there are not tables in
the following HTML.
Jonathan == Jonathan E Paton [EMAIL PROTECTED] writes:
Jonathan /h1([^]*?)\/h1/
Please don't ever try and parse HTML with regexps - I've had to work
with way too much code that did. There are many situations where your
regex would break, and the TokeParser code wasn't much longer. It's
Hey All,
Just wondering how I would go about extracting all the data from heading
1 (h1) in the following HTML code. I figured I could have used
HTML::TableExtract but then I realized ( :) ) there are not tables in
the following HTML. Would I now have to go ahead and use HTML::parser
or