Re: Strip html from filehandle? >was win command console exe how perl launch & capture its STDOUT?

Felix Geerinckx Fri, 31 May 2002 07:02:51 -0700

on Fri, 31 May 2002 01:57:19 GMT, [EMAIL PROTECTED] (Alan C.)
wrote: 

> my $output = qx(graburl 
> http://www.wrh.noaa.gov/cgi-bin/Sacramento/afd?SFOZFPSTO);
> open MYWTHR, ">myweathr"
>     or die "Cannot create mywthr_txt: $!";
> print MYWTHR "$output";
> #----end--
> But it pulls all content, including the html, from that url and
> writes to file. 
> 
> Can strip all html before writes to file?  If so, how?


I don't want/need graburl.exe - used LWP::Simple instead (no need to 
fork).

A quick look at the source of the html revealed that anything of 
interest is after a "<pre>" tag. (Incidentally, there was no 
"</pre>").

So the following code should grab the text only:

    #! /usr/bin/perl -w
    use strict;
    use LWP::Simple;

    my $output = get(
       "http://www.wrh.noaa.gov/cgi-bin/Sacramento/afd?SFOZFPSTO";);
    $output =~ s/(.*)<pre>//s; # the s-modifier is essential!
    print $output;

You could fit the line with the regex in your program and it should 
work.

-- 
felix

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Strip html from filehandle? >was win command console exe how perl launch & capture its STDOUT?

Reply via email to