Thanks!
-----Original Message-----
From: $Bill Luebkert [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, October 06, 2004 8:15 PM
To: Gary Nielson
Cc: [EMAIL PROTECTED]
Subject: Re: Question about parsing an html document
Gary Nielson wrote:
> I am trying to get the first paragraph of an arti
>> if ($line =~ /(.*?)\/p>/i) {
> if ($line =~ /(.*?)\/p>/is) {
The difference is that 's' allows the '.' to match newlines - you probably
have one or two before the first tag. Thist
$line =~ s/[\r\n]+//g;
will help in case your running platform has a different '\n' ($\ or $/ I
always forget) de
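
To make the two suggestions above concrete, here is a minimal sketch. The sample string and the literal <p> in the pattern are assumptions for illustration only, since the pattern quoted in the thread is cut off in this archive. It compares the match with and without 's', and shows the alternative of stripping the line endings first. (For reference, $/ is the input record separator and $\ is the output one.)

```perl
use strict;
use warnings;

# Hypothetical sample input: the first paragraph spans two lines.
my $line = "<p>First paragraph,\nsecond line.</p>\n<p>Next one.</p>";

# Without /s, '.' cannot cross the newline inside the first paragraph,
# so the regex engine moves on and matches the second paragraph instead.
my ($without_s) = $line =~ /<p>(.*?)<\/p>/i;

# With /s, '.' also matches newlines, so the first paragraph is captured.
my ($with_s) = $line =~ /<p>(.*?)<\/p>/is;

# Alternative: strip CR/LF from a copy first, then match without /s.
# Note that this glues the two lines together with no space between them.
(my $flat = $line) =~ s/[\r\n]+//g;
my ($from_flat) = $flat =~ /<p>(.*?)<\/p>/i;

print "without /s: $without_s\n";
print "with /s:    $with_s\n";
print "stripped:   $from_flat\n";
```

Matching explicitly on [\r\n] avoids depending on what the platform treats as a line ending, which seems to be the point of the substitution above.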
[EMAIL PROTECTED]
06/10/2004 07:15 PM
To
Gary Nielson <[EMAIL PROTECTED]>
cc
[EMAIL PROTECTED]
Subject
Re: Question about parsing an html document
Gary Nielson wrote:
> I am trying to get the first paragraph of an article from an html document.
> I am trying to do this by gettin
Gary Nielson wrote:
> I am trying to get the first paragraph of an article from an html document.
> I am trying to do this by getting the document from the web, using 'join' to
> make many lines one line, and then trying to isolate the text I want. Is
> this workable?
>
> Here's an example of the
Gary Nielson <[EMAIL PROTECTED]> wrote:
: I am trying to get the first paragraph of an article
: from an html document. I am trying to do this by
: getting the document from the web, using 'join' to
: make many lines one line, and then trying to isolate
: the text I want. Is this workable?
:
: He
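
For the approach described in the question (join the lines into one string, then isolate the text), a minimal sketch might look like the following. The helper name first_paragraph and the sample page are hypothetical, and the fetch step is left out so the example runs without network access. The pattern assumes the paragraph is delimited by a plain <p ...> ... </p> pair, which the truncated example in the thread does not confirm.

```perl
use strict;
use warnings;

# Hypothetical helper: join the lines into one string and return the
# text of the first <p>...</p> element, or undef if there is none.
sub first_paragraph {
    my @lines = @_;
    my $html = join '', @lines;    # many lines become one string

    # /i ignores tag case; /s lets '.' match the embedded newlines.
    # \b keeps <p from also matching tags such as <pre>.
    return $html =~ /<p\b[^>]*>(.*?)<\/p>/is ? $1 : undef;
}

# Hypothetical stand-in for a fetched page, one element per line read.
my @page = (
    "<html><body>\n",
    "<h1>Headline</h1>\n",
    "<p class=\"lead\">Opening sentence\n",
    "continues here.</p>\n",
    "<p>Second paragraph.</p>\n",
    "</body></html>\n",
);

my $first = first_paragraph(@page);
print defined $first ? "$first\n" : "no paragraph found\n";
```

So the join approach is workable for simple, predictable markup. For pages with nested or irregular tags, a real HTML parser such as HTML::TreeBuilder from CPAN is likely to be more robust than a regex.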