RE: Question about parsing an html document

2004-10-07 Thread Gary Nielson
Thanks! -Original Message- From: $Bill Luebkert [mailto:[EMAIL PROTECTED] Sent: Wednesday, October 06, 2004 8:15 PM To: Gary Nielson Cc: [EMAIL PROTECTED] Subject: Re: Question about parsing an html document Gary Nielson wrote: > I am trying to get the first paragraph of an arti

Re: Question about parsing an html document

2004-10-07 Thread Andy_Bach
>> if ($line =~ /(.*?)\/p>/i) { > if ($line =~ /(.*?)\/p>/is) { The difference is 's' allows the '.' to match newlines - you probably have one or two before the first tag. Thist $line =~ s/[\r\n]+//g; will help in case your running platform has a different '\n' ($\ or $/ I always forget) de

Re: Question about parsing an html document

2004-10-07 Thread Basil Daoust
L PROTECTED] 06/10/2004 07:15 PM To Gary Nielson <[EMAIL PROTECTED]> cc [EMAIL PROTECTED] Subject Re: Question about parsing an html document Gary Nielson wrote: > I am trying to get the first paragraph of an article from an html document. > I am trying to do this by gettin

Re: Question about parsing an html document

2004-10-06 Thread $Bill Luebkert
Gary Nielson wrote: > I am trying to get the first paragraph of an article from an html document. > I am trying to do this by getting the document from the web, using 'join' to > make many lines one line, and then trying to isolate the text I want. Is > this workable? > > Here's an example of the

RE: Question about parsing an html document

2004-10-06 Thread Charles K. Clarkson
Gary Nielson <[EMAIL PROTECTED]> wrote: : I am trying to get the first paragraph of an article : from an html document. I am trying to do this by : getting the document from the web, using 'join' to : make many lines one line, and then trying to isolate : the text I want. Is this workable? : : He