>From: "Daniel F. Savarese" <[EMAIL PROTECTED]>
>Reply-To: [EMAIL PROTECTED]
>To: [EMAIL PROTECTED]
>Subject: Re: I want to process HTML with DOM,but the HTML is no-well-formed!
>Date: Sat, 29 Sep 2001 13:08:23 -0400
>
>
>In message <[EMAIL PROTECTED]>, "Surfbird Fang" 
>writes:
> >Yes, I want to process HTML with DOM, but ordinary HTML isn't
> >well-formed.
>
>That is a problem.  One thing to keep in mind is not to count on a
>single regular expression to do all of the work when processing HTML.
>I'm speaking in general, and not necessarily about your specific use,
>where it might be just fine.  What regular expressions cannot do
>without additional tokenization and parsing logic is solve the
>matching nested parentheses problem (the same problem as matching
>open tags with end tags).  People often run into trouble because they
>try to extract a lot of content from HTML with a single regular
>expression and get thrown off by nested tags.  At any rate, this is
>all side commentary.
>
>daniel
>
>
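
To make the nested-tag point concrete, here is a minimal sketch in Python.
It is not from the original thread: the sample string, the TAG pattern, and
the helper name outer_element_content are all made up for illustration.
It shows a single regular expression pairing the wrong tags, and then the
same regular expression idea used only as a tokenizer, with a depth counter
doing the matching.  It assumes simple tags with no ">" inside attribute
values and no comments or scripts, so treat it as a demonstration rather
than an HTML parser.

```python
import re

# Hypothetical sample input: a <div> nested inside another <div>.
HTML = "<div>outer <div>inner</div> tail</div>"

# A single non-greedy regex stops at the first closing tag it sees,
# so it pairs the outer <div> with the inner </div>.
single = re.search(r"<div>(.*?)</div>", HTML)
naive_content = single.group(1)

# Tokenizer pattern (an assumption for this sketch): group 1 is "/" for
# an end tag, group 2 is the tag name. Attributes are skipped, not parsed.
TAG = re.compile(r"<(/?)(\w+)[^>]*>")


def outer_element_content(html, name):
    """Return the content of the first `name` element, honouring nesting.

    The regex only finds tags; the depth counter decides which end tag
    closes the element that was opened first.
    """
    depth = 0
    start = None
    for m in TAG.finditer(html):
        closing = m.group(1) == "/"
        tag = m.group(2).lower()
        if tag != name:
            continue
        if not closing:
            if depth == 0:
                start = m.end()  # content begins after the outermost open tag
            depth += 1
        elif depth > 0:
            depth -= 1
            if depth == 0:
                return html[start:m.start()]
    return None  # unbalanced input: no matching end tag was found


if __name__ == "__main__":
    print("single regex :", naive_content)
    print("with counter :", outer_element_content(HTML, "div"))
```

The counter is the "additional parsing logic" mentioned above: the regular
expression alone has no way to remember how many open tags are still
waiting for their end tags.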

