Jesse Houwing wrote:

BODY TABLEOBFU m{<td([^>]+|"[^"]+)>(<([^>]+|"[^"]+)>)*[a-z]{1,2}(<([^>]+|"[^"]+)>)*</td([^>]+|"[^"]+)>}i

I think you may want a * after the ) inside the <>. As it is, you're looking for either a bunch of characters that are not > or a quote followed by a bunch of characters that are not quote. In fact, I think what was really intended was something more like this (note that this also requires an ending quote on contained quoted strings and allows ""):


m{<td([^>"]+|"[^"]*")*>(<([^>"]+|"[^"]*")*>)*[a-z]{1,2}(<([^>"]+|"[^"]*")*>)*</td([^>"]+|"[^"]*")*>}i


The other problem with the pattern as written (with no *) is that the subpatterns don't match plain <td> or </td>, since they require at least one character between the td and the >.


--
Keith C. Ivey <[EMAIL PROTECTED]>
Washington, DC

Reply via email to