Rob Dixon wrote:
Jack Gates wrote:
s!(<|</)([^\!][A-Z0-9 ]+>)!$1\L$2\E!g;
s/(<|<\/)([^!][A-Z0-9 ]+>)/$1\L$2\E/g;

The RE above captures all HTML tags and converts them to lowercase as desired, except for any tag whose name is a single letter, such as <P>, <B> or <I>.

It does catch </B>, </P> and </I>.

It properly ignores the <!DOCTYPE> tag.
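
The behaviour described above can be reproduced with a short script. This is only a sketch: the sub name lc_tags_original and the sample tags are made up for illustration, and the substitution is the one quoted at the top. The likely cause is that [^!] consumes one character and [A-Z0-9 ]+ then demands at least one more before the ">", so a one-letter name cannot match. In a closing tag the "/" is what [^!] consumes, which leaves the single letter for the character class.

```perl
use strict;
use warnings;

# Hypothetical wrapper around the substitution quoted in the message.
sub lc_tags_original {
    my ($html) = @_;
    $html =~ s/(<|<\/)([^!][A-Z0-9 ]+>)/$1\L$2\E/g;
    return $html;
}

# Print each sample tag next to the result of the substitution.
for my $tag ('<P>', '</P>', '<TABLE>', '<!DOCTYPE>') {
    print "$tag => ", lc_tags_original($tag), "\n";
}
# <P> => <P>
# </P> => </p>
# <TABLE> => <table>
# <!DOCTYPE> => <!DOCTYPE>
```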

What is the correct way to write the above RE?

HTML tag names can't contain spaces, so you want
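
One pattern consistent with that remark is sketched here; it is not necessarily the exact expression from the reply. It assumes, as the original RE did, that only tags without attributes need converting. The sub name lc_tag_names is hypothetical.

```perl
use strict;
use warnings;

# Hypothetical helper: lowercase the names of attribute-free tags.
sub lc_tag_names {
    my ($html) = @_;
    # $1 is "<" or "</". $2 is the tag name plus ">"; the name must begin
    # with a letter, so "<!DOCTYPE>" never matches, and the "*" allows
    # one-letter names such as P, B and I.
    $html =~ s{(</?)([A-Z][A-Z0-9]*>)}{$1\L$2\E}g;
    return $html;
}

print lc_tag_names('<!DOCTYPE><P><B>Hi</B> <H1>x</H1></P>'), "\n";
# prints: <!DOCTYPE><p><b>Hi</b> <h1>x</h1></p>
```

Requiring a leading letter replaces the [^!] test, so no character is consumed before the name itself starts.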


Is there a difference between "\U</" and "\L</"?
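
A quick way to check this, as a sketch with made-up variable names: \U and \L only change letters, so a string made purely of punctuation such as "</" should come out the same either way.

```perl
use strict;
use warnings;

my $prefix = '</';          # contains no letters
my $upper  = "\U$prefix";   # uppercase conversion
my $lower  = "\L$prefix";   # lowercase conversion

print "identical\n" if $upper eq $lower;
# prints: identical

# Letters inside the span are still affected as usual.
print "\U</b>\E and \L</B>\E\n";
# prints: </B> and </b>
```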

Perl isn't a toolbox, but a small machine shop where you
can special-order certain sorts of tools at low cost and
in short order.                            -- Larry Wall
