Ezio Melotti <ezio.melo...@gmail.com> added the comment:

The HTML 4.01 specifications says[0]:
"""
In certain cases, authors may specify the value of an attribute without any 
quotation marks. The attribute value may only contain letters (a-z and A-Z), 
digits (0-9), hyphens (ASCII decimal 45), periods (ASCII decimal 46), 
underscores (ASCII decimal 95), and colons (ASCII decimal 58). We recommend 
using quotation marks even when it is possible to eliminate them.
"""

The HTML 5 draft says[1]:
"""
The attribute name, followed by zero or more space characters, followed by a 
single U+003D EQUALS SIGN character, followed by zero or more space characters, 
followed by the attribute value, which, in addition to the requirements given 
above for attribute values, must not contain any literal space characters, any 
U+0022 QUOTATION MARK characters ("), U+0027 APOSTROPHE characters ('), U+003D 
EQUALS SIGN characters (=), U+003C LESS-THAN SIGN characters (<), U+003E 
GREATER-THAN SIGN characters (>), or U+0060 GRAVE ACCENT characters (`), and 
must not be the empty string.
"""

So maybe [^>\s] is a little too permissive here.

[0]: http://www.w3.org/TR/html4/intro/sgmltut.html#h-3.2.2
[1]: http://dev.w3.org/html5/spec/Overview.html#attributes-0

----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue7311>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to