New submission from Ezio Melotti <ezio.melo...@gmail.com>:

HTMLParser has an internal method called unescape [0] used to convert named 
character references to the equivalent characters, and it does so by using 
html.entities.name2codepoint to recreate the equivalent of 
html.entities.entityrefs with the addition of &apos;.
Now that the html5 entities have been added to html.entities, the parser should 
use them instead of name2codepoint.

[0]: see Lib/html/parser.py:500

----------
assignee: ezio.melotti
components: Library (Lib)
messages: 163702
nosy: eric.araujo, ezio.melotti, r.david.murray
priority: normal
severity: normal
stage: needs patch
status: open
title: Refactor HTMLParser.unescape to use html.entities.html5
type: enhancement
versions: Python 3.3

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue15156>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to