Hi, I would like to import changes from:
The problem is that HTMLParser from 2.7.2 is not lenient and likes to throw 
exceptions, when html document is not well formed: 
http://bugs.python.org/issue13987

This often involves exception from BeautifoulSoup, which gains great speed up 
when using from pypy + HTMLParser from stdlib:
    "RuntimeWarning: Python's built-in HTMLParser cannot parse the given 
document. This is not a bug in Beautiful Soup. The best solution is to install 
an external parser (lxml or html5lib), and use Beautiful Soup with that 
parser. See http://www.crummy.com/software/BeautifulSoup/bs4/doc/#installing-
a-parser for help."

However lxml is not compatibile with PyPy, and html5lib is slow.

Can I port the HTMLParser.py from python 2.7.3 to PyPy?

-- 
  Robert Zaremba

_______________________________________________
pypy-dev mailing list
[email protected]
http://mail.python.org/mailman/listinfo/pypy-dev

Reply via email to