On 4 August 2014 10:29, Thomas Goirand <z...@debian.org> wrote:

> Also, fixing version 3 of beautifulsoup doesn't look very easy. It needs
> sgmllib, which is removed from Python 3, and it doesn't feel right to
> maintain sgmllib as a Python module for Python 3 (I tried, and with a
> few hacks, it works though...).
>

IIRC, that's why upstream only added Python 3 support in a new major
release with a new PyPI package name.

In terms of porting, the main thing to be aware of is that beautifulsoup4
no longer handles parsing HTML itself - now that lxml and html5lib offer
decent, tolerant HTML parsers, beautifulsoup is focused on the tree
traversal and manipulation APIs. So the APIs should be largely the same,
but the parsing may produce slightly different results.

> Is there anyone who wish to package beautifulsoup4

I think someone already did ;-)

https://packages.qa.debian.org/b/beautifulsoup4.html

Thomas

Reply via email to