On 02/01/2010 11:29 PM, Seth Woodworth wrote:
> I would suggest, when possible, using the Html5lib parser and using
> the traverser from BeautifulSoup. The author himself suggests[1] this
> in any case of BS-3.1.0 or 3.0.8 behaving poorly.
>
> I have been doing work with python, BeautifulSoup and Html5Lib lately,
> and I've been collecting and slowly improving python scripts (like
> this) to liberate data from websites like Reddit or the Ubuntu forums.
> I would love to get involved with the lastscrape.py script.
http://bugs.libre.fm/wiki/LastToLibre is the new way to do this. Last.fm has an API now, for people like us ;)
