I am trying to parse an HTML page with the code below and get the following error:
src = sel.get_html_source()
links = re.findall(r'<a class="al4"[^<]*</a>', src)
for link in links:
    print link
======================================================================
ERROR: test_new (__main__.NewTest)
----------------------------------------------------------------------
Traceback (most recent call last):
File "<stdin>", line 19, in test_new
UnicodeEncodeError: 'ascii' codec can't encode character u'\xae' in position 90: ordinal not in range(128)
----------------------------------------------------------------------
Ran 1 test in 6.345s
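
The traceback suggests that one of the matched links contains u'\xae' (the registered-trademark sign) and that printing it triggers an implicit encode to ASCII, which cannot represent that character. Below is a minimal sketch of one possible workaround: encode each link explicitly before writing it out. The sample `src` string and the helper `encode_links` are hypothetical stand-ins (the real page source comes from `sel.get_html_source()`), and UTF-8 is an assumed target encoding, not something the output stream is known to accept.

```python
import re

# Hypothetical stand-in for sel.get_html_source(); the real page is not shown.
src = u'<a class="al4" href="/x">Acme\xae Widgets</a>'

# Same pattern as in the original snippet.
links = re.findall(r'<a class="al4"[^<]*</a>', src)


def encode_links(links, encoding='utf-8'):
    """Hypothetical helper: turn each unicode link into a byte string.

    Encoding explicitly avoids the implicit ASCII encode that fails
    on non-ASCII characters such as u'\\xae'.
    """
    return [link.encode(encoding) for link in links]


# Encoding to ASCII fails for this sample, as in the traceback.
try:
    links[0].encode('ascii')
    ascii_ok = True
except UnicodeEncodeError:
    ascii_ok = False

# Encoding to UTF-8 (an assumption about what the output accepts) succeeds.
encoded = encode_links(links)
```

In the original loop this would amount to writing `print link.encode('utf-8')` instead of `print link`, assuming the terminal or log that receives the output expects UTF-8.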
_______________________________________________
Tutor maillist - [email protected]
http://mail.python.org/mailman/listinfo/tutor