I have a huge database that contains large amounts of HTML that I need to convert to plain ASCII text.
I tried using html2text.py (http://www.aaronsw.com/2002/html2text/), but I could not figure out how to import it and use it as a library without getting errors everywhere. So I decided to try lynx with the -dump switch instead. It works great from the command line, but I am having trouble capturing the output in a Python variable.

The only way I have figured out to start it is:

    s = subprocess.Popen(args='/sw/bin/lynx', stdout=subprocess.PIPE)

but I can't figure out how to pass it the "-dump" switch or the <filename.html> argument and then retrieve the output. Any help would be appreciated.
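For reference, here is a minimal sketch of what I am trying to get working, assuming subprocess.Popen accepts the program and its arguments as one list and that communicate() returns whatever the process wrote to stdout. The helper names run_and_capture and dump_html_as_text are made up for illustration, the /sw/bin/lynx path is just where lynx lives on my machine, and the UTF-8 decoding is a guess about the output encoding.

```python
import subprocess


def run_and_capture(args):
    """Run a command given as a list of strings and return its stdout as text."""
    # Each argument is its own list item, so no shell quoting is needed.
    proc = subprocess.Popen(args, stdout=subprocess.PIPE)
    # communicate() reads stdout until the process exits and returns (stdout, stderr).
    output, _ = proc.communicate()
    # Assumption: the output is UTF-8; undecodable bytes are replaced, not fatal.
    return output.decode("utf-8", errors="replace")


def dump_html_as_text(html_path, lynx_path="/sw/bin/lynx"):
    """Hypothetical helper: run `lynx -dump <html_path>` and return the text."""
    return run_and_capture([lynx_path, "-dump", html_path])


# Intended usage (needs lynx installed at lynx_path):
#     text = dump_html_as_text("filename.html")
```

Passing a list rather than a single string should avoid having to quote the file name, and communicate() avoids reading from the pipe by hand.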