A_H wrote:
Help!I've scraped a PDF file for text and all the minus signs come back as u'\xad'. Is there any easy way I can change them all to plain old ASCII '-' ??? str.replace complained about a missing codec. Hints?
Encoding it into a 'latin1' encoded string seems to work:
>>> print u'\xad'.encode('latin1')
-
-- http://mail.python.org/mailman/listinfo/python-list
-- http://mail.python.org/mailman/listinfo/python-list
