Re: Help with character encodings

2008-05-20 Thread Gary Herron
Gary Herron wrote: A_H wrote: Help! I've scraped a PDF file for text and all the minus signs come back as u'\xad'. Is there any easy way I can change them all to plain old ASCII '-' ??? str.replace complained about a missing codec. Hints? Encoding it into a 'latin1' encoded string see

Re: Help with character encodings

2008-05-20 Thread J. Cliff Dyer
On Tue, 2008-05-20 at 08:28 -0700, Gary Herron wrote: > A_H wrote: > > Help! > > > > I've scraped a PDF file for text and all the minus signs come back as > > u'\xad'. > > > > Is there any easy way I can change them all to plain old ASCII '-' ??? > > > > str.replace complained about a missing codec

Re: Help with character encodings

2008-05-20 Thread Gary Herron
A_H wrote: Help! I've scraped a PDF file for text and all the minus signs come back as u'\xad'. Is there any easy way I can change them all to plain old ASCII '-' ??? str.replace complained about a missing codec. Hints? Encoding it into a 'latin1' encoded string seems to work: >>> pr

Help with character encodings

2008-05-20 Thread A_H
Help! I've scraped a PDF file for text and all the minus signs come back as u'\xad'. Is there any easy way I can change them all to plain old ASCII '-' ??? str.replace complained about a missing codec. Hints? -- http://mail.python.org/mailman/listinfo/python-list