Kurt Peters wrote:
> I had done that about 21 revisions ago.
If you litter your module with commented-out code, it is hard to keep
track of what works and what doesn't.
> Nevertheless, why would you think
> that would work, when the code as shown doesn't?
Because he knows Python? Why don't /you/ try it before asking that question?
A good place to do "exploratory" programming is Python's interactive
interpreter. Here's a sample session:
Python 2.5.1 (r251:54863, Jul 31 2008, 23:17:43)
[GCC 4.1.3 20070929 (prerelease) (Ubuntu 4.1.2-16ubuntu2)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> from pyPdf import PdfFileReader as PFR
>>> doc = PFR(open("SUA.pdf", "rb"))
>>> text = doc.getPage(3).extractText()
>>> type(text)
<type 'unicode'>
>>> text[:200]
u'2/16/08 7400.8P Table of Contents - Continued Section Page \xa773.49 New Hampshire (NH) 50 \xa773.50 New Jersey (NJ) 50 \xa773.51 New Mexico (NM) 51 \xa773.52 New York (NY) 56 \xa773.53 North '
>>> print text[:200].replace(u"\xa7", u"\n")
2/16/08 7400.8P Table of Contents - Continued Section Page
73.49 New Hampshire (NH) 50
73.50 New Jersey (NJ) 50
73.51 New Mexico (NM) 51
73.52 New York (NY) 56
73.53 North
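
If you want the entries as a list rather than just printed, the same idea carries over to a script: split on the section sign (u"\xa7") instead of replacing it. What follows is only a rough sketch. The sample string is copied from the session above rather than read from the PDF, and the name split_sections is made up for illustration; it is not part of pyPdf.

```python
SECTION_SIGN = u"\xa7"  # the section sign that precedes each entry in the extracted text


def split_sections(text):
    """Split text on the section sign; return (header, list of entries).

    Hypothetical helper for illustration only, not a pyPdf function.
    """
    parts = text.split(SECTION_SIGN)
    header = parts[0].strip()
    entries = [part.strip() for part in parts[1:]]
    return header, entries


# Sample copied from the interactive session; in real use this would be
# the unicode string returned by extractText().
sample = (u"2/16/08 7400.8P Table of Contents - Continued Section Page "
          u"\xa773.49 New Hampshire (NH) 50 \xa773.50 New Jersey (NJ) 50 "
          u"\xa773.51 New Mexico (NM) 51")

header, entries = split_sections(sample)
for entry in entries:
    print(entry)
```

Each element of entries is then one table-of-contents line, e.g. u"73.49 New Hampshire (NH) 50", which is easier to process further than one long string.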
Peter