Robert Berman wrote:
Dinesh,
I have pdftotext version 3.0.0. I have decided to use this to go from
PDF to text. It is not the ideal solution, but is is a certainly doable
solution.
Thank you,
Robert
Dinesh B Vadhia wrote:
The best converter so far is pdftotext from
http://www.glyphandcog.com/ who maintain an open source project at
http://www.foolabs.com/xpdf/.
It's not a Python library but you can call pdftotext from with Python
using os.system(). I used the pdftotext -layout option and that gave
the best result. hth.
dinesh
You can use subprocess;
#!/usr/bin/python
from subprocess import call
call(['pdftotext', 'test.pdf'])
-david
--
Powered by Gentoo GNU/Linux
http://linuxcrazy.com
_______________________________________________
Tutor maillist - Tutor@python.org
http://mail.python.org/mailman/listinfo/tutor