Robert Berman wrote:
Dinesh,

I have pdftotext version 3.0.0. I have decided to use this to go from PDF to text. It is not the ideal solution, but is is a certainly doable solution.

Thank you,

Robert

Dinesh B Vadhia wrote:
The best converter so far is pdftotext from http://www.glyphandcog.com/ who maintain an open source project at http://www.foolabs.com/xpdf/. It's not a Python library but you can call pdftotext from with Python using os.system(). I used the pdftotext -layout option and that gave the best result. hth. dinesh
You can use subprocess;
#!/usr/bin/python

from subprocess import call
call(['pdftotext', 'test.pdf'])

-david
--
Powered by Gentoo GNU/Linux
http://linuxcrazy.com
_______________________________________________
Tutor maillist  -  Tutor@python.org
http://mail.python.org/mailman/listinfo/tutor

Reply via email to