First, thanks to everyone who contributed to this thread. I have a
number of possible solutions and a number of paths to pursue to
determine which avenue I should take to resolve this remaining issue. I
did try the itools library and while everything installed nicely, most
of the tests failed so I am not particularly overjoyed with the results.
Thank you Dinesh for the vote of sympathy. I do appreciate it.
I did use Adobe Reader to convert the history PDF file into a text file
and it did seem to do it faithfully. So now I will work out a parsing
function to extract my data and send it to a SQLLITE database.
I am thrilled both with the number of suggestions I have received from
this group and the quality of the suggestions.
Thanks again,
Robert Berman
Norman Khine wrote:
the itools library from hforge.org has a PDF2TEXT implementation itools.pdf
http://www.hforge.org/itools
norman
On Tue, Apr 21, 2009 at 8:44 PM, Dayo Adewunmi <contactd...@gmail.com> wrote:
Emile van Sebille wrote:
Robert Berman wrote:
<snip>
Have any of you worked with such a library, or do you know of one or two
I can download and work with? Hopefully, they have reasonable documentation.
My development environment is:
Python
Linux
Ubuntu version 8.10
I've used
[r...@fcfw2 /]# /usr/bin/pdftotext -v
pdftotext version 2.01
Copyright 1996-2002 Glyph & Cog, LLC
[r...@fcfw2 /]# cat /etc/issue
Red Hat Linux release 9 (Shrike)
HTH,
Emile
_______________________________________________
Tutor maillist - Tutor@python.org
http://mail.python.org/mailman/listinfo/tutor
Hi Robert,
pdftotext is part of poppler-utils, an Ubuntu package which can be installed
like so:
sudo aptitude install poppler-utils
But I to would be interested in finding a python library/module for this.
Regards,
Dayo
_______________________________________________
Tutor maillist - Tutor@python.org
http://mail.python.org/mailman/listinfo/tutor
_______________________________________________
Tutor maillist - Tutor@python.org
http://mail.python.org/mailman/listinfo/tutor
_______________________________________________
Tutor maillist - Tutor@python.org
http://mail.python.org/mailman/listinfo/tutor