Re: [Tutor] PDF to text conversion

2009-04-22 Thread Dinesh B Vadhia
result. hth. dinesh Message: 4 Date: Tue, 21 Apr 2009 18:37:39 -0400 From: Robert Berman berma...@cfl.rr.com Subject: Re: [Tutor] PDF to text conversion To: tutor@python.org Message-ID: 49ee4ab3.4040...@cfl.rr.com

Re: [Tutor] PDF to text conversion

2009-04-22 Thread Robert Berman
Message: 4 Date: Tue, 21 Apr 2009 18:37:39 -0400 From: Robert Berman berma...@cfl.rr.com mailto:berma...@cfl.rr.com Subject: Re: [Tutor] PDF to text conversion To: tutor@python.org mailto:tutor@python.org Message-ID: 49ee4ab3.4040...@cfl.rr.com mailto:49ee4ab3.4040...@cfl.rr.com Content

Re: [Tutor] PDF to text conversion

2009-04-22 Thread David
Robert Berman wrote: Dinesh, I have pdftotext version 3.0.0. I have decided to use this to go from PDF to text. It is not the ideal solution, but is is a certainly doable solution. Thank you, Robert Dinesh B Vadhia wrote: The best converter so far is pdftotext from

[Tutor] PDF to text conversion

2009-04-21 Thread Robert Berman
Hi, I must convert a history file in PDF format that goes from May of 1988 to current date. Readings are taken twice weekly and consist of the date taken mm/dd/yy and the results appearing as a 10 character numeric + special characters sequence. This is obviously an easy setup for a very

Re: [Tutor] PDF to text conversion

2009-04-21 Thread bob gailer
Robert Berman wrote: Hi, I must convert a history file in PDF format that goes from May of 1988 to current date. Readings are taken twice weekly and consist of the date taken mm/dd/yy and the results appearing as a 10 character numeric + special characters sequence. This is obviously an

Re: [Tutor] PDF to text conversion

2009-04-21 Thread عماد نوفل
On Tue, Apr 21, 2009 at 12:54 PM, bob gailer bgai...@gmail.com wrote: Robert Berman wrote: Hi, I must convert a history file in PDF format that goes from May of 1988 to current date. Readings are taken twice weekly and consist of the date taken mm/dd/yy and the results appearing as a 10

Re: [Tutor] PDF to text conversion

2009-04-21 Thread Robert Berman
Bob, Thank you for the quick reply. I am acquainted with that method, and that will certainly work to do some really serious testing; but, the data collection is an ongoing process and the users are requesting that every month the latest entries (8) are brought into the system. What is

Re: [Tutor] PDF to text conversion

2009-04-21 Thread Robert Berman
Hello Emad, I have seriously looked at the documentation associated with pyPDF. This seems to have the page as its smallest element of work, and what i need is a line by line process to go from .PDF format to Text. I don't think pyPDF will meet my needs but thank you for bringing it to my

Re: [Tutor] PDF to text conversion

2009-04-21 Thread Emile van Sebille
Robert Berman wrote: snip Have any of you worked with such a library, or do you know of one or two I can download and work with? Hopefully, they have reasonable documentation. My development environment is: Python Linux Ubuntu version 8.10 I've used [r...@fcfw2 /]# /usr/bin/pdftotext -v

Re: [Tutor] PDF to text conversion

2009-04-21 Thread johnf
On Tuesday 21 April 2009 10:36:59 am Robert Berman wrote: Bob, Thank you for the quick reply. I am acquainted with that method, and that will certainly work to do some really serious testing; but, the data collection is an ongoing process and the users are requesting that every month the

Re: [Tutor] PDF to text conversion

2009-04-21 Thread Dinesh B Vadhia
...@cfl.rr.com Subject: Re: [Tutor] PDF to text conversion To: Emad Nawfal ( ) emadnaw...@gmail.com Cc: tutor@python.org Message-ID: 49ee05f0.3080...@cfl.rr.com Content-Type: text/plain; charset=windows-1256; format=flowed Hello Emad, I have seriously looked at the documentation associated

Re: [Tutor] PDF to text conversion

2009-04-21 Thread johnf
Date: Tue, 21 Apr 2009 13:44:16 -0400 From: Robert Berman berma...@cfl.rr.com Subject: Re: [Tutor] PDF to text conversion To: Emad Nawfal ( ) emadnaw...@gmail.com Cc: tutor@python.org Message-ID: 49ee05f0.3080...@cfl.rr.com Content-Type: text/plain; charset=windows-1256; format=flowed

Re: [Tutor] PDF to text conversion

2009-04-21 Thread Dayo Adewunmi
Emile van Sebille wrote: Robert Berman wrote: snip Have any of you worked with such a library, or do you know of one or two I can download and work with? Hopefully, they have reasonable documentation. My development environment is: Python Linux Ubuntu version 8.10 I've used [r...@fcfw2

Re: [Tutor] PDF to text conversion

2009-04-21 Thread Martin Walsh
Robert Berman wrote: Hello Emad, I have seriously looked at the documentation associated with pyPDF. This seems to have the page as its smallest element of work, and what i need is a line by line process to go from .PDF format to Text. I don't think pyPDF will meet my needs but thank you

Re: [Tutor] PDF to text conversion

2009-04-21 Thread David
bob gailer wrote: Robert Berman wrote: Hi, I must convert a history file in PDF format that goes from May of 1988 to current date. Readings are taken twice weekly and consist of the date taken mm/dd/yy and the results appearing as a 10 character numeric + special characters sequence. This

Re: [Tutor] PDF to text conversion

2009-04-21 Thread Norman Khine
the itools library from hforge.org has a PDF2TEXT implementation itools.pdf http://www.hforge.org/itools norman On Tue, Apr 21, 2009 at 8:44 PM, Dayo Adewunmi contactd...@gmail.com wrote: Emile van Sebille wrote: Robert Berman wrote: snip Have any of you worked with such a library, or do

Re: [Tutor] PDF to text conversion

2009-04-21 Thread Robert Berman
First, thanks to everyone who contributed to this thread. I have a number of possible solutions and a number of paths to pursue to determine which avenue I should take to resolve this remaining issue. I did try the itools library and while everything installed nicely, most of the tests failed