Re: [PHP] PDF to Text

2006-04-21 Thread Al
Jay Blanchard wrote: [snip] I am trying to find a way for a program to search through the text on a PDF. My first thought was to use pdftotext, but the PDFs generated by our commercial scanner/copier/printer machine do not seem to work with pdftotext... it just outputs two CRLFs. I've been

Re: [PHP] PDF to Text

2006-04-21 Thread Ray Hauge
On Thursday 20 April 2006 19:23, Richard Lynch wrote: > Actually, it's "possible" just bloody difficult. > > You're looking into a topic known as OCR (Optical Character Recognition). > > One OS project for this is: > GOCR (aka JOCR) > It's GOCR on freshmeat and JOCR on sourceforge because they name

RE: [PHP] PDF to Text

2006-04-20 Thread Richard Lynch
On Thu, April 20, 2006 8:59 pm, Jay Blanchard wrote: > [snip] >> I am trying to find a way for a program to search through the text on a PDF. My first thought was to use pdftotext, but the PDFs generated by our commercial scanner/copier/printer machine do not seem to work with pd

RE: [PHP] PDF to Text

2006-04-20 Thread Jay Blanchard
[snip] > I am trying to find a way for a program to search through the text on a > PDF. My first thought was to use pdftotext, but the PDFs generated by our > commercial scanner/copier/printer machine do not seem to work with > pdftotext... it just outputs two CRLFs. I've been looking around on th

Re: [PHP] PDF to Text

2006-04-20 Thread Ray Hauge
On Thursday 20 April 2006 18:06, Ray Hauge wrote: > Hello List, > > I am trying to find a way for a program to search through the text on a > PDF. My first thought was to use pdftotext, but the PDFs generated by our > commercial scanner/copier/printer machine do not seem to work with > pdftotext...

[PHP] PDF to Text

2006-04-20 Thread Ray Hauge
Hello List, I am trying to find a way for a program to search through the text on a PDF. My first thought was to use pdftotext, but the PDFs generated by our commercial scanner/copier/printer machine do not seem to work with pdftotext... it just outputs two CRLFs. I've been looking around on
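
The symptom described in this thread (pdftotext producing only a couple of CRLFs) is typical of a scanned PDF that holds page images and no text layer. A minimal sketch of how a program might detect that case before handing the file to an OCR tool, assuming pdftotext is on the PATH; the function names `looks_image_only` and `extract_text` are hypothetical and not from the thread:

```python
import subprocess


def looks_image_only(extracted_text: str) -> bool:
    """Return True when extraction produced nothing but whitespace.

    Output such as "\r\n\r\n" suggests the PDF has no text layer
    and would need OCR instead of plain text extraction.
    """
    return extracted_text.strip() == ""


def extract_text(pdf_path: str) -> str:
    """Run pdftotext on pdf_path and return the extracted text.

    Passing "-" as the output file makes pdftotext write to stdout.
    Assumes the pdftotext binary is installed.
    """
    result = subprocess.run(
        ["pdftotext", pdf_path, "-"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

A caller could run `extract_text()` first and fall back to an OCR step only when `looks_image_only()` returns True, so that PDFs which already carry a text layer skip the slower path.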