In theory yes, I do not know how this will work in practice. You might need to run the command on each page of a PDF. At this point I do not believe that the Google OCR system I am employing support layout. I'm just starting to play with it.

As with any OCR system you have to have enough resolution in the image. At least 150 dpi.


Greg Kearney
535 S. Jackson St.
Casper, Wyoming 82601
307-224-4022
[EMAIL PROTECTED]



On Nov 22, 2007, at 10:29 PM, Rafael Bejarano wrote:

Hello Greg,

I would find such a program extremely useful. I often download pdf files that I cannot read unless I first print them. If I understand your suggestion, I could simply run such files through your program, which would convert them to text files. Is that right?

Also, would the lay-out of documents be preserved? For example, would the columns be preserved in a multiple column document?

Cordially,
Rafael Bejarano
On Nov 22, 2007, at 11:09 PM, Greg Kearney wrote:

OK folks time to try out an idea on everyone.

What I'm thinking of is a program I'm going to call iOCR. This program will take the front most document of Preview and run OCR on it and then open the resulting text in TextEdit. I'm thinking of using Googles open source OCR which I have working on a Mac. It seems to do a good job and can deal with upside down pages.

This will likely be an Intel, Leopard only thing at first.

So useful or not?

Greg Kearney
535 S. Jackson St.
Casper, Wyoming 82601
307-224-4022
[EMAIL PROTECTED]








Reply via email to