>
> I will happily recommend Tesseract.
>
> http://code.google.com/p/tesseract-ocr/
>
> Here's a how-to on how to do PDF to text, though I've yet to be able to
> convert PDF to TIFF yet...

I wrote a bash script to do that once.  Descends into subdirectories
etc. and makes a duplicate directory structure of tiffs.

It used pdftoppm and then ppm2tiff.

Seemed to work pretty good for me when I was testing.  Never really
used it for production work.

If your interested in the script (and you have some bash scripting
skills so you can read it), let me know in a private e-mail.  I send
you a copy.

Greg
-- 
Greg Freemyer
Litigation Triage Solutions Specialist
http://www.linkedin.com/in/gregfreemyer
First 99 Days Litigation White Paper -
http://www.norcrossgroup.com/forms/whitepapers/99%20Days%20whitepaper.pdf

The Norcross Group
The Intersection of Evidence & Technology
http://www.norcrossgroup.com
-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to