> > I will happily recommend Tesseract. > > http://code.google.com/p/tesseract-ocr/ > > Here's a how-to on how to do PDF to text, though I've yet to be able to > convert PDF to TIFF yet...
I wrote a bash script to do that once. Descends into subdirectories etc. and makes a duplicate directory structure of tiffs. It used pdftoppm and then ppm2tiff. Seemed to work pretty good for me when I was testing. Never really used it for production work. If your interested in the script (and you have some bash scripting skills so you can read it), let me know in a private e-mail. I send you a copy. Greg -- Greg Freemyer Litigation Triage Solutions Specialist http://www.linkedin.com/in/gregfreemyer First 99 Days Litigation White Paper - http://www.norcrossgroup.com/forms/whitepapers/99%20Days%20whitepaper.pdf The Norcross Group The Intersection of Evidence & Technology http://www.norcrossgroup.com -- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]