On 13 July 2011 19:55, D. Hugh Redelmeier <hugh at mimosa.com> wrote: > I think I want to capture to image files (possibly PDF) but supplement > this with OCRed text to facilitate full-text search. ?I need a format > that will still be useful for decades into the future. ?I want the > capture process to be as easy as possible -- otherwise I'll never get > it done.
I wrote gscan2pdf http://gscan2pdf.sourceforge.net/ to do exactly this. It supports PDF or DjVu, which has better compression. It makes it simple to scan, do some image processing, OCR, and save. Regards Jeff