[iText-questions] Copying OCR'd hidden text from one PDF to another while retaining original images?

Arthur Murray Thu, 12 Jan 2012 15:07:24 -0800

Is there an example snippet that can help with this or a pointer on
how to approach this?


I have a scanned book as a PDF, for example this google one:
http://ia600307.us.archive.org/21/items/lightsandshadow00whipgoog/lightsandshadow00whipgoog.pdf

When I OCR this in AcrobatX the filesize grows from 12 megs to 54 megs
(the images get bigger even though I use Searchable Image "Exact").
I'd like to open the original non-OCRd PDF and copy the OCRed hidden
text from the second larger OCR'd PDF into it, hopefully retaining a
smaller image filesize but gaining the ability to search and highlight
the PDF.

Thanks.

------------------------------------------------------------------------------
RSA(R) Conference 2012
Mar 27 - Feb 2
Save $400 by Jan. 27
Register now!
http://p.sf.net/sfu/rsa-sfdev2dev2
_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions

iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference 
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples: 
http://itextpdf.com/themes/keywords.php

[iText-questions] Copying OCR'd hidden text from one PDF to another while retaining original images?

Reply via email to