Leonard Rosenthol wrote:

At 01:51 PM 3/23/2006, Nicholas Mistry wrote:

I am using iText to merge a series of tiff documents into PDF. After the merge we use acrobat professional 7's OCR to allow us to search the entire document.
 This works great.


        OK.


Recently, i added some page marks (text) at the top of each page (using iText), denoting page numbers, etc.. This now broke the ability to have Acrobat OCR
the PDF, since it now contains rendered text.


That is correct, as the Acrobat OCR engine will ONLY process "image only" documents.


My question, is there a way to add some annotations to the page, but still allow
acrobat to OCR the page?


I would recommend that you OCR first and THEN apply your "annotations".


My intial gut feeling is to render the text as images, and place them on the
doc... but i wanted ask if there was an easier way.


That won't help either, as you can only have a SINGLE image on the page for Acrobat to OCR it. It won't do it for multiple ones, IIRC.


Well, i just wrote a test program that inserts multiple tiff files on a page. Surprizingly acrobat actually OCR'd it. Waching the status bar closely, Acrobat first rasterizes the entire page, and then passes it to the OCR engine. This was tested on Acrobat 7 Professional, im not sure about previous versions.

Now this leads me to another question... Why couldnt they have rasterized the text as well? Or better yet, ignore the text portion completely.. Anyways, its a workaround... for now.. I am still interested in learning how to create searchable images, and may add it to the app later..

Thanks again!

-N



-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions

Reply via email to