At 06:59 AM 1/24/2006, Aaron J Weber wrote:
Since I'm batch-processing, I don't have the luxury of opening each document and trying to select some text from Acrobat to test. I need an automated method of pre-validating the PDF so it doesn't cause Capture to hang.

Thus my question: Does anyone have a snippet that "peeks" at a PDF's contents and checks to see if it's PDF+Text (of some kind)? Since I don't have a standard "search string" to check for, I guess I'd be content just checking if ANY text is readable/exists in the PDF or if the PDF is "Image Only".


There is nothing built into iText to do this, though you could certainly use the low level features to write something (assuming some understanding of how text works in PDF).

OR look at other solutions - be it open source tools like PdfBox or commercial solutions such PDFspy.

Leonard

---------------------------------------------------------------------------
Leonard Rosenthol                            <mailto:[EMAIL PROTECTED]>
Chief Technical Officer                      <http://www.pdfsages.com>
PDF Sages, Inc.                              215-938-7080 (voice)
                                             215-938-0880 (fax)



-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&dat=121642
_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions

Reply via email to