If you use PdfSmartCopy, you at least won't get duplicates from the same
document.  However, iText doesn't currently look to see if it has (for
example) two identical copies of a font from separate files.

It might be easier to come up with a separate utility to go over a given
PDF and remove redundant objects, and fix up references to them.  Not
easy, but quite possible.

Ugh.. My initial thought was to build a hash for each object in a PDF,
and look closely at hash matches to find duplicates.  But it won't work.
XObject A might be identical to XObject B, save that they reference
different copies of the same image.  A single pass hash of their
toString output would not catch the match, though it would remove the
redundant image.  Multiple passes would be necessary until a pass found
no matches.  Not Efficient at all, but serviceable. 

You might end up with a resource directory that had several aliases for
the same font/image/whatever, but I could live with that.


--Mark Storer
  Senior Software Engineer
  Cardiff.com
 
import legalese.Disclaimer;
Disclaimer<Cardiff> DisCard = null;
 
 

> -----Original Message-----
> From: mukkakish...@yahoo.co.in [mailto:mukkakish...@yahoo.co.in] 
> Sent: Thursday, September 02, 2010 2:28 AM
> To: itext-questions@lists.sourceforge.net
> Subject: [iText-questions] iText combine size large and 
> fonts/backgroundsduplicated
> 
> Hello Everyone,
> 
> We have an issue with huge size of combined PDF File using iText.
> 
> We have written a logic to combine the pdf files using iText 
> and we observed that, iText Combines PDF  successfully. But, 
> not removing duplicate fonts, backgrounds.
> 
> We tried to perform the same merge using Adobe Standard 
> version. The size out of it is 2 MB for 200 files.
> 
> But, the same for iText Merge is 15MB. So,we missed something 
> and upon investigation, we found that, it is with duplicate 
> fonts and backgrounds. Adobe does a consolidation of 
> duplication of fonts and page backgrounds.
> 
> Our source of pdf files are output from a Single Crystal 
> Report. Users will be combining these reports based on their 
> requirement from the Crystal Output files. So, there is a 
> single type of Crystal Report output in PDF format.
> 
> Could you please advise, if we have any Class/function to do this.
> 
> Thank you
> Kishore Mukka
> 
> --
> This message was sent on behalf of mukkakish...@yahoo.co.in 
> at openSubscriber.com 
> http://www.opensubscriber.com/message/itext-questi...@lists.so
urceforge.net/5762681.html
> 
> --------------------------------------------------------------
> ----------------
> This SF.net Dev2Dev email is sponsored by:
> 
> Show off your parallel programming skills.
> Enter the Intel(R) Threading Challenge 2010.
> http://p.sf.net/sfu/intel-thread-sfd
> _______________________________________________
> iText-questions mailing list
> iText-questions@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/itext-questions
> 
> Buy the iText book: http://www.itextpdf.com/book/ Check the 
> site with examples before you ask questions: 
> http://www.1t3xt.info/examples/ You can also search the 
> keywords list: http://1t3xt.info/tutorials/keywords/
> 
> 
> No virus found in this incoming message.
> Checked by AVG - www.avg.com
> Version: 9.0.851 / Virus Database: 271.1.1/3102 - Release 
> Date: 09/01/10 11:34:00
> 

------------------------------------------------------------------------------
This SF.net Dev2Dev email is sponsored by:

Show off your parallel programming skills.
Enter the Intel(R) Threading Challenge 2010.
http://p.sf.net/sfu/intel-thread-sfd
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions

Buy the iText book: http://www.itextpdf.com/book/
Check the site with examples before you ask questions: 
http://www.1t3xt.info/examples/
You can also search the keywords list: http://1t3xt.info/tutorials/keywords/

Reply via email to