Re: pdf-image: Handling size blow-out caused by fonts, any way to coalesce/merge multiple subsets of same font?

Chris Bowditch Tue, 13 Dec 2011 09:08:04 -0800

On 09/12/2011 06:57, Craig Ringer wrote:

Hi all


Hi Craig,

With pdf-image, is there any way to coalesce or merge multipledifferent subsets of the same font into a single font subset with noduplicate glyphs? Eg 50 different "Helvetica (subset)" instances intoa single font in the output document?
Background:
I've just got Jeremias's pdf-image extension integrated into my code.It worked perfectly and immediately with little effort, which wasdelightful. Thankyou *VERY* much Jeremias for publishing that, it's afantastic tool and I'd love to see it in fop core.
I'm encountering an unexpected issue with it, though: the PDFsproduced by fop are *huge*. Examination with Acrobat Pro suggests that90% of the space is taken up by fonts. Looking at the font list, I seehuge numbers of copies of "Helvetica (subset)", "Helvetica Black(subset)" etc. That makes sense, since all the input PDFs have fontsembedded, and many use the same fonts. However, I'm including up to1000 PDFs in each output PDF so the size adds up to prohibitive levels.

We also have the same problem and have been trying to find a solution;There is a cache within the PDF plug-in, but as soon as you change theway it works, memory usage seems to balloon. We did manage tode-duplicate the fonts though. We're still investigating the memoryissue. If we find a solution we will let you know.

I'm wondering if there's any way to tell the pdf-image extension toembed certain fonts fully from supplied font files and avoid copyingthe matching subsets over from the input PDFs. If there isn't anythinglike that, any idea how practical it'd be?
For that matter, is the idea of collecting up all the subsets of afont as each pdf-image is embedded, then merging them into a singlenew embedded subset at the end completely insane? Or is it potentiallypractical? For that matter just keeping track of which glyphs aredefined in each subset and building a new subset from a master fontfile at the end that included all those glyphs would help a lot.
I'm *really* hoping to avoid having to keep on using EPS input andPostScript output to PDF via Distiller, so I'm willing to put somework into this.

An alternative that we are planning on using if the memory issues withthe plug-in can't be solved is to generate the PDFs from FOP separate tothe static PDFs that you are importing and then use PDFBox in a postprocess to join the PDFs together at the end. Not ideal but it works.


Thanks,

Chris


--
Craig Ringer

---------------------------------------------------------------------
To unsubscribe, e-mail: fop-users-unsubscr...@xmlgraphics.apache.org
For additional commands, e-mail: fop-users-h...@xmlgraphics.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: fop-users-unsubscr...@xmlgraphics.apache.org
For additional commands, e-mail: fop-users-h...@xmlgraphics.apache.org

Re: pdf-image: Handling size blow-out caused by fonts, any way to coalesce/merge multiple subsets of same font?

Reply via email to