Hi Everybody,

I am looking for a clever way to merge some extra information into existing PDF files.

Background: We have a bunch of research papers (PDF) which should be "stamped" on the first page with some small box containing extra information. The information to add includes a link to the publisher data about the paper (http://doi.xxx). Until now this is done by editing each PDF manually in Acrobat, which is a real PITA. So I am developing a scriptable solution.

The general idea is to use a combination of LyX/LaTeX/BibTeX and Python to extract and typeset the information from a BibTeX database into a single page PDF, which is then stamped on (merged into) the first page of the existing PDF.

The challenge are hyperlinks and the toc. Both PDFs - the original one and the stamp - may contain hyperlinks which I do not want to lose in the merging process. The toc (if present) should be taken from the original paper:

1) My first idea was to use the pdfpages package: The stamp document just includes all pages of the original PDF, resulting in a merged version. However, pdfpages does not preserve hyperlinks of the included pages and the toc is lost.

2) Then I tried to use the pdftk tool (http://www.pdfhacks.com/pdftk/), which provides means stamp one PDF document into another:

pdftk paper.pdf stamp stamp.pdf output paper_stamped.pdf

Thereby, hyperlinks and toc of paper.pdf make it into the resulting paper_stamped.pdf - excellent. However, in this case the hyperlink from the stamp is lost.

So it seems that I can preserve hyperlinks from either the stamp or the original PDF, but not both :-(

Are there other tools or packages I can look into?
Is there a completely different approach I might try?

Thanks a lot!

Daniel


Reply via email to