[jira] [Comment Edited] (PDFBOX-3323) Cannot set destination meta data in PDFMergerUtility

Alexander Kriegisch (JIRA) Mon, 25 Apr 2016 07:54:49 -0700

    [ 
https://issues.apache.org/jira/browse/PDFBOX-3323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15256413#comment-15256413
 ]


Alexander Kriegisch edited comment on PDFBOX-3323 at 4/25/16 2:53 PM:
----------------------------------------------------------------------

Sorry, it only works on Windows. Our Jenkins server running under Linux 
produces bogus XMP during the merge. The relevant part of the PDF file when 
opened in a text editor looks like this:

{code}
</rdf:Description>
</rdf:RDF>
</x:xmpmeta><?xpacket end="w"?>
[Here is a series of 1042 NUL bytes, then CRLF while at the other lines there 
only is simple LF at the end of each line.]
endstream
endobj
{code}

*Update:* I also noticed that under Windows the XMP data have CRLF at the end 
of each XML line. Does that somehow account for a difference in buffer size? 
Just speculating. But OTOH my XMP section does not have 1042 lines of code, 
only 31.

*Update 2:* The document can be opened in Acrobat, the properties look good. 
Even veraPDF says it is PDF/A-1b compliant. But Apache Preflight says: "7.1: 
Error on MetaData, Failed to parse". The online checker at 
https://www.pdflib.com/knowledge-base/xmp-metadata/free-xmp-validator/ says: 
"XMP data is invalid. Reason: XMP metadata: Error parsing XML stream: not 
well-formed (invalid token) in line 32, column 0" Maybe it would be a good idea 
if PDFBox would create merged PDFs which are accepted by its own sibling 
Preflight. ;-)


was (Author: kriegaex):
Sorry, it only works on Windows. Our Jenkins server running under Linux 
produces bogus XMP during the merge. The relevant part of the PDF file when 
opened in a text editor looks like this:

{code}
</rdf:Description>
</rdf:RDF>
</x:xmpmeta><?xpacket end="w"?>
[Here is a series of 1042 NUL bytes, then CRLF while at the other lines there 
only is simple LF at the end of each line.]
endstream
endobj
{code}

*Update:* I also noticed that under Windows the XMP data have CRLF at the end 
of each XML line. Does that somehow account for a difference in buffer size? 
Just speculating. But OTOH my XMP section does not have 1042 lines of code, 
only 31.

*Update 2:* The document can be opened in Acrobat, the properties look good. 
Even veraPDF says it is PDF/A-1b compliant. But Apache Preflight says: "7.1: 
Error on MetaData, Failed to parse". The online checker at 
https://www.pdflib.com/knowledge-base/xmp-metadata/free-xmp-validator/ says: 
"XMP data is invalid. Reason: XMP metadata: Error parsing XML stream: not 
well-formed (invalid token) in line 32, column 0"

> Cannot set destination meta data in PDFMergerUtility
> ----------------------------------------------------
>
>                 Key: PDFBOX-3323
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3323
>             Project: PDFBox
>          Issue Type: Improvement
>    Affects Versions: 1.8.9, 2.0.0
>            Reporter: Alexander Kriegisch
>            Assignee: Tilman Hausherr
>              Labels: merge, metadata
>             Fix For: 2.0.1, 2.1.0
>
>
> When merging multiple PDFs into one compound document via 
> {{PDFMergerUtility}}, meta data like title, author, subject cannot be set but 
> seem to be taken from one of the input documents. This is usually not the 
> desired behaviour because as a user I have no direct influence on the meta 
> data. As a user I would like to explicitly set or at least overwrite certain 
> meta data for the destination document. Currently I can only set the 
> destination stream or file name, but not the meta data.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Comment Edited] (PDFBOX-3323) Cannot set destination meta data in PDFMergerUtility

Reply via email to