[ https://issues.apache.org/jira/browse/PDFBOX-2643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297383#comment-14297383 ]
Maruan Sahyoun commented on PDFBOX-2643: ---------------------------------------- IMHO the xml is valid, as - xmpMM refers to the XMP Media Management Schema which is predefined in XMP as of January 2004 - xmpMM:DerivedFrom is a property describe in the schema as a ResourceRef where not all entries are required - stRef:instanceID and stRef:documentID are valid properties of xmlmm:DerivedFrom - stRef:instanceID and stRef:documentID both describing a URI for which the content is valid > "XMP type violation in stRef:instanceID" not reported by preflight > ------------------------------------------------------------------ > > Key: PDFBOX-2643 > URL: https://issues.apache.org/jira/browse/PDFBOX-2643 > Project: PDFBox > Issue Type: Sub-task > Components: Preflight > Affects Versions: 2.0.0 > Reporter: Tilman Hausherr > Attachments: PDFA_Conference_2009_nc.pdf > > > In the Bavaria test suite, PDFLib claims that the attached file is not a > valid PDF/A-1b file, because "Property stRef:instanceID in document XMP > requires scheme identifier" or "XMP type violation in stRef:instanceID" (They > make both claims in Bavaria.xml). > {code} > <rdf:Description rdf:about="" > xmlns:xmpMM="http://ns.adobe.com/xap/1.0/mm/" > xmlns:stRef="http://ns.adobe.com/xap/1.0/sType/ResourceRef#"> > > <xmpMM:InstanceID>uuid:b429d411-e628-45ca-b932-d2c77fbe6cd3</xmpMM:InstanceID> > > <xmpMM:DocumentID>adobe:docid:indd:db084a4d-dbb2-11dc-ac34-beb3cc4028ec</xmpMM:DocumentID> > <xmpMM:RenditionClass>proof:pdf</xmpMM:RenditionClass> > <xmpMM:DerivedFrom rdf:parseType="Resource"> > > <stRef:instanceID>6544a661-c065-11dc-854c-dd4f35453e8b</stRef:instanceID> > > <stRef:documentID>adobe:docid:indd:fa7c6589-9f4a-11dc-9641-af983df728d7</stRef:documentID> > </xmpMM:DerivedFrom> > </rdf:Description> > {code} > PDF-Tools considers the file to be correct. But according to > http://www.pdflib.com/fileadmin/pdflib/pdf/pdfa/2009-05-04-Bavaria-report-on-PDFA-validation-accuracy.pdf > they don't raise the correct alarm for XMP violations. The PDFLib xmp > checker also considers the XMP to be correct. > "6544a661-c065-11dc-854c-dd4f35453e8b" does not look like a valid URI to me > although the regex mentioned at http://tools.ietf.org/html/rfc3986#appendix-B > thinks it is. > [~msahyoun] what do you get for that file? The Bavaria Testsuite is already 5 > years old, so maybe Adobe/Callas have improved their product. > (Another unreported error for that file is "xapGImg:height for xmp:Thumbnails > in document XMP does not match the actual base64-encoded image data") -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org