Hi Guillaume,

to validate a PDF against the spec you have to make sure that all of its content is compliant. PDFBox gives you methods to get at the individual parts of the content, but at this point in time it does not itself check that the PDF is valid. In fact, PDFBox contains code to work around issues found in real-world PDFs that fall outside the spec, so a document that loads and parses without errors is not necessarily compliant. In addition, the various PDF versions introduced new capabilities, so you might also want to check whether a PDF complies with a certain version or with a standard such as PDF/A.
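As a rough illustration of the version check only: every PDF file declares its version in a header comment of the form `%PDF-x.y` near the start of the file. The sketch below reads that header with plain Java and does not use PDFBox at all. The class and method names (`PdfHeaderVersion`, `declaredVersion`) are made up for this example, and the 1024-byte search window is an assumption borrowed from what lenient readers commonly accept. Note that it only reports what the file claims; it says nothing about whether the content actually conforms, and since PDF 1.4 a `/Version` entry in the document catalog may override the header.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of PDFBox.
public class PdfHeaderVersion {

    // Matches the header comment, e.g. "%PDF-1.4".
    private static final Pattern HEADER = Pattern.compile("%PDF-(\\d)\\.(\\d)");

    // Assumed search window; the header should be at the very start of the file.
    private static final int WINDOW = 1024;

    /** Returns the declared version (e.g. "1.4") if a header is found in the first bytes. */
    public static Optional<String> declaredVersion(byte[] head) {
        int length = Math.min(head.length, WINDOW);
        // ISO-8859-1 maps every byte to exactly one char, so binary data cannot break decoding.
        String text = new String(head, 0, length, StandardCharsets.ISO_8859_1);
        Matcher m = HEADER.matcher(text);
        return m.find() ? Optional.of(m.group(1) + "." + m.group(2)) : Optional.empty();
    }

    /** Reads only the beginning of the file and looks for the header there. */
    public static Optional<String> declaredVersion(Path file) throws IOException {
        try (InputStream in = Files.newInputStream(file)) {
            return declaredVersion(in.readNBytes(WINDOW)); // requires Java 11+
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java PdfHeaderVersion <file.pdf>");
            return;
        }
        Optional<String> version = declaredVersion(Paths.get(args[0]));
        System.out.println(version.map(v -> "Declared PDF version: " + v)
                                  .orElse("No PDF header found"));
    }
}
```

Run it as `java PdfHeaderVersion some.pdf`. A missing header is a strong hint that the file is broken, but a present header proves very little on its own.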
For PDF/A there is a validator as part of the PDFBox project. The validator passes certain test suites and is continuously being enhanced, so to ensure compliance with PDF/A this is your best option at this point in time. For other checks you would have to do it yourself.

Maruan Sahyoun

On 22.01.2013 at 07:54, Guillaume Bailleul <[email protected]> wrote:

> Hi All,
>
> Is there a way to validate a PDF with PDFBox? I mean to ensure that
> the document complies with the PDF Reference.
>
> My idea was to load the document and then ensure each XObject is
> parsed (retrieving each one). Is that a good way to do it?
>
> BR,
>
> Guillaume

