Hi Guillaume,

in order to validate a PDF against the spec you have to make sure that all the 
content is compliant. PDFBox gives you methods to get the content in individual 
parts but at that point in time itself doesn't make sure that the pdf is valid. 
E.g. there is some code in PDFBox to work around issues found in real world 
PDF's which are outside the spec.  In addition to that various versions of PDF 
introduced new capabilities. So you might also want to check if a PDF complies 
to a certain version or standard such as PDF/A or PDF/A. 

For PDF/A there is a validator as part of the PDFBox project. The validator 
passes certain test suites and is consistently enhanced. So to ensure 
compliance with PDF/A this is you best option at that point in time. For other 
checks you would have to do it yourself. 

Maruan Sahyoun

Am 22.01.2013 um 07:54 schrieb Guillaume Bailleul <[email protected]>:

> Hi All,
> 
> Is there a way to validate a pdf with PDFBox? I mean to ensure that
> the document complies with the PDF Reference.
> 
> My idea was to load the document and then ensure each xobject is
> parsed (retrieving each one). Is it a good way to do it ?
> 
> BR,
> 
> Guillaume

Reply via email to