[ 
https://issues.apache.org/jira/browse/PDFBOX-2610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14284144#comment-14284144
 ] 

Tilman Hausherr commented on PDFBOX-2610:
-----------------------------------------

I found this 2012 paper by [~johanvanderknijff] here: 
http://openpreservation.org/system/files/pdfProfilingJvdK19122012.pdf

{quote}
In addition to this I also ran Apache Preflight on the 'Bavaria test suite' 
(...)

Preflight raised an exception for 5 out of 85 files in the dataset (6%), which 
indicates that at this stage it may simply not be sufficiently stable or mature 
for operational use. 
{quote}

The good news is that there were no exceptions when I started. The false 
positives / false negatives are now down to 7 (from 16). The bad news is that 
the remaining problems are more difficult, and that I haven't investigated the 
reason why sometimes more PDFA errors are reported than in the Bavaria XML file.

> Expand Isartor test for Bavaria test suite and other tests
> ----------------------------------------------------------
>
>                 Key: PDFBOX-2610
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2610
>             Project: PDFBox
>          Issue Type: Task
>          Components: Preflight
>    Affects Versions: 2.0.0
>            Reporter: Tilman Hausherr
>            Assignee: Tilman Hausherr
>
> 1) Expand the isartor test code so that it can also check conforming 
> documents, i.e. documents that should not bring any errors. Support JBIG2.
> 2) Test the files from the Bavaria suite with preflight. I'll create 
> sub-issues on that one. I counted 16 where something doesn't work as intented.
> 3) Include the Bavaria tests in the build. Only if we agree on this one. If 
> not, I'll just keep it for myself as an additional regression test.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to