Hi Tilman,

that's very positive. Not only the number of failures is down by another 45%  
also the time has been reduced a lot. Might be a hint that some of the internal 
changes (parsing, closing …) and improvements in code quality start to pay off.

For the 79 files - could you be a little more specific which errors we get? Are 
these still the ones mentioned in you earlier post?

BR

Maruan

Am 23.01.2015 um 08:45 schrieb Tilman Hausherr <thaush...@t-online.de>:

> total: 231223, failed: 79, percentage failed (exceptions other than the 
> "allowed" ValidationExceptions): 0.03416585677769035%
> 
> This time it took only 2 days instead of 4. Maybe the change with closing 
> made it faster?
> 
> (This was done about a week ago, I forgot to send the posting)
> 
> Tilman
> 
> Am 05.12.2014 um 20:45 schrieb Tilman Hausherr:
>> Some numbers... it took 4-5 days
>> 
>> total: 231223, failed: 142, percentage failed: 0.06141257472336292
>> 
>> Of these, one can substract 33 OutOfMemoryErrors that happened near the end 
>> of the test. Isolated runs went fine.
>> 
>> about the rest:
>> 18 are the isSymbol stackoverflow
>> 9 are the getFontMatrix NPE
>> 33 are the "root must be of type Pages" errors
>> 
>> The rest is mostly related to very broken PDF files.
>> 
>> Tilman
>> 
>> 
>> Am 04.12.2014 um 14:55 schrieb Maruan Sahyoun:
>>> Hi Tilman,
>>> 
>>> that's very good news. I trust a lot of time went into reviewing the test 
>>> results. wo your and Tim's efforts this achievement wouldn't have been 
>>> possible.
>>> 
>>> BR
>>> 
>>> Maruan
>>> 
>>> Am 03.12.2014 um 21:04 schrieb Tilman Hausherr <thaush...@t-online.de>:
>>> 
>>>> I've now run preflight on half of the govdocs files. Every issue I have 
>>>> opened on preflight is related to that test. The failure rate (exceptions 
>>>> other than the "allowed" ValidationExceptions) is down from 1% when I 
>>>> started to 0.05% now. Most of the frequent exceptions (e.g. the one with 
>>>> NonTermimalField) have been fixed. Whats left now are exceptions related 
>>>> to messy files, and some of the font related issues.
>>>> 
>>>> Tilman
>>>> 
>>>> Am 03.11.2014 um 22:58 schrieb Tilman Hausherr:
>>>>> Am 03.11.2014 um 19:00 schrieb Tilman Hausherr:
>>>>>> It is not looking good, there is at least one NPEs issue coming.
>>>>> No more NPE after solving the two issues I opened today except 
>>>>> PDFBOX-1743.pdf which is a known problem.
>>>>> 
>>>>> Coming up soon: run preflight on the 231227 PDF files from digitalcorpora 
>>>>> to see what happens.
>>>>> 
>>>>> Tilman
>>>>> 
>>> 
>> 
> 

Reply via email to