Hi,

I've nothing to add but +++++1

BR
Andreas Lehmkühler

On 23.01.2015 at 09:14, Maruan Sahyoun wrote:
Hi Tilman,

let me take the opportunity to say thank you for your efforts around code quality and 
testing. That work doesn't result in "hey that's a great new feature", but it is a very 
important part of development, one that is often not directly visible yet takes time 
and dedication.

Sincerely yours
Maruan

On 23.01.2015 at 09:00, Tilman Hausherr <thaush...@t-online.de> wrote:

Hi,

Besides the "very broken files" (which result in errors from bad parameters for 
the PDF operators), there are the out-of-memory exceptions on huge files. I think that 
there are at most 5-10 files left with problems that can be solved. When the Isartor 
improvements are done, I'll start a new test with a bigger memory setting, and will also 
open issues on the exceptions that I believe can be fixed.

Tilman

On 23.01.2015 at 08:54, Maruan Sahyoun wrote:
Hi Tilman,

that's very positive. Not only is the number of failures down by another 45%, but 
the time has also been reduced a lot. That might be a hint that some of the internal 
changes (parsing, closing …) and improvements in code quality are starting to pay off.

For the 79 files - could you be a little more specific about which errors we get? Are 
these still the ones mentioned in your earlier post?

BR

Maruan

On 23.01.2015 at 08:45, Tilman Hausherr <thaush...@t-online.de> wrote:

total: 231223, failed: 79, percentage failed (exceptions other than the 
"allowed" ValidationExceptions): 0.03416585677769035%

This time it took only 2 days instead of 4. Maybe the change with closing made 
it faster?

(This was done about a week ago, I forgot to send the posting)

Tilman

On 05.12.2014 at 20:45, Tilman Hausherr wrote:
Some numbers... it took 4-5 days

total: 231223, failed: 142, percentage failed: 0.06141257472336292

Of these, one can subtract 33 OutOfMemoryErrors that happened near the end of 
the test. Isolated runs went fine.

about the rest:
18 are the isSymbol stackoverflow
9 are the getFontMatrix NPE
33 are the "root must be of type Pages" errors

The rest is mostly related to very broken PDF files.
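
The arithmetic behind these numbers can be re-derived with a few lines of Java. This is only a sketch: the class and method names are made up for illustration and are not part of PDFBox or preflight, and the category counts are simply the ones listed above.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of PDFBox: re-derives the figures quoted above.
final class FailureBreakdown {

    // Failure rate as a percentage of all tested files.
    static double percentFailed(int failed, int total) {
        return 100.0 * failed / total;
    }

    // Failures left over after subtracting the named categories.
    static int unexplained(int failed, Map<String, Integer> categories) {
        int sum = 0;
        for (int count : categories.values()) {
            sum += count;
        }
        return failed - sum;
    }

    public static void main(String[] args) {
        int total = 231223;
        int failed = 142;

        // Counts as listed in the posting.
        Map<String, Integer> categories = new LinkedHashMap<>();
        categories.put("OutOfMemoryError", 33);
        categories.put("isSymbol stack overflow", 18);
        categories.put("getFontMatrix NPE", 9);
        categories.put("root must be of type Pages", 33);

        System.out.println("percentage failed: " + percentFailed(failed, total));
        System.out.println("remaining (mostly very broken files): "
                + unexplained(failed, categories));
    }
}
```

With the counts above, 93 of the 142 failures fall into the four named categories, which leaves 49 for the "very broken files" group.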

Tilman


On 04.12.2014 at 14:55, Maruan Sahyoun wrote:
Hi Tilman,

that's very good news. I trust a lot of time went into reviewing the test 
results. Without your and Tim's efforts, this achievement wouldn't have been possible.

BR

Maruan

On 03.12.2014 at 21:04, Tilman Hausherr <thaush...@t-online.de> wrote:

I've now run preflight on half of the govdocs files. Every issue I have opened on 
preflight is related to that test. The failure rate (exceptions other than the 
"allowed" ValidationExceptions) is down from 1% when I started to 0.05% now. 
Most of the frequent exceptions (e.g. the one with NonTerminalField) have been fixed. 
What's left now are exceptions related to messy files, and some of the font-related issues.
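
The way such a failure rate is counted, with anything other than the "allowed" validation exception treated as a failure, can be sketched roughly as follows. This is not the actual test harness and does not use the preflight API: `AllowedValidationException`, `FileCheck` and `countFailures` are hypothetical stand-ins chosen for illustration.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical harness; none of these names come from PDFBox or preflight.
final class FailureRateSketch {

    // Stand-in for the "allowed" exception: the file was rejected cleanly.
    static class AllowedValidationException extends Exception {
        AllowedValidationException(String message) {
            super(message);
        }
    }

    // Stand-in for whatever validates a single file.
    interface FileCheck {
        void check(String path) throws Exception;
    }

    // Counts files whose check ended in anything other than the allowed exception.
    static int countFailures(List<String> paths, FileCheck check) {
        int failed = 0;
        for (String path : paths) {
            try {
                check.check(path);
            } catch (AllowedValidationException e) {
                // Expected outcome for an invalid file; not counted as a failure.
            } catch (Exception | StackOverflowError | OutOfMemoryError e) {
                // NPEs, stack overflows, out-of-memory errors and so on.
                failed++;
            }
        }
        return failed;
    }

    public static void main(String[] args) {
        List<String> paths = Arrays.asList("ok.pdf", "invalid.pdf", "broken.pdf");

        // Fake check that simulates the three possible outcomes.
        FileCheck fake = path -> {
            if (path.startsWith("invalid")) {
                throw new AllowedValidationException("simulated validation error");
            }
            if (path.startsWith("broken")) {
                throw new NullPointerException("simulated NPE");
            }
        };

        int failed = countFailures(paths, fake);
        System.out.println("total: " + paths.size() + ", failed: " + failed
                + ", percentage failed: " + (100.0 * failed / paths.size()));
    }
}
```

Catching `StackOverflowError` and `OutOfMemoryError` explicitly is a choice made for this sketch so that a single bad file does not abort the whole run; after an out-of-memory error the JVM may be in a poor state, so re-running such files in isolation is the safer check.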

Tilman

On 03.11.2014 at 22:58, Tilman Hausherr wrote:
On 03.11.2014 at 19:00, Tilman Hausherr wrote:
It is not looking good, there is at least one NPE issue coming.
No more NPEs after solving the two issues I opened today, except for PDFBOX-1743.pdf, 
which is a known problem.

Coming up soon: run preflight on the 231227 PDF files from digitalcorpora to 
see what happens.

Tilman






