All,
I just posted the first stacktrace report from my initial partial batch run
of against govdocs1 here:
https://issues.apache.org/jira/secure/attachment/12744700/pdfbox_reports_2_0_0_20150709.zip
Caveats/Notes
The run yesterday did not include the fixes that were made in PDFBOX-2370 or
PDFBOX-2862.
I stopped the batch run early. This only covered ~50k pdfs.
I forgot to turn on accesspermission checking. Some of the pdfs in here would
normally have been skipped.
I haven't reviewed any of the exceptions. They may be caused by code on the
Tika side.
I'll plan to re-run with the latest trunk on Tuesday. I need to turn back to
the actual eval code for a bit. :)
Cheers,
Tim