All, I just posted the first stacktrace report from my initial partial batch run of against govdocs1 here: https://issues.apache.org/jira/secure/attachment/12744700/pdfbox_reports_2_0_0_20150709.zip
Caveats/Notes The run yesterday did not include the fixes that were made in PDFBOX-2370 or PDFBOX-2862. I stopped the batch run early. This only covered ~50k pdfs. I forgot to turn on accesspermission checking. Some of the pdfs in here would normally have been skipped. I haven't reviewed any of the exceptions. They may be caused by code on the Tika side. I'll plan to re-run with the latest trunk on Tuesday. I need to turn back to the actual eval code for a bit. :) Cheers, Tim