Can we / I access these files? Most differences are improvements or not meaningful, but there are a few I'd like to have a look, e.g.

commoncrawl3/commoncrawl3/XO/XOAAGISRMRPZQRZF4LSMJERGEYK5QI2T

the word "antrag" loses the first "a". Although maybe the "a" was a big one and gets assigned to another line.

Tilman

Am 02.06.2020 um 02:58 schrieb Tim Allison:

Reports are available here:
https://github.com/tballison/share/blob/master/tika_comparisons/reports-pdfbox-2.0.20.tgz

Looks like there are trivial differences in content with a slight
improvement over 2.0.19.  I don't see any differences in exceptions or
attachments.

Cheers,

         Tim



---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org

Reply via email to