Am 15.06.2022 um 12:19 schrieb Tim Allison:
Reports are here:
https://corpora.tika.apache.org/base/reports/pdfbox-3-20220614.tgz

govdocs1/372/372582.pdf
commoncrawl3/KH/KHDACXIPFMWP632LZ3S4TRRSZPDGHGM5
commoncrawl3/VN/VNCWMY6Y4C3XYWA65CQPPSNZSY6OQEEA

have lost text. But the first one is a mess even with with 2.0.26, and the two others are truncated. I wonder if we really need to bother.

Tilman

Reply via email to