The preliminary regression results for 2.4.0 are here:
https://corpora.tika.apache.org/base/reports/tika-2.4.0-reports.tgz

We have some new exceptions caused by the new http parser; many of
them are on files that are truncated or malformed.  I view this as a
good thing.

We have newly identified dgn7 and dgn8.

Many more files that were identified as tika-ooxml and tika ole are
now identified as the more specific xlsx, docx, etc., which is good.

The ppt that Tilman identified is a new exception in 2.4.0 as well,
and we need to fix that.

Once we fix the ppt issue, I'll rerun the regression tests.  Please
let me know if you see anything else.

Best,

            Tim
