The preliminary regression results for 2.4.0 are here: https://corpora.tika.apache.org/base/reports/tika-2.4.0-reports.tgz
We have some new exceptions caused by the new http parser, many of them on files that are truncated or malformed. I view this as a good thing.

dgn7 and dgn8 files are now newly identified. Many more files that were previously detected as the generic tika-ooxml and tika OLE types are now identified as more specific types (xlsx, docx, etc.), which is good.

The ppt that Tilman identified is a new exception in 2.4.0 as well, and we need to fix that. Once we fix the ppt issue, I'll rerun the regression tests.

Please let me know if you see anything else.

Best,
Tim