[
https://issues.apache.org/jira/browse/TIKA-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18076335#comment-18076335
]
Tim Allison commented on TIKA-4683:
-----------------------------------
I ran another intermediate regression test and found definite performance
improvements in charset detection with some frustrating regressions. I swapped
out to another method and think we're on a better footing. I'm running the eval
now with those changes.
In all likelihood, there will be surprises. If there are, I'll leave the
existing code as is, but switch the default encoding detection back to what we
have in 3.x and move forth with a final eval and then to alpha. We can tweak
the new encoding detection code later.
Let's see what we find...
> Prep for 4.0.0-ALPHA release
> ----------------------------
>
> Key: TIKA-4683
> URL: https://issues.apache.org/jira/browse/TIKA-4683
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Major
> Attachments: reports-4.0.0-20260411.tgz, reports.tar.gz
>
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)