[ 
https://issues.apache.org/jira/browse/TIKA-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18076335#comment-18076335
 ] 

Tim Allison commented on TIKA-4683:
-----------------------------------

I ran another intermediate regression test and found definite performance 
improvements in charset detection with some frustrating regressions. I swapped 
out to another method and think we're on a better footing. I'm running the eval 
now with those changes.

In all likelihood, there will be surprises. If there are, I'll leave the 
existing code as is, but switch the default encoding detection back to what we 
have in 3.x and move forth with a final eval and then to alpha. We can tweak 
the new encoding detection code later.

Let's see what we find...

> Prep for 4.0.0-ALPHA release
> ----------------------------
>
>                 Key: TIKA-4683
>                 URL: https://issues.apache.org/jira/browse/TIKA-4683
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>         Attachments: reports-4.0.0-20260411.tgz, reports.tar.gz
>
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to