[ 
https://issues.apache.org/jira/browse/TIKA-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18074371#comment-18074371
 ] 

Tim Allison edited comment on TIKA-4683 at 4/17/26 6:43 PM:
------------------------------------------------------------

Outlook is ok: we're no longer inlining the to/from/cc/bcc into the body where 
I don't think it belongs. If I'm misunderstanding the issue, though, please let 
me know. We did do some significant refactoring between 3.x and 4x on handling 
rtf bodies within msg, and you might be pointing to something else?

Encoding...I struggled with this quite a bit over the last week. I pushed a 
slightly better version of the encoding detector, but I know that it will still 
fail on short mostly ascii texts. I've reached the point of living with it for 
4.0.0-ALPHA and opening a ticket/blocker for 4.0.0-BETA.

 

x-java-pack200. That requires a fix in commons-compress or we need to add the 
workaround. I'd prefer to leave as is for 4.0.0-ALPHA.

 

I opened up 4? prs today for other issues found during a careful review of the 
regression results. Most of those are merged now, and I've kicked off another 
regression run.


was (Author: [email protected]):
Outlook is ok: we're no longer inlining the to/from/cc/bcc into the body where 
I don't think it belongs. If I'm misunderstanding the issue, though, please let 
me know. We did do some significant refactoring between 3.x and 4x on handling 
rtf bodies within html, and you might be pointing to something else?

Encoding...I struggled with this quite a bit over the last week. I pushed a 
slightly better version of the encoding detector, but I know that it will still 
fail on short mostly ascii texts. I've reached the point of living with it for 
4.0.0-ALPHA and opening a ticket/blocker for 4.0.0-BETA.

 

x-java-pack200. That requires a fix in commons-compress or we need to add the 
workaround. I'd prefer to leave as is for 4.0.0-ALPHA.

 

I opened up 4? prs today for other issues found during a careful review of the 
regression results. Most of those are merged now, and I've kicked off another 
regression run.

> Prep for 4.0.0-ALPHA release
> ----------------------------
>
>                 Key: TIKA-4683
>                 URL: https://issues.apache.org/jira/browse/TIKA-4683
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>         Attachments: reports-4.0.0-20260411.tgz, reports.tar.gz
>
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to