[
https://issues.apache.org/jira/browse/TIKA-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18074371#comment-18074371
]
Tim Allison edited comment on TIKA-4683 at 4/17/26 6:43 PM:
------------------------------------------------------------
Outlook is ok: we're no longer inlining the to/from/cc/bcc into the body where
I don't think it belongs. If I'm misunderstanding the issue, though, please let
me know. We did do some significant refactoring between 3.x and 4x on handling
rtf bodies within msg, and you might be pointing to something else?
Encoding...I struggled with this quite a bit over the last week. I pushed a
slightly better version of the encoding detector, but I know that it will still
fail on short mostly ascii texts. I've reached the point of living with it for
4.0.0-ALPHA and opening a ticket/blocker for 4.0.0-BETA.
x-java-pack200. That requires a fix in commons-compress or we need to add the
workaround. I'd prefer to leave as is for 4.0.0-ALPHA.
I opened up 4? prs today for other issues found during a careful review of the
regression results. Most of those are merged now, and I've kicked off another
regression run.
was (Author: [email protected]):
Outlook is ok: we're no longer inlining the to/from/cc/bcc into the body where
I don't think it belongs. If I'm misunderstanding the issue, though, please let
me know. We did do some significant refactoring between 3.x and 4x on handling
rtf bodies within html, and you might be pointing to something else?
Encoding...I struggled with this quite a bit over the last week. I pushed a
slightly better version of the encoding detector, but I know that it will still
fail on short mostly ascii texts. I've reached the point of living with it for
4.0.0-ALPHA and opening a ticket/blocker for 4.0.0-BETA.
x-java-pack200. That requires a fix in commons-compress or we need to add the
workaround. I'd prefer to leave as is for 4.0.0-ALPHA.
I opened up 4? prs today for other issues found during a careful review of the
regression results. Most of those are merged now, and I've kicked off another
regression run.
> Prep for 4.0.0-ALPHA release
> ----------------------------
>
> Key: TIKA-4683
> URL: https://issues.apache.org/jira/browse/TIKA-4683
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Major
> Attachments: reports-4.0.0-20260411.tgz, reports.tar.gz
>
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)