[
https://issues.apache.org/jira/browse/TIKA-4419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17953195#comment-17953195
]
Tim Allison edited comment on TIKA-4419 at 5/21/25 4:24 PM:
------------------------------------------------------------
I don't like the above hack. I'm not necessarily against it as a temporary
workaround.
Longer term, I've made a feature request to let us revert to the old jsoup
behavior: [https://github.com/jhy/jsoup/issues/2330]
:fingers-crossed:
was (Author: [email protected]):
I don't like the above hack. I'm not necessarily against it as a temporary
workaround.
Longer term, I've made a feature request to let us revert to the old jsoup
behavior: [https://github.com/jhy/jsoup/issues/2330]
> Deal with self-closeable tags handling change in jsoup 1.20.1
> -------------------------------------------------------------
>
> Key: TIKA-4419
> URL: https://issues.apache.org/jira/browse/TIKA-4419
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Major
> Attachments: tags-top500.txt
>
>
> On TIKA-4411, [~tilman] found a significant change in behavior for how jsoup
> 1.21.1 is handling self-closing tags. We need to figure out how to deal with
> this in a reasonable way.
>
> Ref:
> https://issues.apache.org/jira/browse/TIKA-4411?focusedCommentId=17952615&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-17952615
--
This message was sent by Atlassian Jira
(v8.20.10#820010)