[ https://issues.apache.org/jira/browse/TIKA-3798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17557407#comment-17557407 ]
Tim Allison commented on TIKA-3798: ----------------------------------- In Tika 2.4.0, we were using junrar 7.5.1. https://github.com/junrar/junrar/issues/73 shows infinite loops before 7.5.1 https://github.com/junrar/junrar/issues/81 is still open and has follow up infinite loops from fuzzing. In short, this is somewhat of a known issue that hasn't been solved yet even in 7.5.2 (I'm guessing, I'll test later today). > Tika hangs up with some RAR archives > ------------------------------------ > > Key: TIKA-3798 > URL: https://issues.apache.org/jira/browse/TIKA-3798 > Project: Tika > Issue Type: Bug > Environment: Windows, Tika 2.4.0 > Reporter: Mikhail Gushinets > Priority: Major > Attachments: MicrosoftTeams-image.png, rar-files.csv.gz > > > Passing to Tika rar archive might lead to hanging up. > When trying to unrar this file manually I get this message: "Checksum is not > calculated right of file as there might be a change of the metadata" > I understand that the probably reason is some kind of file corruption here > but it would be nice if Tika would just throw an exception in such case > rather than hanging up forever. -- This message was sent by Atlassian Jira (v8.20.7#820007)