[ 
https://issues.apache.org/jira/browse/TIKA-3798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17557469#comment-17557469
 ] 

Tim Allison commented on TIKA-3798:
-----------------------------------

I opened a PR to fix https://github.com/junrar/junrar/issues/81... we'll see 
where that goes.  We won't know that that is the source of this problem without 
access to the problematic junrar file.

Again, my larger point about isolating tika into a separate process is key.  We 
can and must fix the one off infinite loops, but there will be more.

> Tika hangs up with some RAR archives
> ------------------------------------
>
>                 Key: TIKA-3798
>                 URL: https://issues.apache.org/jira/browse/TIKA-3798
>             Project: Tika
>          Issue Type: Bug
>         Environment: Windows, Tika 2.4.0
>            Reporter: Mikhail Gushinets
>            Priority: Major
>         Attachments: MicrosoftTeams-image.png, rar-files.csv.gz
>
>
> Passing to Tika rar archive might lead to hanging up.
> When trying to unrar this file manually I get this message: "Checksum is not 
> calculated right of file as there might be a change of the metadata"
> I understand that the probably reason is some kind of file corruption here 
> but it would be nice if Tika would just throw an exception in such case 
> rather than hanging up forever.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to