[ 
https://issues.apache.org/jira/browse/TIKA-4072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17732717#comment-17732717
 ] 

Tim Allison commented on TIKA-4072:
-----------------------------------

If we back off to 4 0x00, the identification works:

{noformat}
<magic priority="50">
      <match value="0x9602" offset="0">
        <match value="0x8000" offset="4">
          <match value="0x00000000" offset="11"/>
        </match>
        <match value="0x0001" offset="4">
          <match value="0x00000000" offset="11"/>
        </match>
      </match>
    </magic>
{noformat}

> Add magic for Atari Floppy Disk Image Format
> --------------------------------------------
>
>                 Key: TIKA-4072
>                 URL: https://issues.apache.org/jira/browse/TIKA-4072
>             Project: Tika
>          Issue Type: Sub-task
>            Reporter: Gregory Lepore
>            Priority: Minor
>         Attachments: ANIME_B.ATR, GAMES050.ATR, Rambrandt.ATR
>
>
> The Atari Floppy Disk Image Format occurs 1566 times in the second most 
> recent Common Crawl dataset. There is no known mime type for this format.
>  
> The magic is 9602\{2}(80|00)\{2}0000000000
>  
> which is 9602 followed by two random bytes, then either an 80 or a 10, two 
> more random bytes, and 0000000000.
>  
> http://fileformats.archiveteam.org/wiki/ATR



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to