[ 
https://issues.apache.org/jira/browse/TIKA-3999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17725530#comment-17725530
 ] 

Gregory Lepore commented on TIKA-3999:
--------------------------------------

I'm all for increasing the accuracy of format identification, hence the large 
effort to document hundreds of MOD formats. However... Most have poorly 
documented and highly variable format structures, so I'm not sure you would be 
able to pull much information out of the files without finding original 
documentation or reverse engineering the formats. And since most of these 
formats are already 25 years old...

That being said, identification is the first step to content and metadata 
extraction, plus it would probably minimize false positives in other files.

I could see implementing the identification of the files, but not worrying too 
much about pulling anything out of them. Not my call, just trying to help out 
in my areas of expertise. (Plus, Tim's title for the issue was a bit, um, 
sparse!)

> audio/xm audio/x-mod
> --------------------
>
>                 Key: TIKA-3999
>                 URL: https://issues.apache.org/jira/browse/TIKA-3999
>             Project: Tika
>          Issue Type: Sub-task
>            Reporter: Tim Allison
>            Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)
