[ 
https://jira.duraspace.org/browse/DS-1093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=28615#comment-28615
 ] 

Ivan Masár commented on DS-1093:
--------------------------------

We've been using jMimeMagic (Apache 2.0) for this purpose (outside of DSpace), 
which is more generic than Tika (also Apache 2.0) and detects a wider range of 
formats (it also aspires to re-use the libmagic magic numbers database). The 
only draback I see in jMimeMagic is that it currently detects Office Open XML 
(docx, xlsx, ...) files as application/zip instead of their specific MIME 
types. I just filed a bug for that: 
https://github.com/arimus/jmimemagic/issues/14

Generally, I agree that this would be great as a curation task.
                
> Verify file formats at the point of file upload.
> ------------------------------------------------
>
>                 Key: DS-1093
>                 URL: https://jira.duraspace.org/browse/DS-1093
>             Project: DSpace
>          Issue Type: New Feature
>            Reporter: Robin Taylor
>
> This is a new issue to cover the bit of DS-638 which was not implemented in 
> DSpace 1.8 viz. checking that the format of a file is what it claims to be 
> eg. is thing.pdf really a pdf or is it a jpeg ?
> Ideally this would be developed as a Curation Framework task that could be 
> invoked during the submission process. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

------------------------------------------------------------------------------
See everything from the browser to the database with AppDynamics
Get end-to-end visibility with application monitoring from AppDynamics
Isolate bottlenecks and diagnose root cause in seconds.
Start your free trial of AppDynamics Pro today!
http://pubads.g.doubleclick.net/gampad/clk?id=48808831&iu=/4140/ostg.clktrk
_______________________________________________
Dspace-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-devel

Reply via email to