[ https://issues.apache.org/jira/browse/TIKA-1881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15182137#comment-15182137 ]
Nick Burch commented on TIKA-1881:
----------------------------------

As mentioned on the Github pull request:

For the Atom, RSS and RDF ones - is the magic required? Doesn't the XML detector get them already via the namespace? And without risk of mis-detecting text files which happen to mention feed or rss or rdf near the start?

For the Postscript one - could you re-do this as text rather than hex, so it's easier to read?

(Others look fine!)

> On updating mime magic for existing mime types
> ----------------------------------------------
>
>                 Key: TIKA-1881
>                 URL: https://issues.apache.org/jira/browse/TIKA-1881
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime
>    Affects Versions: 1.11
>            Reporter: Namitha Sanjeeva Ganiga
>            Priority: Minor
>              Labels: mime
>             Fix For: 1.11
>
>
> Updated Mime-Magic for 6 mime types:
> 1. application/postscript : files begin with pattern "%!PS-Adobe-3.0 EPSF-3.0".
> 2. application/wordperfect : files begin with pattern "ÿWPC".
> 3. image/tiff : updated pattern for "MM.+" for Big endian format (occurs at the beginning of files of tiff mime type).
> 4. application/rdf+xml : updated pattern "rdf" (from byte offset 5 to 400)
> 5. application/atom+xml : updated pattern "feed" (from byte offset 5 to 50)
> 6. application/rss+xml : updated pattern "rss" (from byte offset 5 to 50)
> https://github.com/NamithaGS/tika/commit/780100767e24505a24595ea6db43978d0700e220

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
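
To make the mis-detection concern in the comment above concrete, here is a minimal sketch in plain Python (standard library only, not Tika code). The helper names `magic_match` and `xml_root` and both sample inputs are hypothetical, and the offset semantics ("the pattern may start at any offset in the given range") are an assumption about how a ranged magic match behaves. It contrasts a substring match in bytes 5 to 50 with a check of the XML root element's namespace and local name.

```python
import xml.etree.ElementTree as ET

ATOM_NS = "http://www.w3.org/2005/Atom"  # Atom namespace URI


def magic_match(data: bytes, pattern: bytes, start: int, end: int) -> bool:
    """Hypothetical ranged magic: True if `pattern` begins at an offset in [start, end]."""
    # Searching data[start:end + len(pattern)] allows a match starting exactly at `end`.
    return data.find(pattern, start, end + len(pattern)) != -1


def xml_root(data: bytes):
    """Return (namespace, local_name) of the root element, or None if not well-formed XML."""
    try:
        root = ET.fromstring(data)
    except ET.ParseError:
        return None
    tag = root.tag
    if tag.startswith("{"):
        # ElementTree encodes a namespaced tag as "{namespace}local".
        namespace, local_name = tag[1:].split("}", 1)
        return namespace, local_name
    return "", tag


# A plain-text file that merely mentions "feed" near the start (made-up sample).
text_note = b"Notes: the feed reader crashed again today.\n"

# A tiny Atom document (made-up sample).
atom_doc = (
    b'<?xml version="1.0"?>\n'
    b'<feed xmlns="http://www.w3.org/2005/Atom"><title>t</title></feed>'
)

# The ranged substring match fires for both inputs...
print(magic_match(text_note, b"feed", 5, 50), magic_match(atom_doc, b"feed", 5, 50))
# ...while the root-element check only recognises the real Atom document.
print(xml_root(text_note), xml_root(atom_doc))
```

In this sketch the substring match cannot tell the text note from the Atom document, while the root-element check can, which is the trade-off the comment asks about.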