[ 
https://issues.apache.org/jira/browse/TIKA-1892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15182205#comment-15182205
 ] 

Hudson commented on TIKA-1892:
------------------------------

SUCCESS: Integrated in tika-trunk-jdk1.7 #923 (See 
[https://builds.apache.org/job/tika-trunk-jdk1.7/923/])
Magic for Mobipocket Ebook and ESRI Shapefiles from TIKA-1892 from Suman (nick: 
rev 74e71ebd871172c3473719d0814400f69d4c8913)
* tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml


> Mime Magic for application/x-mobipocket-ebook and application/x-shapefile
> -------------------------------------------------------------------------
>
>                 Key: TIKA-1892
>                 URL: https://issues.apache.org/jira/browse/TIKA-1892
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime
>    Affects Versions: 1.12
>            Reporter: Suman Kashyap
>            Priority: Minor
>             Fix For: 1.13
>
>
> Our FHT analysis for mobipocket-ebook and shapefiles shows high corelation of 
> initial header bytes. Further inspection of these files over online available 
> and TREC polar data sets revealed presence of common bytes for mime 
> identification 
> patch content
> <mime-type type="application/x-netcdf">
>   <acronym>NETCDF</acronym>
>   <_comment>Network Common Data Format</_comment>
>   <magic priority="60">
>       <match value="CDF" type="string" offset="0" />
>   </magic>
>   <glob pattern="*.nc"/>
> </mime-type>
> <mime-type type="application/x-mobipocket-ebook">
>   <acronym>MOBI</acronym>
>   <_comment>Mobipocket Ebook</_comment>
>   <magic priority="60">
>       <match value="BOOKMOBI" type="string" offset="23" />
>   </magic>
>   <glob pattern="*.mobi"/>
> </mime-type>
> <mime-type type="application/x-shapefile">
>   <acronym>ESRI Shapefiles</acronym>
>   <_comment>ESRI Shapefiles</_comment>
>   <magic priority="60">
>       <match value="0x0000270a" type="big32" offset="2" />
>   </magic>
>   <glob pattern="*.shp"/>
> </mime-type>
>  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to