[ 
https://issues.apache.org/jira/browse/TIKA-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14357158#comment-14357158
 ] 

Hudson commented on TIKA-1286:
------------------------------

SUCCESS: Integrated in tika-trunk-jdk1.7 #541 (See 
[https://builds.apache.org/job/tika-trunk-jdk1.7/541/])
TIKA-1286 Visio OOXML mimetypes, and non-container detection unit tests (nick: 
http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1665917)
* 
/tika/trunk/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
* 
/tika/trunk/tika-parsers/src/test/java/org/apache/tika/detect/TestContainerAwareDetector.java
* /tika/trunk/tika-parsers/src/test/java/org/apache/tika/mime/TestMimeTypes.java
TIKA-1286 Sample Visio OOXML VSDX files from  Pascal Essiembre (nick: 
http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1665909)
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testVISIO.vsdm
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testVISIO.vsdx
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testVISIO.vssm
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testVISIO.vssx
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testVISIO.vstm
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testVISIO.vstx


> Adding MS Visio VSDX to mime-types detection
> --------------------------------------------
>
>                 Key: TIKA-1286
>                 URL: https://issues.apache.org/jira/browse/TIKA-1286
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime
>    Affects Versions: 1.5
>         Environment: Any
>            Reporter: Pascal Essiembre
>            Priority: Minor
>              Labels: easyfix
>         Attachments: TIKA-1286.zip
>
>   Original Estimate: 1m
>  Remaining Estimate: 1m
>
> Visio files under the Open Office XML (ooxml) format are not recognized by 
> the mim-type detector and always returns the family mime-type instead: 
> {{application/x-tika-ooxml}}.
> It turns out most Microsoft OOXML file formats are defined in the 
> tika-mimetypes.xml, but not not Visio.  I have created the list for someone 
> to add:
> {code:xml}
>   <mime-type type="application/vnd.ms-visio.drawing.main+xml">
>     <_comment>Office Open XML Visio Drawing (macro-free)</_comment>
>     <glob pattern="*.vsdx"/>
>     <sub-class-of type="application/x-tika-ooxml"/>
>   </mime-type>
>   <mime-type type="application/vnd.ms-visio.template.main+xml">
>     <_comment>Office Open XML Visio Template (macro-free)</_comment>
>     <glob pattern="*.vstx"/>
>     <sub-class-of type="application/x-tika-ooxml"/>
>   </mime-type>
>   <mime-type type="application/vnd.ms-visio.stencil.main+xml">
>     <_comment>Office Open XML Visio Stencil (macro-free)</_comment>
>     <glob pattern="*.vssx"/>
>     <sub-class-of type="application/x-tika-ooxml"/>
>   </mime-type>
>   <mime-type type="application/vnd.ms-visio.drawing.macroEnabled.main+xml">
>     <_comment>Office Open XML Visio Drawing (macro-enabled)</_comment>
>     <glob pattern="*.vsdm"/>
>     <sub-class-of type="application/x-tika-ooxml"/>
>   </mime-type>
>   <mime-type type="application/vnd.ms-visio.template.macroEnabled.main+xml">
>     <_comment>Office Open XML Visio Template (macro-enabled)</_comment>
>     <glob pattern="*.vstm"/>
>     <sub-class-of type="application/x-tika-ooxml"/>
>   </mime-type>
>   <mime-type type="application/vnd.ms-visio.stencil.macroEnabled.main+xml">
>     <_comment>Office Open XML Visio Stencil (macro-enabled)</_comment>
>     <glob pattern="*.vssm"/>
>     <sub-class-of type="application/x-tika-ooxml"/>
>   </mime-type>
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to