[ 
https://issues.apache.org/jira/browse/TIKA-14?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Keith R. Bennett updated TIKA-14:
---------------------------------

    Attachment: tika-14.patch

Adds the Open Office file extension to the list of support file types, with:

+        } else if (name.endsWith(".odt")) {
+            return "application/vnd.oasis.opendocument.text";

Also adds more thorough testing in the test class.


> MimeTypeUtils.getMimeType() returns the default mime type for .odt (Open 
> Office) files.
> ---------------------------------------------------------------------------------------
>
>                 Key: TIKA-14
>                 URL: https://issues.apache.org/jira/browse/TIKA-14
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 0.1-incubator
>            Reporter: Keith R. Bennett
>             Fix For: 0.1-incubator
>
>         Attachments: tika-14.patch
>
>
> MimeTypeUtils.getMimeType() returns the default mime type for .odt (Open 
> Office) files.
> Because of this, it is not possible to parse OpenOffice files at this time.  
> I did some brief research, and could not find a mime type for Open Office 
> files.  There was a comment that the mime type associated with these files is 
> application/zip, since Open Office document files are zipped files.  That, of 
> course, will not help us, since it would not be reasonable for us to assume 
> that all zip files have Open Office content.
> It is possible that there is now a mime type for Open Office documents, and I 
> just could not find it.  (I hope so.)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to