[ https://issues.apache.org/jira/browse/TIKA-1898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15203876#comment-15203876 ]
Steffen Netz commented on TIKA-1898: ------------------------------------ Hi Nick, You tested it with stdin and MIF, which is very straightforward. But with suffixes .doc as file input it doesn't work, and gives just a exception. Look at my tika_buf.log. I Created a patch under TIKA-1898. This is my very first push to an opensource project. Please give me an advice, if this is the right way. > backslashes in mime-type : application/vnd.mif are wrong > --------------------------------------------------------- > > Key: TIKA-1898 > URL: https://issues.apache.org/jira/browse/TIKA-1898 > Project: Tika > Issue Type: Bug > Components: config, core > Environment: Win64, Eclipse > Reporter: Steffen Netz > Priority: Minor > Labels: easyfix, patch > Attachments: test.doc, test.fm, test.mif, tika-bug.log > > > In > tika-core\src\main\resources\org\apache\tika\mime\tika-mimetypes.xml > there are the lines: > <match value="\<MakerFile" type="string" offset="0" /> > <match value="\<MIFFile" type="string" offset="0" /> > <match value="\<MakerDictionary" type="string" offset="0" /> > <match value="\<MakerScreenFont" type="string" offset="0" /> > <match value="\<MML" type="string" offset="0" /> > <match value="\<Book" type="string" offset="0" /> > <match value="\<Maker" type="string" offset="0" /> > wrong. > the backslashes must be removed. -- This message was sent by Atlassian JIRA (v6.3.4#6332)