[ https://issues.apache.org/jira/browse/TIKA-1877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15172890#comment-15172890 ]
ASF GitHub Bot commented on TIKA-1877:
--------------------------------------

GitHub user prasadns14 opened a pull request:

    https://github.com/apache/tika/pull/81

    fix for TIKA-1877 contributed by prasadns14

    Updated the tika-mimetypes.xml
    Also, added a new .fits file to test-documents and created a unit test too.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/prasadns14/tika TIKA-1877

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/tika/pull/81.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #81

----
commit 602d237feec48bfd97bc2b2b38ea614b1ae2c55d
Author: prasadns14 <prasadn...@gmail.com>
Date:   2016-02-29T23:03:13Z

    fix for TIKA-1877 contributed by prasadns14

----

> On updating the tika-mimetypes.xml to detect .fts file format, tika detector
> does not return anything
> -----------------------------------------------------------------------------------------------------
>
>                 Key: TIKA-1877
>                 URL: https://issues.apache.org/jira/browse/TIKA-1877
>             Project: Tika
>          Issue Type: Bug
>          Components: mime
>            Reporter: Prasad Nagaraj Subramanya
>            Priority: Minor
>         Attachments: 3DEE2CE70CAD248DC8A46C2D0BD0BD6C21AACE54AC958264773390B39C8AF079,
>                      4E8D6B46E2366D7063DE3926AF0F976A0DCCD57A7E3B53B7D54768F16DD23984,
>                      tika-mimetypes.xml
>
>
> The match value for .fts file format in tika-mimetypes.xml is "SIMPLE = T".
> Tika detected a .fts file as application/octet-stream. On verifying the
> header I found the value to be "SIMPLE = T"(just 16 spaces before = and T)
> I tried the following changes-
> Change 1) Updated the existing match value. But the build failed
> Change 2) Added a new match value <match value="SIMPLE = T"
> type="string" offset="0"/> after the existing one.
> But now, tika returns empty value. It neither identifies the file as .fts nor
> as application/octet-stream.
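
The behaviour described in the quoted issue comes down to a byte-for-byte
comparison at a fixed offset, so a pattern that differs from the file header
only in the number of spaces will not match. The following is a minimal
sketch of that idea in plain Java. It is not Tika's implementation: the class
MagicMatchSketch, the helpers matchesAt and simpleCard, and the space counts
used (2 before "=", then 20 or 16 after it) are all hypothetical and chosen
only for illustration.

```java
import java.nio.charset.StandardCharsets;

public class MagicMatchSketch {

    // Hypothetical helper: true if the ASCII bytes of `pattern` occur in
    // `data` starting exactly at `offset` (a strict, whitespace-sensitive match).
    static boolean matchesAt(byte[] data, int offset, String pattern) {
        byte[] p = pattern.getBytes(StandardCharsets.US_ASCII);
        if (offset < 0 || offset + p.length > data.length) {
            return false;
        }
        for (int i = 0; i < p.length; i++) {
            if (data[offset + i] != p[i]) {
                return false;
            }
        }
        return true;
    }

    // Hypothetical helper: builds "SIMPLE", `before` spaces, "=",
    // `after` spaces, then "T". The counts are illustrative assumptions.
    static String simpleCard(int before, int after) {
        StringBuilder sb = new StringBuilder("SIMPLE");
        for (int i = 0; i < before; i++) {
            sb.append(' ');
        }
        sb.append('=');
        for (int i = 0; i < after; i++) {
            sb.append(' ');
        }
        sb.append('T');
        return sb.toString();
    }

    public static void main(String[] args) {
        // Pretend this is the start of a file header.
        byte[] header = simpleCard(2, 20).getBytes(StandardCharsets.US_ASCII);

        // Same spacing as the header: the pattern matches at offset 0.
        System.out.println(matchesAt(header, 0, simpleCard(2, 20)));

        // Different spacing: the pattern no longer matches.
        System.out.println(matchesAt(header, 0, simpleCard(2, 16)));
    }
}
```

Under these assumptions, a match value has to reproduce the header's spacing
exactly, which is consistent with a file falling through to
application/octet-stream when the spacing in the rule and in the file differ.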
-- This message was sent by Atlassian JIRA (v6.3.4#6332)