[ 
https://issues.apache.org/jira/browse/TIKA-1502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14256655#comment-14256655
 ] 

Nick Burch commented on TIKA-1502:
----------------------------------

As of r1647486, we now have mime types for SQLite3, MySQL (most) and Berkeley 
DB. We have magic for SQLite3, most of the MySQL formats (some are headerless), 
and expanded BDB ones.

One remaining issue is getting MimeTypesReaderTest.testReadParameterHeirarchy() 
to pass - for some reason the 3 level hierarchy of the BDB mime types is 
getting flattened to just two

> Mime magic for database file formats
> ------------------------------------
>
>                 Key: TIKA-1502
>                 URL: https://issues.apache.org/jira/browse/TIKA-1502
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime
>    Affects Versions: 1.6
>            Reporter: Nick Burch
>
> I noticed today that Tika can't detect a lot of common database formats, such 
> as sqlite or Berkeley DB or MISAM
> The unix file utility got most of those, which makes me think that there's a 
> sensible-ish header on most we can write some mime magic for
> It'd therefore be good to add mime entries, with magic where possible, for 
> many of these common database file formats



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to