[ https://issues.apache.org/jira/browse/TIKA-1502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14256663#comment-14256663 ]
Hudson commented on TIKA-1502: ------------------------------ SUCCESS: Integrated in tika-trunk-jdk1.7 #384 (See [https://builds.apache.org/job/tika-trunk-jdk1.7/384/]) Split the Berkeley DB mimetypes into three levels, and add a detection test (passes) and a heirarchy test (disabled as fails) TIKA-1502 (nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1647486) * /tika/trunk/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml * /tika/trunk/tika-core/src/test/java/org/apache/tika/mime/MimeTypesReaderTest.java * /tika/trunk/tika-parsers/src/test/java/org/apache/tika/mime/TestMimeTypes.java Start on magic for subtypes of Berkeley DB TIKA-1502 (nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1647485) * /tika/trunk/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml More test database files for TIKA-1502 (nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1647484) * /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_2.db * /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_3.db * /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_4.db * /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_5.db * /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_btree_2.db * /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_btree_3.db * /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_btree_4.db * /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_btree_5.db * /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_hash_2.db * /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_hash_3.db * /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_hash_4.db * /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_hash_5.db > Mime magic for database file formats > ------------------------------------ > > Key: TIKA-1502 > URL: https://issues.apache.org/jira/browse/TIKA-1502 > Project: Tika > Issue Type: Improvement > Components: mime > Affects Versions: 1.6 > Reporter: Nick Burch > > I noticed today that Tika can't detect a lot of common database formats, such > as sqlite or Berkeley DB or MISAM > The unix file utility got most of those, which makes me think that there's a > sensible-ish header on most we can write some mime magic for > It'd therefore be good to add mime entries, with magic where possible, for > many of these common database file formats -- This message was sent by Atlassian JIRA (v6.3.4#6332)