[ 
https://issues.apache.org/jira/browse/TIKA-1180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18019418#comment-18019418
 ] 

Hudson commented on TIKA-1180:
------------------------------

FAILURE: Integrated in Jenkins build Tika ยป tika-main-jdk17 #891 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk17/891/])
TIKA-1180: Add MatroskaDetector for improved MKV/WEBM detection (#2251) 
(github: 
[https://github.com/apache/tika/commit/2b3dd510f3702ea005f2916122c24d66d203743a])
* (add) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-audiovideo-module/src/main/java/org/apache/tika/detect/MatroskaDetector.java
* (add) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-audiovideo-module/src/main/resources/META-INF/services/org.apache.tika.detect.Detector
* (add) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-audiovideo-module/src/test/java/org/apache/tika/detect/MatroskaDetectorTest.java


> Better Matroska MKV and WEBM Detection
> --------------------------------------
>
>                 Key: TIKA-1180
>                 URL: https://issues.apache.org/jira/browse/TIKA-1180
>             Project: Tika
>          Issue Type: New Feature
>          Components: detector
>    Affects Versions: 1.5
>            Reporter: Nick Burch
>            Priority: Major
>              Labels: new-parser
>         Attachments: sample-mkv.noext, sample-webm.noext
>
>
> Following the work on TIKA-1177, we now have mimetype entries for the various 
> formats which are based on the Matroska container (mkv, mka, webm etc). 
> However, we are unable to properly identify the specific type just from some 
> mime magic
> Instead, for fully accurate detection, we'll need a new Detector for the 
> Matroska family, which does some very simple container/stream processing to 
> work out what the container contains



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to