[
https://issues.apache.org/jira/browse/TIKA-1180?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18019444#comment-18019444
]
Hudson commented on TIKA-1180:
------------------------------
SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk17 #892 (See
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk17/892/])
TIKA-1180 -- fixes for pr #2251 (tallison:
[https://github.com/apache/tika/commit/7582838982a567a0c5e888b414e691128a4d5b4c])
* (add)
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-audiovideo-module/src/test/resources/test-documents/sample-webm.noext
* (delete)
tika-parsers/tika-parsers-standard/tika-parsers-standard-package/src/test/resources/test-documents/testMKV.mkv
* (edit)
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-audiovideo-module/src/main/java/org/apache/tika/detect/MatroskaDetector.java
* (add)
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-audiovideo-module/src/test/resources/test-documents/sample-mkv.noext
* (edit)
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-audiovideo-module/src/test/java/org/apache/tika/detect/MatroskaDetectorTest.java
* (add)
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-audiovideo-module/src/test/resources/test-documents/testMKV.mkv
* (edit)
tika-parsers/tika-parsers-standard/tika-parsers-standard-package/src/test/java/org/apache/tika/detect/TestDetectorLoading.java
> Better Matroska MKV and WEBM Detection
> --------------------------------------
>
> Key: TIKA-1180
> URL: https://issues.apache.org/jira/browse/TIKA-1180
> Project: Tika
> Issue Type: New Feature
> Components: detector
> Affects Versions: 1.5
> Reporter: Nick Burch
> Priority: Major
> Labels: new-parser
> Attachments: sample-mkv.noext, sample-webm.noext
>
>
> Following the work on TIKA-1177, we now have mimetype entries for the various
> formats which are based on the Matroska container (mkv, mka, webm etc).
> However, we are unable to properly identify the specific type from mime
> magic alone.
> Instead, for fully accurate detection, we'll need a new Detector for the
> Matroska family, which does some very simple container/stream processing to
> work out what the container contains.
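
For illustration only, here is a minimal sketch of the kind of "very simple container processing" the description refers to. It is not the MatroskaDetector committed for this issue. It assumes the first bytes of the stream are already in a byte array, and the class and method names (EbmlDocTypeSniffer, sniffDocType, guessMimeType) are hypothetical. The sketch walks the EBML header and reads the DocType element (ID 0x4282), which is typically "matroska" or "webm". Telling audio-only files (mka) apart from video would need a look at the track entries, which is not attempted here.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of Tika: reads the EBML DocType from a buffer.
final class EbmlDocTypeSniffer {

    // Returns the DocType string (e.g. "matroska" or "webm"), or null if the
    // buffer does not start with an EBML header or no DocType is found in it.
    static String sniffDocType(byte[] buf) {
        // Every EBML file starts with the header element ID 0x1A45DFA3.
        if (buf.length < 5 || (buf[0] & 0xFF) != 0x1A || (buf[1] & 0xFF) != 0x45
                || (buf[2] & 0xFF) != 0xDF || (buf[3] & 0xFF) != 0xA3) {
            return null;
        }
        int pos = 4;
        int headerSizeLen = vintLength(buf[pos]);
        if (headerSizeLen == 0 || pos + headerSizeLen > buf.length) {
            return null;
        }
        long headerSize = readVintValue(buf, pos, headerSizeLen);
        pos += headerSizeLen;
        // Never read past the buffer, even if the declared header is longer.
        long end = Math.min((long) buf.length, pos + headerSize);

        while (pos < end) {
            // Element ID: 1 to 4 bytes, length marker bits are kept.
            int idLen = vintLength(buf[pos]);
            if (idLen == 0 || idLen > 4 || pos + idLen >= end) {
                return null;
            }
            int id = 0;
            for (int i = 0; i < idLen; i++) {
                id = (id << 8) | (buf[pos + i] & 0xFF);
            }
            pos += idLen;

            // Element size: 1 to 8 bytes, length marker bit is stripped.
            int sizeLen = vintLength(buf[pos]);
            if (sizeLen == 0 || pos + sizeLen > end) {
                return null;
            }
            long size = readVintValue(buf, pos, sizeLen);
            pos += sizeLen;
            if (size > end - pos) {
                return null; // truncated or malformed element
            }
            if (id == 0x4282) { // DocType
                return new String(buf, pos, (int) size, StandardCharsets.US_ASCII);
            }
            pos += (int) size; // skip any other header element
        }
        return null;
    }

    // Coarse mapping from DocType to a media type; audio-only files are not
    // distinguished here (that would require inspecting the tracks).
    static String guessMimeType(String docType) {
        if ("webm".equals(docType)) {
            return "video/webm";
        }
        if ("matroska".equals(docType)) {
            return "video/x-matroska";
        }
        return null;
    }

    // Length in bytes of an EBML variable-length integer, from its first byte.
    // Returns 0 for an invalid first byte (no marker bit set).
    private static int vintLength(byte first) {
        int b = first & 0xFF;
        return b == 0 ? 0 : Integer.numberOfLeadingZeros(b) - 23;
    }

    // Decodes a variable-length integer, dropping the length marker bit.
    private static long readVintValue(byte[] buf, int pos, int len) {
        long value = (buf[pos] & 0xFF) & (0xFF >> len);
        for (int i = 1; i < len; i++) {
            value = (value << 8) | (buf[pos + i] & 0xFF);
        }
        return value;
    }
}
```

A real detector would also need to mark and reset the input stream and cap how many bytes it reads. Both are left out to keep the sketch short.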
--
This message was sent by Atlassian Jira
(v8.20.10#820010)