[ https://issues.apache.org/jira/browse/TIKA-3810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nick Burch resolved TIKA-3810. ------------------------------ Fix Version/s: 2.4.2 Resolution: Fixed > Vtt file (encoding UTF-8 with BOM) seen as text/plain > ----------------------------------------------------- > > Key: TIKA-3810 > URL: https://issues.apache.org/jira/browse/TIKA-3810 > Project: Tika > Issue Type: Bug > Components: core, detector, mime > Affects Versions: 2.3.0 > Reporter: Giorgiana Ciobanu > Priority: Major > Fix For: 2.4.2 > > Attachments: s5_windowEncoding_validFormat.vtt > > > Vtt file created on Windows (UTF-8 {+}with BOM{+}) is incorrectly detected as > _text/plain_ type and it should be _text/vtt_ . > The application using Tika and where the file is uploaded for mime type > detection is an Unix machine. > The vtt file is passed as inputstream to the Tika's default detector (we > don't want to detect mime type by the file extension). > Please find attached the vtt file that Tika is detecting as text/plain . -- This message was sent by Atlassian Jira (v8.20.10#820010)