Jean Coudon created TIKA-1842: --------------------------------- Summary: XML file detected as HTML Key: TIKA-1842 URL: https://issues.apache.org/jira/browse/TIKA-1842 Project: Tika Issue Type: Bug Components: mime Affects Versions: 1.11 Reporter: Jean Coudon Priority: Minor
A file containing this xml: {code:xml} <note> <to>Tove</to> <from>Jani</from> <heading>Reminder</heading> <body>Don't forget me this weekend!</body> </note> {code} is detected as text/html instead of application/xml (or at worst text/plain) when using this code sample: {code:java} Tika tika = new Tika(); System.out.println(tika.detect(new File("src/test/resources/file.xml")))); {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)