UTF-8 encoded XML is detected as text/plain because of UTF-8 BOM
Key: TIKA-897
URL: https://issues.apache.org/jira/browse/TIKA-897
Project: Tika
Issue Type: Bug
[ https://issues.apache.org/jira/browse/TIKA-897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13258241#comment-13258241 ]
Nick Burch commented on TIKA-897:
-
We had support for detecting XML files that are ASCII,
See https://builds.apache.org/job/Tika-trunk/836/changes
Changes:
[nick] TIKA-897 Detect XML files that start with the UTF-8 BOM, plus test
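
For readers unfamiliar with the problem behind this change, here is a minimal sketch of the idea in Python. It is not the code from the commit and not Tika's detection mechanism. The helper name `sniff_type` and the two returned type strings are assumptions chosen for illustration. It only shows why a leading UTF-8 BOM (bytes EF BB BF) hides the `<?xml` prefix from a naive check, and how skipping the BOM first avoids that.

```python
import codecs

# The UTF-8 byte order mark: b'\xef\xbb\xbf'
UTF8_BOM = codecs.BOM_UTF8
XML_PROLOG = b"<?xml"


def sniff_type(data: bytes) -> str:
    """Hypothetical sniffer: classify a byte prefix as XML or plain text.

    A check that only tests data.startswith(b"<?xml") misses files that
    begin with a UTF-8 BOM, so the BOM is stripped before the comparison.
    """
    if data.startswith(UTF8_BOM):
        data = data[len(UTF8_BOM):]
    if data.startswith(XML_PROLOG):
        return "application/xml"
    return "text/plain"


if __name__ == "__main__":
    sample = UTF8_BOM + b'<?xml version="1.0" encoding="UTF-8"?><root/>'
    print(sniff_type(sample))
```

The sketch recognises only documents that begin with an XML declaration. A real detector would need to handle other encodings and documents without a declaration.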
--
[...truncated 1063 lines...]
[WARNING] We have a duplicate
Hi,
On Fri, Apr 20, 2012 at 4:16 PM, Apache Jenkins Server
jenk...@builds.apache.org wrote:
message : Failed to execute goal org.apache.rat:apache-rat-plugin:0.7:check (default) on project tika-server: Too many unapproved licenses: 1
This is because of the reduced dependency pom written by