[jira] [Updated] (TIKA-2484) Improve CharsetDetector to recognize UTF-16LE/BE,UTF-32LE/BE and UTF-7 with/without BOMs correctly

2017-11-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-2484: -- Attachment: charset.zip File from [~AndreasMeier] > Improve CharsetDetector to recognize UTF-16LE/BE,UTF

[jira] [Updated] (TIKA-2484) Improve CharsetDetector to recognize UTF-16LE/BE,UTF-32LE/BE and UTF-7 with/without BOMs correctly

2017-10-29 Thread Andreas Meier (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Meier updated TIKA-2484: Description: I would like to help to improve the recognition accuracy of the CharsetDetector. Theref

[jira] [Updated] (TIKA-2484) Improve CharsetDetector to recognize UTF-16LE/BE,UTF-32LE/BE and UTF-7 with/without BOMs correctly

2017-10-27 Thread Andreas Meier (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Meier updated TIKA-2484: Attachment: IUC10-ar.UTF-7.with-BOM IUC10-ar.UTF-7.without-BOM IUC10-a