[jira] [Commented] (TIKA-721) UTF16-LE not detected

2016-08-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405785#comment-15405785 ] Tim Allison commented on TIKA-721: While working on TIKA-2038, I found that ICU4J is now correctly

[jira] [Commented] (TIKA-721) UTF16-LE not detected

2011-10-02 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13119018#comment-13119018 ] Nick Burch commented on TIKA-721: I'd suggest we check for invalid UTF-16 sequences (see

[jira] [Commented] (TIKA-721) UTF16-LE not detected

2011-10-02 Thread Michael McCandless (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13119035#comment-13119035 ] Michael McCandless commented on TIKA-721: bq. I'd suggest we check for invalid

[jira] [Commented] (TIKA-721) UTF16-LE not detected

2011-10-02 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13119038#comment-13119038 ] Robert Muir commented on TIKA-721: {quote} Finally, for the valid code points, I count how

[jira] [Commented] (TIKA-721) UTF16-LE not detected

2011-10-02 Thread Michael McCandless (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13119044#comment-13119044 ] Michael McCandless commented on TIKA-721: {quote} bq. Finally, for the valid code

[jira] [Commented] (TIKA-721) UTF16-LE not detected

2011-09-19 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13107969#comment-13107969 ] Nick Burch commented on TIKA-721: In CharsetRecog_Unicode on line 69 (inside