[jira] [Commented] (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly.

2012-07-24 Thread Tomas Safarik (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13421711#comment-13421711 ] Tomas Safarik commented on TIKA-431: Hello, it seems that I created duplicate issue TIK

[jira] Commented: (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly.

2010-05-21 Thread Erik Hetzner (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12870079#action_12870079 ] Erik Hetzner commented on TIKA-431: --- See TIKA-341, apparently my suggestion (2) above is im

[jira] Commented: (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly.

2010-05-26 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12871567#action_12871567 ] Jukka Zitting commented on TIKA-431: Agreed, we should be using the charset parameter of

[jira] Commented: (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly.

2010-05-26 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12871824#action_12871824 ] Ken Krugler commented on TIKA-431: -- I should have some time soon to do a once-over on a bunc

[jira] [Commented] (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly.

2011-09-12 Thread JIRA
[ https://issues.apache.org/jira/browse/TIKA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13103121#comment-13103121 ] Jan Høydahl commented on TIKA-431: -- ping() We've just got bitten by this, any chance for a

[jira] [Commented] (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly.

2011-09-12 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13103144#comment-13103144 ] Ken Krugler commented on TIKA-431: -- Hi Jan - sorry for the delay. Would end of week be soon

[jira] [Commented] (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly.

2011-09-13 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13103469#comment-13103469 ] Nick Burch commented on TIKA-431: - Any chance someone could work up a failing unit test for

[jira] [Commented] (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly.

2011-09-14 Thread JIRA
[ https://issues.apache.org/jira/browse/TIKA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13104875#comment-13104875 ] Jan Høydahl commented on TIKA-431: -- End of week is good. As soon as 1.0 gets released, I'll

[jira] [Commented] (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly.

2011-09-16 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13106893#comment-13106893 ] Ken Krugler commented on TIKA-431: -- Some other things I should have mentioned with regards

[jira] [Commented] (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly.

2011-09-17 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13107130#comment-13107130 ] Robert Muir commented on TIKA-431: -- Shouldnt the charset from the http response header inst

[jira] [Commented] (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly.

2011-09-17 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13107214#comment-13107214 ] Ken Krugler commented on TIKA-431: -- Hi Robert, I'm assuming you're talking about the case

[jira] [Commented] (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly.

2011-09-17 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13107228#comment-13107228 ] Robert Muir commented on TIKA-431: -- I'm not sure even if its in both that it should be trus

[jira] [Commented] (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly.

2011-09-17 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13107234#comment-13107234 ] Robert Muir commented on TIKA-431: -- {quote} Though the iCU detection code isn't very good e

[jira] [Commented] (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly.

2011-09-17 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13107272#comment-13107272 ] Ken Krugler commented on TIKA-431: -- For analysis, I used Tika charset detection and compare

[jira] [Commented] (TIKA-431) Tika currently misuses the HTTP Content-Encoding header, and does not seem to use the charset part of the Content-Type header properly.

2011-09-17 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13107273#comment-13107273 ] Ken Krugler commented on TIKA-431: -- Re "if there is any ambiguity, then its clearly wrong a