[jira] [Commented] (TIKA-881) HtmlParser sometimes(!) throws IOException while determining Html-Encoding

2012-03-22 Thread Ken Krugler (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13235701#comment-13235701 ] Ken Krugler commented on TIKA-881: -- Hi Klaus - thanks for debugging this. I'll take a look

[jira] [Commented] (TIKA-539) Encoding detection is too biased by encoding in meta tag

2012-02-23 Thread Ken Krugler (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13215145#comment-13215145 ] Ken Krugler commented on TIKA-539: -- Hi Daniel - I would file a separate issue that reports

[jira] [Commented] (TIKA-844) Ability to Define an Internal Text Bag Property

2012-01-16 Thread Ken Krugler (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13187098#comment-13187098 ] Ken Krugler commented on TIKA-844: -- Hi Ray - could you provide more details on when/why a t

[jira] [Commented] (TIKA-86) Support magic(5) files

2012-01-16 Thread Ken Krugler (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-86?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13187049#comment-13187049 ] Ken Krugler commented on TIKA-86: - For regex magic, I'd recommend compiling into FSM - e.g. u