[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2015-08-29 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14721159#comment-14721159 ] Ken Krugler commented on TIKA-369: -- Initial results from integrating language-detector (see

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2015-03-01 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342531#comment-14342531 ] Ken Krugler commented on TIKA-369: -- Hi Tyler - detection speed is an issue, but Tika also

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2015-03-01 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14342533#comment-14342533 ] Tyler Palsulich commented on TIKA-369: -- Thanks, Ken! In that case, I definitely agree.

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2015-02-28 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14341994#comment-14341994 ] Tyler Palsulich commented on TIKA-369: -- Is there any update on this? Language detection

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2013-02-19 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13581315#comment-13581315 ] Ken Krugler commented on TIKA-369: -- Some questions then about integrating

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2013-02-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13573492#comment-13573492 ] Michael McCandless commented on TIKA-369: - The language-detection lib is now in

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2013-02-07 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13573541#comment-13573541 ] Ted Dunning commented on TIKA-369: -- It is hard to object, but it would be good to replicate

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2013-02-07 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13573670#comment-13573670 ] Ken Krugler commented on TIKA-369: -- I've been using language-detection in another project

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2013-02-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13573753#comment-13573753 ] Robert Muir commented on TIKA-369: -- The DetectorFactory is definitely gnarly, but you can

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2012-11-18 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13499838#comment-13499838 ] Michael McCandless commented on TIKA-369: - +1 to cut over to

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2012-11-18 Thread Pander Musubi (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13499847#comment-13499847 ] Pander Musubi commented on TIKA-369: language-detection uses a variable length n-grams.

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2012-11-17 Thread Pander Musubi (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13499479#comment-13499479 ] Pander Musubi commented on TIKA-369: +1 for using

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2012-02-19 Thread Christian Moen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13211424#comment-13211424 ] Christian Moen commented on TIKA-369: - Does anyone have any thoughts on how we should

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2011-11-03 Thread Joseph Vychtrle (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13143436#comment-13143436 ] Joseph Vychtrle commented on TIKA-369: -- Imho the CERTAINTY_LIMIT is too rigorous. I was

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2011-11-03 Thread Joseph Vychtrle (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13143676#comment-13143676 ] Joseph Vychtrle commented on TIKA-369: -- Wouldn't it be better if the field wasn't

[jira] [Commented] (TIKA-369) Improve accuracy of language detection

2011-06-26 Thread JIRA
[ https://issues.apache.org/jira/browse/TIKA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13055242#comment-13055242 ] Jan Høydahl commented on TIKA-369: -- Any new thoughts on this one? Seems like LUCENE-826