[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136077#comment-15136077
]
Ken Krugler commented on TIKA-1723:
---
OK, I've committed this code to a new tika-langdetec
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15132961#comment-15132961
]
Ken Krugler commented on TIKA-1723:
---
Good idea re gathering input - I just emailed the de
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15132613#comment-15132613
]
Tim Allison commented on TIKA-1723:
---
Agreed on the ease of building the new ld framework
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15130676#comment-15130676
]
Ken Krugler commented on TIKA-1723:
---
[~talli...@apache.org] I must admit, focusing on thi
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15130360#comment-15130360
]
Tim Allison commented on TIKA-1723:
---
Come on over to the 2.x branch, the water is fine.
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14729638#comment-14729638
]
Tim Allison commented on TIKA-1723:
---
Y, I agree...that's a potential mess/challenge/oppor
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14729613#comment-14729613
]
Tim Allison commented on TIKA-1723:
---
Great. Thank you.
bq. 1. ...Doesn't that get into i
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14729595#comment-14729595
]
Ken Krugler commented on TIKA-1723:
---
Biggest remaining issue before I commit is how to de
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14729588#comment-14729588
]
Ken Krugler commented on TIKA-1723:
---
Hi Tim,
1. Not sure about "Make language detection
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14729468#comment-14729468
]
Tim Allison commented on TIKA-1723:
---
Makes sense. I proposed moving it over just so that
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14729250#comment-14729250
]
Ken Krugler commented on TIKA-1723:
---
Regarding the current detection code...
I'm going t
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14728940#comment-14728940
]
Tim Allison commented on TIKA-1723:
---
I forgot to mention that we'll need to modify the Ti
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14728524#comment-14728524
]
Chris A. Mattmann commented on TIKA-1723:
---
Ken, this is great work. My +1 to move fo
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14728023#comment-14728023
]
Tim Allison commented on TIKA-1723:
---
Ken,
This looks great. And, yes, I wouldn't want
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14726266#comment-14726266
]
Ken Krugler commented on TIKA-1723:
---
Hi Tim - I just attached a new version of my patch,
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14723352#comment-14723352
]
Tim Allison commented on TIKA-1723:
---
My personal preference would be to add to whatever m
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14723347#comment-14723347
]
Tim Allison commented on TIKA-1723:
---
Agreed on complexity of multilingual lang id. You w
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14720702#comment-14720702
]
Ken Krugler commented on TIKA-1723:
---
I've also been thinking about how to use lang=xx and
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14720700#comment-14720700
]
Ken Krugler commented on TIKA-1723:
---
Hi Tim - re putting language detection into the hand
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14720675#comment-14720675
]
Ken Krugler commented on TIKA-1723:
---
Hi Tim - thanks for the fast review.
1. Re confiden
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14718494#comment-14718494
]
Tim Allison commented on TIKA-1723:
---
I've only taken a brief look, but I think that movin
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14717775#comment-14717775
]
Ken Krugler commented on TIKA-1723:
---
There are a number of TODO comments in the code, man
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14717772#comment-14717772
]
Ken Krugler commented on TIKA-1723:
---
The above work added the language-detector dependenc
[
https://issues.apache.org/jira/browse/TIKA-1723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14717768#comment-14717768
]
Ken Krugler commented on TIKA-1723:
---
Part of this work is looking to make the API for lan
24 matches