[jira] Commented: (SOLR-2244) Add Language Identification support
[ https://issues.apache.org/jira/browse/SOLR-2244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12934811#action_12934811 ] Grant Ingersoll commented on SOLR-2244: --- I'm going to suggest that we rename contrib/extraction to be contrib/tika and that we just roll all of these things under one area, that way we don't have to muck with libraries, etc. Heck, it might even make sense at this point to just move it into core. Add Language Identification support --- Key: SOLR-2244 URL: https://issues.apache.org/jira/browse/SOLR-2244 Project: Solr Issue Type: New Feature Reporter: Grant Ingersoll Assignee: Grant Ingersoll Attachments: solr2244.patch For starters, Tika has language identification capabilities that we can likely leverage, but moreover, make it easier for people to plug in language identification into the indexing process. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] Commented: (SOLR-2244) Add Language Identification support
[ https://issues.apache.org/jira/browse/SOLR-2244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12934813#action_12934813 ] Tommaso Teofili commented on SOLR-2244: --- bq. I'm going to suggest that we rename contrib/extraction to be contrib/tika and that we just roll all of these things under one area, that way we don't have to muck with libraries, etc. nice suggestion bq. Heck, it might even make sense at this point to just move it into core. +1 Add Language Identification support --- Key: SOLR-2244 URL: https://issues.apache.org/jira/browse/SOLR-2244 Project: Solr Issue Type: New Feature Reporter: Grant Ingersoll Assignee: Grant Ingersoll Attachments: solr2244.patch For starters, Tika has language identification capabilities that we can likely leverage, but moreover, make it easier for people to plug in language identification into the indexing process. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] Commented: (SOLR-2244) Add Language Identification support
[ https://issues.apache.org/jira/browse/SOLR-2244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12934816#action_12934816 ] Robert Muir commented on SOLR-2244: --- bq. Heck, it might even make sense at this point to just move it into core. non-option until SOLR-2088 is fixed. Solr core should work on turkish computers, too. Add Language Identification support --- Key: SOLR-2244 URL: https://issues.apache.org/jira/browse/SOLR-2244 Project: Solr Issue Type: New Feature Reporter: Grant Ingersoll Assignee: Grant Ingersoll Attachments: solr2244.patch For starters, Tika has language identification capabilities that we can likely leverage, but moreover, make it easier for people to plug in language identification into the indexing process. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] Commented: (SOLR-2244) Add Language Identification support
[ https://issues.apache.org/jira/browse/SOLR-2244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12934655#action_12934655 ] Grant Ingersoll commented on SOLR-2244: --- Cool, I will check it out. Add Language Identification support --- Key: SOLR-2244 URL: https://issues.apache.org/jira/browse/SOLR-2244 Project: Solr Issue Type: New Feature Reporter: Grant Ingersoll Attachments: solr2244.patch For starters, Tika has language identification capabilities that we can likely leverage, but moreover, make it easier for people to plug in language identification into the indexing process. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] Commented: (SOLR-2244) Add Language Identification support
[ https://issues.apache.org/jira/browse/SOLR-2244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12933569#action_12933569 ] Tommaso Teofili commented on SOLR-2244: --- Cool, this would be a nice feature :) Add Language Identification support --- Key: SOLR-2244 URL: https://issues.apache.org/jira/browse/SOLR-2244 Project: Solr Issue Type: New Feature Reporter: Grant Ingersoll For starters, Tika has language identification capabilities that we can likely leverage, but moreover, make it easier for people to plug in language identification into the indexing process. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org