[jira] Commented: (SOLR-2244) Add Language Identification support

2010-11-23 Thread Grant Ingersoll (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12934811#action_12934811
 ] 

Grant Ingersoll commented on SOLR-2244:
---

I'm going to suggest that we rename contrib/extraction to be contrib/tika and 
that we just roll all of these things under one area, that way we don't have to 
muck with libraries, etc.

Heck, it might even make sense at this point to just move it into core.

 Add Language Identification support
 ---

 Key: SOLR-2244
 URL: https://issues.apache.org/jira/browse/SOLR-2244
 Project: Solr
  Issue Type: New Feature
Reporter: Grant Ingersoll
Assignee: Grant Ingersoll
 Attachments: solr2244.patch


 For starters, Tika has language identification capabilities that we can 
 likely leverage, but moreover, make it easier for people to plug in language 
 identification into the indexing process.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Commented: (SOLR-2244) Add Language Identification support

2010-11-23 Thread Tommaso Teofili (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12934813#action_12934813
 ] 

Tommaso Teofili commented on SOLR-2244:
---

bq. I'm going to suggest that we rename contrib/extraction to be contrib/tika 
and that we just roll all of these things under one area, that way we don't 
have to muck with libraries, etc.

nice suggestion

bq. Heck, it might even make sense at this point to just move it into core.

+1

 Add Language Identification support
 ---

 Key: SOLR-2244
 URL: https://issues.apache.org/jira/browse/SOLR-2244
 Project: Solr
  Issue Type: New Feature
Reporter: Grant Ingersoll
Assignee: Grant Ingersoll
 Attachments: solr2244.patch


 For starters, Tika has language identification capabilities that we can 
 likely leverage, but moreover, make it easier for people to plug in language 
 identification into the indexing process.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Commented: (SOLR-2244) Add Language Identification support

2010-11-23 Thread Robert Muir (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12934816#action_12934816
 ] 

Robert Muir commented on SOLR-2244:
---

bq. Heck, it might even make sense at this point to just move it into core.

non-option until SOLR-2088 is fixed. Solr core should work on turkish 
computers, too.


 Add Language Identification support
 ---

 Key: SOLR-2244
 URL: https://issues.apache.org/jira/browse/SOLR-2244
 Project: Solr
  Issue Type: New Feature
Reporter: Grant Ingersoll
Assignee: Grant Ingersoll
 Attachments: solr2244.patch


 For starters, Tika has language identification capabilities that we can 
 likely leverage, but moreover, make it easier for people to plug in language 
 identification into the indexing process.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Commented: (SOLR-2244) Add Language Identification support

2010-11-22 Thread Grant Ingersoll (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12934655#action_12934655
 ] 

Grant Ingersoll commented on SOLR-2244:
---

Cool, I will check it out.

 Add Language Identification support
 ---

 Key: SOLR-2244
 URL: https://issues.apache.org/jira/browse/SOLR-2244
 Project: Solr
  Issue Type: New Feature
Reporter: Grant Ingersoll
 Attachments: solr2244.patch


 For starters, Tika has language identification capabilities that we can 
 likely leverage, but moreover, make it easier for people to plug in language 
 identification into the indexing process.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Commented: (SOLR-2244) Add Language Identification support

2010-11-18 Thread Tommaso Teofili (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12933569#action_12933569
 ] 

Tommaso Teofili commented on SOLR-2244:
---

Cool, this would be a nice feature :)

 Add Language Identification support
 ---

 Key: SOLR-2244
 URL: https://issues.apache.org/jira/browse/SOLR-2244
 Project: Solr
  Issue Type: New Feature
Reporter: Grant Ingersoll

 For starters, Tika has language identification capabilities that we can 
 likely leverage, but moreover, make it easier for people to plug in language 
 identification into the indexing process.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org