[jira] Commented: (MAHOUT-344) Minhash based clustering

2010-03-24 Thread Cristi Prodan (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849402#action_12849402 ] Cristi Prodan commented on MAHOUT-344: -- I've studied the min-hash algorithm these days

[jira] Commented: (MAHOUT-344) Minhash based clustering

2010-03-30 Thread Cristi Prodan (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12851416#action_12851416 ] Cristi Prodan commented on MAHOUT-344: -- I ran the code on the last.fm data set (2.). D

[jira] Updated: (MAHOUT-344) Minhash based clustering

2010-04-03 Thread Cristi Prodan (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cristi Prodan updated MAHOUT-344: - Status: Patch Available (was: Open) Thank you guys for all the encouragement and advices. I'm

[jira] Updated: (MAHOUT-344) Minhash based clustering

2010-04-03 Thread Cristi Prodan (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cristi Prodan updated MAHOUT-344: - Attachment: MAHOUT-344-v2.patch See comment above for this patch. > Minhash based clustering >

[jira] Created: (MAHOUT-365) [GSoC] Proposal to implement SimHash clustering on MapReduce

2010-04-07 Thread Cristi Prodan (JIRA)
[GSoC] Proposal to implement SimHash clustering on MapReduce Key: MAHOUT-365 URL: https://issues.apache.org/jira/browse/MAHOUT-365 Project: Mahout Issue Type: New Feature