[jira] Commented: (MAHOUT-305) Combine both cooccurrence-based CF M/R jobs

2010-04-26 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12860882#action_12860882 ] Ankur commented on MAHOUT-305: -- CooccurrenceCombiner caches items internally and increments

[jira] Commented: (MAHOUT-305) Combine both cooccurrence-based CF M/R jobs

2010-04-26 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12860939#action_12860939 ] Ankur commented on MAHOUT-305: -- But the answer is the partitioner ? Yes Am I right that

[jira] Commented: (MAHOUT-344) Minhash based clustering

2010-03-31 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12851756#action_12851756 ] Ankur commented on MAHOUT-344: -- Drew, thanks for pitching in as I've been running super busy

[jira] Created: (MAHOUT-344) Minhash based clustering

2010-03-22 Thread Ankur (JIRA)
Minhash based clustering - Key: MAHOUT-344 URL: https://issues.apache.org/jira/browse/MAHOUT-344 Project: Mahout Issue Type: Bug Components: Clustering Reporter: Ankur Minhash clustering

[jira] Updated: (MAHOUT-344) Minhash based clustering

2010-03-22 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur updated MAHOUT-344: - Affects Version/s: 0.3 Assignee: Ankur Minhash based clustering -

[jira] Updated: (MAHOUT-344) Minhash based clustering

2010-03-22 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur updated MAHOUT-344: - Attachment: MAHOUT-344-v1.patch As per Yonik's law of patches submitting my implementation. Please feel free to

[jira] Commented: (MAHOUT-320) Modify IntPairWritable in LDA implementation to be binary comparable to improve performance.

2010-03-03 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12840601#action_12840601 ] Ankur commented on MAHOUT-320: -- Binary comparison looks more or less the same in both the

[jira] Commented: (MAHOUT-320) Modify IntPairWritable in LDA implementation to be binary comparable to improve performance.

2010-03-03 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12840619#action_12840619 ] Ankur commented on MAHOUT-320: -- And yes I see the issue with (firstb1 - firstb2) thing in

[jira] Commented: (MAHOUT-320) Modify IntPairWritable in LDA implementation to be binary comparable to improve performance.

2010-03-03 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12840636#action_12840636 ] Ankur commented on MAHOUT-320: -- Robin, Can you update your revision and create a fresh patch ?

[jira] Commented: (MAHOUT-320) Modify IntPairWritable in LDA implementation to be binary comparable to improve performance.

2010-03-03 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12841059#action_12841059 ] Ankur commented on MAHOUT-320: -- It still complains that it cannot find the file to patch -

[jira] Commented: (MAHOUT-305) Combine both cooccurrence-based CF M/R jobs

2010-02-23 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12837192#action_12837192 ] Ankur commented on MAHOUT-305: -- Just picking random N % data for each user calculating avg

[jira] Commented: (MAHOUT-305) Combine both cooccurrence-based CF M/R jobs

2010-02-23 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12837198#action_12837198 ] Ankur commented on MAHOUT-305: -- I am not proposing that we choose random subset over all

[jira] Commented: (MAHOUT-305) Combine both cooccurrence-based CF M/R jobs

2010-02-23 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12837205#action_12837205 ] Ankur commented on MAHOUT-305: -- Well! not factoring ratings in the similarity metric but

[jira] Commented: (MAHOUT-305) Combine both cooccurrence-based CF M/R jobs

2010-02-23 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12837230#action_12837230 ] Ankur commented on MAHOUT-305: -- *smile* There we go. Our last steps are essentially

[jira] Commented: (MAHOUT-305) Combine both cooccurrence-based CF M/R jobs

2010-02-22 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12836543#action_12836543 ] Ankur commented on MAHOUT-305: -- Sean, Thanks for filing the jira. Nothing points from our

[jira] Commented: (MAHOUT-305) Combine both cooccurrence-based CF M/R jobs

2010-02-22 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1283#action_1283 ] Ankur commented on MAHOUT-305: -- Hey Sean, Have you played with netflix dataset?

[jira] Assigned: (MAHOUT-305) Combine both cooccurrence-based CF M/R jobs

2010-02-22 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur reassigned MAHOUT-305: Assignee: Ankur Combine both cooccurrence-based CF M/R jobs ---

[jira] Commented: (MAHOUT-305) Combine both cooccurrence-based CF M/R jobs

2010-02-22 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12836725#action_12836725 ] Ankur commented on MAHOUT-305: -- Typically when doing train-test data split, we divide the data

[jira] Commented: (MAHOUT-305) Combine both cooccurrence-based CF M/R jobs

2010-02-22 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12837123#action_12837123 ] Ankur commented on MAHOUT-305: -- With co-occurrence analysis we are dropping ratings. So if

[jira] Commented: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-12-23 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12794036#action_12794036 ] Ankur commented on MAHOUT-103: -- I skimmed through your version and what's present in .item

[jira] Issue Comment Edited: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-12-23 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12794036#action_12794036 ] Ankur edited comment on MAHOUT-103 at 12/23/09 12:41 PM: - I skimmed

[jira] Commented: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-12-23 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12794133#action_12794133 ] Ankur commented on MAHOUT-103: -- Your changes don't look too mutating and yes roughly speaking

[jira] Commented: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-12-22 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12793628#action_12793628 ] Ankur commented on MAHOUT-103: -- Evolving the code to integrate better with the existing stuff

[jira] Commented: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-12-17 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12791971#action_12791971 ] Ankur commented on MAHOUT-103: -- Ok, so here is the next version which I again re-wrote

[jira] Updated: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-12-17 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur updated MAHOUT-103: - Attachment: run.sh prepare.pl mahout-103.patch.v2 Co-occurence based nearest

[jira] Updated: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-12-17 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur updated MAHOUT-103: - Attachment: (was: jira-103.patch) Co-occurence based nearest neighbourhood

[jira] Commented: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-11-24 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12781838#action_12781838 ] Ankur commented on MAHOUT-103: -- For this co-occurrence based recommender I am planning to

[jira] Updated: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-11-17 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur updated MAHOUT-103: - Attachment: mahout-103.patch.v1 Ok, so here's the revised version of the algorithm that this jira proposes to

[jira] Commented: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-11-17 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12778911#action_12778911 ] Ankur commented on MAHOUT-103: -- Thanks for the quick lookup, appreciate that :-). Putting in

[jira] Commented: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-11-12 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12776939#action_12776939 ] Ankur commented on MAHOUT-103: -- Re-post an updated patch Sure I'll have the updated code

[jira] Commented: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-11-12 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12776966#action_12776966 ] Ankur commented on MAHOUT-103: -- In that case dropping ratings might not be such a good idea

[jira] Commented: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-03-17 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12682915#action_12682915 ] Ankur commented on MAHOUT-103: -- Hey Sean, Thanks for review comments. Some specific questions

[jira] Assigned: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-01-29 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur reassigned MAHOUT-103: Assignee: Ankur Co-occurence based nearest neighbourhood

[jira] Updated: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-01-29 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur updated MAHOUT-103: - Attachment: jira-103.patch Ok here is a quick patch with just enough documentation and no unit tests or dummy

[jira] Commented: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-01-29 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12668475#action_12668475 ] Ankur commented on MAHOUT-103: -- I hoping to make the above improvements after I get some

[jira] Created: (MAHOUT-103) Co-occurence based nearest neighbourhood

2009-01-20 Thread Ankur (JIRA)
Co-occurence based nearest neighbourhood Key: MAHOUT-103 URL: https://issues.apache.org/jira/browse/MAHOUT-103 Project: Mahout Issue Type: New Feature Components: Collaborative Filtering

[jira] Commented: (MAHOUT-19) Hierarchial clusterer

2009-01-13 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-19?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12663319#action_12663319 ] Ankur commented on MAHOUT-19: - Hi Karl, Welcome back :-) Can you share the following few things

[jira] Updated: (MAHOUT-4) Simple prototype for Expectation Maximization (EM)

2008-04-10 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-4?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur updated MAHOUT-4: --- Attachment: (was: Mahout_EM.patch) Simple prototype for Expectation Maximization (EM)

[jira] Updated: (MAHOUT-4) Simple prototype for Expectation Maximization (EM)

2008-02-21 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-4?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur updated MAHOUT-4: --- Attachment: (was: PLSI_EM.patch) Simple prototype for Expectation Maximization (EM)

[jira] Updated: (MAHOUT-4) Simple prototype for Expectation Maximization (EM)

2008-02-21 Thread Ankur (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-4?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur updated MAHOUT-4: --- Attachment: Mahout_EM.patch Oops! Looks like my Subversive Eclipse plugin did something whacky while generating the