[jira] Commented: (MAHOUT-357) Implement a clustering algorithm on mapreduce

2010-04-01 Thread new user (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852732#action_12852732 ] new user commented on MAHOUT-357: If it is so, then can you elaborate on exactly what do y

[jira] Updated: (MAHOUT-358) the score field of output of org.apache.mahout.cf.taste.hadoop.item.RecommenderJob has negative

2010-04-01 Thread Hui Wen Han (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hui Wen Han updated MAHOUT-358: Attachment: screenshot-1.jpg the value has negative > the score field of output of > org.apache.mah

[jira] Created: (MAHOUT-358) the score field of output of org.apache.mahout.cf.taste.hadoop.item.RecommenderJob has negative

2010-04-01 Thread Hui Wen Han (JIRA)
the score field of output of org.apache.mahout.cf.taste.hadoop.item.RecommenderJob has negative --- Key: MAHOUT-358 URL: https://issues.apache.org/jira/browse/MAHOUT-358

[jira] Commented: (MAHOUT-357) Implement a clustering algorithm on mapreduce

2010-04-01 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852722#action_12852722 ] Ted Dunning commented on MAHOUT-357: I think that you misunderstood me. Probably I w

[jira] Commented: (MAHOUT-357) Implement a clustering algorithm on mapreduce

2010-04-01 Thread new user (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852716#action_12852716 ] new user commented on MAHOUT-357: yeah...it is the k-means algorithm. But, the difference

[jira] Commented: (MAHOUT-350) add one "JobName" and reduceNumber parameter to org.apache.mahout.cf.taste.hadoop.item.RecommenderJob

2010-04-01 Thread Hui Wen Han (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852707#action_12852707 ] Hui Wen Han commented on MAHOUT-350: it works now, I run following command: hadoop

[jira] Commented: (MAHOUT-353) java.lang.NullPointerException in RecommenderMapper

2010-04-01 Thread Hui Wen Han (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-353?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852704#action_12852704 ] Hui Wen Han commented on MAHOUT-353: it works now, very thanks > java.lang.NullPointer

[jira] Commented: (MAHOUT-352) not compression the final output of RecommenderJob

2010-04-01 Thread Hui Wen Han (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852701#action_12852701 ] Hui Wen Han commented on MAHOUT-352: it works now, very thanks ,Sean Owen > not compr

[jira] Commented: (MAHOUT-357) Implement a clustering algorithm on mapreduce

2010-04-01 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852575#action_12852575 ] Ted Dunning commented on MAHOUT-357: The algorithm you describe is pretty much how k-m

[jira] Created: (MAHOUT-357) Implement a clustering algorithm on mapreduce

2010-04-01 Thread new user (JIRA)
Implement a clustering algorithm on mapreduce - Key: MAHOUT-357 URL: https://issues.apache.org/jira/browse/MAHOUT-357 Project: Mahout Issue Type: New Feature Reporter: new user As I m

[jira] Commented: (MAHOUT-350) add one "JobName" and reduceNumber parameter to org.apache.mahout.cf.taste.hadoop.item.RecommenderJob

2010-04-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852483#action_12852483 ] Sean Owen commented on MAHOUT-350: I might well have broken your classes in the recent ch

[jira] Commented: (MAHOUT-350) add one "JobName" and reduceNumber parameter to org.apache.mahout.cf.taste.hadoop.item.RecommenderJob

2010-04-01 Thread Jake Mannix (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852477#action_12852477 ] Jake Mannix commented on MAHOUT-350: bq. I suppose I hadn't wanted to be presumptuous

Re: Reg. Netflix Prize Apache Mahout GSoC Application (SVD option)

2010-04-01 Thread Sean Owen
It'd be a matter of making a brand-new distributed recommender. It need not have anything to do with SVDRecommender, which is a fine but separate non-parallel implementation. Tacking on distributed slope-one is fairly easy, I think. Both together, with testing, documentation, etc. are certainly bi

Re: Reg. Netflix Prize Apache Mahout GSoC Application (SVD option)

2010-04-01 Thread Richard Simon Just
Just looking for some clarification. As a GSoC project would the SVD option mentioned below be a case of integrating the distributed SVD of MAHOUT-180 with the existing SVDRecommender? If so is there still a full GSoC project there? or would I need to combine it with say making the slope-one reco

[jira] Commented: (MAHOUT-350) add one "JobName" and reduceNumber parameter to org.apache.mahout.cf.taste.hadoop.item.RecommenderJob

2010-04-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852471#action_12852471 ] Sean Owen commented on MAHOUT-350: Yes it should move to common. I suppose I hadn't wante

[jira] Commented: (MAHOUT-356) ClassNotFoundException: org.apache.mahout.math.function.IntDoubleProcedure

2010-04-01 Thread Kris Jack (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852457#action_12852457 ] Kris Jack commented on MAHOUT-356: I was installing mahout yesterday and just did an svn

[jira] Commented: (MAHOUT-356) ClassNotFoundException: org.apache.mahout.math.function.IntDoubleProcedure

2010-04-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852455#action_12852455 ] Sean Owen commented on MAHOUT-356: You're using the latest code, which is somewhere past

[jira] Commented: (MAHOUT-356) ClassNotFoundException: org.apache.mahout.math.function.IntDoubleProcedure

2010-04-01 Thread Kris Jack (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852454#action_12852454 ] Kris Jack commented on MAHOUT-356: I have the latest svn update. I changed to my trunk/c

[jira] Commented: (MAHOUT-356) ClassNotFoundException: org.apache.mahout.math.function.IntDoubleProcedure

2010-04-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852448#action_12852448 ] Sean Owen commented on MAHOUT-356: What .jar file are you using? This class is part of th

[jira] Created: (MAHOUT-356) ClassNotFoundException: org.apache.mahout.math.function.IntDoubleProcedure

2010-04-01 Thread Kris Jack (JIRA)
ClassNotFoundException: org.apache.mahout.math.function.IntDoubleProcedure -- Key: MAHOUT-356 URL: https://issues.apache.org/jira/browse/MAHOUT-356 Project: Mahout Issue

[jira] Commented: (MAHOUT-350) add one "JobName" and reduceNumber parameter to org.apache.mahout.cf.taste.hadoop.item.RecommenderJob

2010-04-01 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852360#action_12852360 ] Drew Farris commented on MAHOUT-350: bq. (Incidentally, now might be a good time to plu

[jira] Resolved: (MAHOUT-353) java.lang.NullPointerException in RecommenderMapper

2010-04-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-353. Resolution: Fixed Assignee: Sean Owen Tentatively resolving with Hui's final suggested change (ju

[jira] Updated: (MAHOUT-355) Misleading JavaDoc comment in FPGrowth

2010-04-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-355: Resolution: Fixed Fix Version/s: 0.4 Status: Resolved (was: Patch Available) Looks so

[jira] Commented: (MAHOUT-350) add one "JobName" and reduceNumber parameter to org.apache.mahout.cf.taste.hadoop.item.RecommenderJob

2010-04-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852336#action_12852336 ] Sean Owen commented on MAHOUT-350: OK, try my latest commit. --input and --output are now

[jira] Resolved: (MAHOUT-352) not compression the final output of RecommenderJob

2010-04-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-352. Resolution: Duplicate Fix Version/s: 0.4 Assignee: Sean Owen I'm going to roll this int

[jira] Commented: (MAHOUT-354) make the output of RecommenderJob more readable

2010-04-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852318#action_12852318 ] Sean Owen commented on MAHOUT-354: Try commenting out jobConf.setBoolean("mapred.

[jira] Commented: (MAHOUT-353) java.lang.NullPointerException in RecommenderMapper

2010-04-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-353?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852317#action_12852317 ] Sean Owen commented on MAHOUT-353: Ah... I think you are correct. Let me construct anothe

[jira] Commented: (MAHOUT-352) not compression the final output of RecommenderJob

2010-04-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852316#action_12852316 ] Sean Owen commented on MAHOUT-352: Hui I think I might understand the problem now. Try us

[jira] Commented: (MAHOUT-350) add one "JobName" and reduceNumber parameter to org.apache.mahout.cf.taste.hadoop.item.RecommenderJob

2010-04-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852315#action_12852315 ] Sean Owen commented on MAHOUT-350: Thanks Drew, I see the parsing now that you prompted m

[jira] Updated: (MAHOUT-355) Misleading JavaDoc comment in FPGrowth

2010-04-01 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-355: Attachment: MAHOUT-355.patch > Misleading JavaDoc comment in FPGrowth

[jira] Updated: (MAHOUT-355) Misleading JavaDoc comment in FPGrowth

2010-04-01 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-355: Status: Patch Available (was: Open) I added a null check in the code, changed the Java

Re: My ideas for GSoC 2010

2010-04-01 Thread Cristian Prodan
Thanks Robin, I will try have a look at that. Cristi. On Thu, Apr 1, 2010 at 9:36 AM, Robin Anil wrote: > Why dont you try it on 20 newsgroups. There are about 17-18 unique topics > and couple of overlapping ones. You can easily find issues with the > clustering code with that dataset. Once its

[jira] Commented: (MAHOUT-350) add one "JobName" and reduceNumber parameter to org.apache.mahout.cf.taste.hadoop.item.RecommenderJob

2010-04-01 Thread Hui Wen Han (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852273#action_12852273 ] Hui Wen Han commented on MAHOUT-350: parameter -Dmapred.job.name=HADOOP_REC_tap_tag -D