Jenkins build is still unstable: Mahout-Quality #1566

2012-06-29 Thread Apache Jenkins Server

Jenkins build is still unstable: Mahout-Quality #1565

2012-06-29 Thread Apache Jenkins Server

[jira] [Commented] (MAHOUT-1035) Hotspot in recommenditembased – UnsymmetrifyMapper job

2012-06-29 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13404314#comment-13404314 ] Hudson commented on MAHOUT-1035: Integrated in Mahout-Quality #1565 (See [https://builds

[jira] [Commented] (MAHOUT-1034) ERROR in Navie Bayes Training(trainnb)

2012-06-29 Thread Robin Anil (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13404306#comment-13404306 ] Robin Anil commented on MAHOUT-1034: it seems you are running the example on a hadoop

Jenkins build is back to normal : Mahout-Examples-Classify-20News #65

2012-06-29 Thread Apache Jenkins Server

[jira] [Commented] (MAHOUT-1034) ERROR in Navie Bayes Training(trainnb)

2012-06-29 Thread Leting Wu (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13404266#comment-13404266 ] Leting Wu commented on MAHOUT-1034: --- I tried Mahout 0.6. Does not look well either. My

[jira] [Comment Edited] (MAHOUT-1034) ERROR in Navie Bayes Training(trainnb)

2012-06-29 Thread Leting Wu (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13404266#comment-13404266 ] Leting Wu edited comment on MAHOUT-1034 at 6/29/12 10:06 PM: -

[jira] [Updated] (MAHOUT-1035) Hotspot in recommenditembased – UnsymmetrifyMapper job

2012-06-29 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-1035: --- Attachment: MAHOUT-1035.patch Changed a few other places where like was cautiously u

[jira] [Resolved] (MAHOUT-1035) Hotspot in recommenditembased – UnsymmetrifyMapper job

2012-06-29 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-1035. Resolution: Fixed > Hotspot in recommenditembased – UnsymmetrifyMapper job > -

Re: kmeans does not calculate distance from the centroid in 0.7 or 0.8

2012-06-29 Thread Pat Ferrel
For what it's worth, I'm calculating the distance for every doc as I go through clusteredPoints. Since I'm not storing the results in memory, this works OK but is a tad slower. My code is also now dependent on the distance measure, where it was not before. However, this is not blocking me. Maybe i
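The one-pass approach described in this message can be sketched roughly as follows. This is a minimal illustration only, not the poster's code and not the Mahout API: `ClusteredPoint`, `meanDistanceToCentroid` and the hard-coded Euclidean measure are all hypothetical names and choices. It also shows why the caller becomes tied to a distance measure: the distance has to be recomputed on the client side.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public final class StreamingDistanceSketch {

    // Hypothetical stand-in for one clusteredPoints entry: cluster id plus point vector.
    static final class ClusteredPoint {
        final int clusterId;
        final double[] vector;

        ClusteredPoint(int clusterId, double[] vector) {
            this.clusterId = clusterId;
            this.vector = vector;
        }
    }

    // The measure is fixed here (assumption); the caller now depends on this choice.
    static double euclidean(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double diff = a[i] - b[i];
            sum += diff * diff;
        }
        return Math.sqrt(sum);
    }

    // Single pass over the points: only a running sum and a count are kept in memory.
    static double meanDistanceToCentroid(Iterable<ClusteredPoint> points,
                                         Map<Integer, double[]> centroids) {
        double total = 0.0;
        long count = 0;
        for (ClusteredPoint p : points) {
            double[] center = centroids.get(p.clusterId);
            total += euclidean(p.vector, center);
            count++;
        }
        return count == 0 ? 0.0 : total / count;
    }

    public static void main(String[] args) {
        Map<Integer, double[]> centroids = new HashMap<>();
        centroids.put(0, new double[] {3.0, 4.0});
        centroids.put(1, new double[] {1.0, 1.0});
        List<ClusteredPoint> points = List.of(
            new ClusteredPoint(0, new double[] {0.0, 0.0}),
            new ClusteredPoint(1, new double[] {1.0, 1.0}));
        System.out.println(meanDistanceToCentroid(points, centroids));
    }
}
```

The extra cost mentioned in the message corresponds to the `euclidean` call made once per point during the pass.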

Jenkins build is back to normal : Mahout-Examples-Cluster-Reuters #170

2012-06-29 Thread Apache Jenkins Server

Build failed in Jenkins: Mahout-Examples-Classify-20News #64

2012-06-29 Thread Apache Jenkins Server
See Changes: [ssc] RegressionResultAnalyzer must use US locale -- [...truncated 6207 lines...]

Re: kmeans does not calculate distance from the centroid in 0.7 or 0.8

2012-06-29 Thread Jeff Eastman
I just tried removing the normalization step and DisplayKMeans produces exactly the same result. Since the pdfs vector is just an accumulation of pdf values I think perhaps the normalization isn't necessary. The only gotcha would be if a ClusterClassifier were ever used as an AbstractVectorClas
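One plausible reason the result does not change, sketched under the assumption that the normalization step simply divides each pdf value by the sum of all of them: dividing by a positive constant keeps the ordering, so the most likely cluster is the same either way. The class and method names below are hypothetical and are not taken from Mahout.

```java
public final class NormalizeSketch {

    // Assumed form of the normalization: divide each value by the total.
    static double[] normalize(double[] pdfs) {
        double sum = 0.0;
        for (double p : pdfs) {
            sum += p;
        }
        double[] out = new double[pdfs.length];
        for (int i = 0; i < pdfs.length; i++) {
            out[i] = pdfs[i] / sum;
        }
        return out;
    }

    // Index of the largest value; on ties the first index wins.
    static int argMax(double[] values) {
        int best = 0;
        for (int i = 1; i < values.length; i++) {
            if (values[i] > values[best]) {
                best = i;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        double[] pdfs = {0.5, 0.8, 0.6};
        System.out.println(argMax(pdfs) == argMax(normalize(pdfs)));
    }
}
```

The raw values do not sum to 1, though, which would matter to any caller that expects a probability distribution rather than just the best-scoring cluster.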

Re: kmeans does not calculate distance from the centroid in 0.7 or 0.8

2012-06-29 Thread Jeff Eastman
+dev@m.a.o. Let's have this conversation for everybody on the list too. The pdf() of all DistanceMeasureClusters is: public double pdf(VectorWritable vw) { return 1 / (1 + measure.distance(vw.get(), getCenter())); } For CosineDistance, the pdf values should be distributed on 1..2. Aha!
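To make the formula concrete, here is a small standalone sketch of 1 / (1 + distance) with a cosine distance. It assumes the distance is computed as 1 - cos(a, b); that is an assumption for illustration, not a copy of Mahout's CosineDistanceMeasure, and `PdfSketch` is a hypothetical name. Under that assumption the distance lies in 0..1 for non-negative vectors (0..2 in general), so the denominator 1 + distance lies in 1..2 and the pdf itself lies in 0.5..1 (1/3..1 in general).

```java
public final class PdfSketch {

    // Cosine distance taken as 1 - cos(a, b) (assumption). Zero vectors yield NaN here.
    static double cosineDistance(double[] a, double[] b) {
        double dot = 0.0;
        double normA = 0.0;
        double normB = 0.0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        return 1.0 - dot / Math.sqrt(normA * normB);
    }

    // Same shape as the pdf() quoted in the message: 1 / (1 + distance to the center).
    static double pdf(double[] point, double[] center) {
        return 1.0 / (1.0 + cosineDistance(point, center));
    }

    public static void main(String[] args) {
        double[] center = {1.0, 0.0};
        System.out.println(pdf(new double[] {1.0, 0.0}, center));   // same direction
        System.out.println(pdf(new double[] {0.0, 1.0}, center));   // orthogonal
        System.out.println(pdf(new double[] {-1.0, 0.0}, center));  // opposite direction
    }
}
```

Because the values are squeezed into such a narrow band, they separate clusters much less sharply than a pdf that can approach 0.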

[jira] [Commented] (MAHOUT-1034) ERROR in Navie Bayes Training(trainnb)

2012-06-29 Thread Leting Wu (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13404074#comment-13404074 ] Leting Wu commented on MAHOUT-1034: --- Thank you very much for the reply. I was hoping to

[jira] [Updated] (MAHOUT-1030) Regression: Clustered Points Should be WeightedPropertyVectorWritable not WeightedVectorWritable

2012-06-29 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-1030: - Attachment: MAHOUT-1030.patch Here's a small patch that actually sets the documented weights in

[jira] [Updated] (MAHOUT-1035) Hotspot in recommenditembased – UnsymmetrifyMapper job

2012-06-29 Thread Bhaskar Devireddy (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bhaskar Devireddy updated MAHOUT-1035: -- Attachment: patch_1035_ver2.patch Made changes to the initial patch and incorporated y

[jira] [Commented] (MAHOUT-1034) ERROR in Navie Bayes Training(trainnb)

2012-06-29 Thread Robin Anil (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403988#comment-13403988 ] Robin Anil commented on MAHOUT-1034: I cannot reproduce anything. Can you give me the

[jira] [Assigned] (MAHOUT-1035) Hotspot in recommenditembased – UnsymmetrifyMapper job

2012-06-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned MAHOUT-1035: - Assignee: Sebastian Schelter (was: Sean Owen) > Hotspot in recommenditembased – Unsymmetrif