Never mind on this, I read some emails out of context and now realize
this has been addressed.
On Mar 19, 2009, at 6:57 AM, Grant Ingersoll (JIRA) wrote:
[ https://issues.apache.org/jira/browse/MAHOUT-99?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12683426
#action_12683426 ]
Grant Ingersoll commented on MAHOUT-99:
---------------------------------------
For the record, I ran Canopy independently, and that worked just fine.
Improving speed of KMeans
-------------------------
Key: MAHOUT-99
URL: https://issues.apache.org/jira/browse/MAHOUT-99
Project: Mahout
Issue Type: Improvement
Components: Clustering
Reporter: Pallavi Palleti
Assignee: Grant Ingersoll
Fix For: 0.1
Attachments: MAHOUT-99-1.patch, Mahout-99.patch,
MAHOUT-99.patch
Improved the speed of KMeans by passing only cluster ID from mapper
to reducer. Previously, whole Cluster Info as formatted s`tring was
being sent.
Also removed the implicit assumption of Combiner runs only once
approach and the code is modified accordingly so that it won't
create a bug when combiner runs zero or more than once.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.