Never mind on this, I read some emails out of context and now realize this has been addressed.

On Mar 19, 2009, at 6:57 AM, Grant Ingersoll (JIRA) wrote:


[ https://issues.apache.org/jira/browse/MAHOUT-99?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12683426 #action_12683426 ]

Grant Ingersoll commented on MAHOUT-99:
---------------------------------------

For the record, I ran Canopy independently, and that worked just fine.





Improving speed of KMeans
-------------------------

               Key: MAHOUT-99
               URL: https://issues.apache.org/jira/browse/MAHOUT-99
           Project: Mahout
        Issue Type: Improvement
        Components: Clustering
          Reporter: Pallavi Palleti
          Assignee: Grant Ingersoll
           Fix For: 0.1

Attachments: MAHOUT-99-1.patch, Mahout-99.patch, MAHOUT-99.patch


Improved the speed of KMeans by passing only cluster ID from mapper to reducer. Previously, whole Cluster Info as formatted s`tring was being sent. Also removed the implicit assumption of Combiner runs only once approach and the code is modified accordingly so that it won't create a bug when combiner runs zero or more than once.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Reply via email to