Dan Filimon created MAHOUT-1224:
-----------------------------------

             Summary: Add the option of running a StreamingKMeans pass in the 
Reducer before BallKMeans
                 Key: MAHOUT-1224
                 URL: https://issues.apache.org/jira/browse/MAHOUT-1224
             Project: Mahout
          Issue Type: New Feature
          Components: Clustering
    Affects Versions: 0.8
            Reporter: Dan Filimon


Sometimes, the number of points passed to the reducer from the mappers in the 
StreamingKMeansDriver job is too large to fit into memory.

In that case, applying another StreamingKMeans pass can collapse the mapper 
intermediate clusters to a more manageable size to be clustered.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to