Dan Filimon created MAHOUT-1224: ----------------------------------- Summary: Add the option of running a StreamingKMeans pass in the Reducer before BallKMeans Key: MAHOUT-1224 URL: https://issues.apache.org/jira/browse/MAHOUT-1224 Project: Mahout Issue Type: New Feature Components: Clustering Affects Versions: 0.8 Reporter: Dan Filimon
Sometimes, the number of points passed to the reducer from the mappers in the StreamingKMeansDriver job is too large to fit into memory. In that case, applying another StreamingKMeans pass can collapse the mapper intermediate clusters to a more manageable size to be clustered. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira