On Jun 26, 2009, at 11:32 AM, Grant Ingersoll wrote:
Isn't the KMeansJob pretty much redundant, assuming we add a
parameter to KMeansDriver to take in the number of reduce tasks?
Also, the variable naming in KMeansJob implies that the number of
reduce tasks (numCentroids) is actually the "k" in k-Means, even
though this value is currently fixed at 2 when using KMeansDriver.
I'm trying to make arg handling easier for MAHOUT-138.
KMeansJob also deletes the OutputPath if it exists.
I'm going to delete the KMeansJob file and fold this functionality
into KMeansDriver.
-Grant