I am using KMeans as part of a long pipeline. Suppose I give Kmeans a SequenceFile containing Key as IntWritable and value as VectorWritable where the Keys are IDs for the Vectors, is there a utility or an option to get KMeans to spit out the IDs that belong to a cluster rather than the WeightedVectorWritable bean?
Thanks Esh
