Yes. I have been working (slowly) on moving some very fast single-pass clustering into Mahout. My work in progress currently clusters small dense vectors very quickly, and it should scale to sparse vectors fairly well with some small changes.
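
To illustrate what "single pass" means here, below is a minimal sketch of one-pass clustering over small dense vectors. It is not the code from the knn repository or from Mahout; the class name `SinglePassSketch`, the `threshold` parameter, and the rule "start a new cluster when the nearest centroid is farther than the threshold" are all assumptions made for illustration. Each point is read once and either folded into the nearest running centroid or used to start a new one.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not the knn or Mahout implementation.
public class SinglePassSketch {
    /** A running centroid: the mean of the points assigned so far, plus their count. */
    static final class Centroid {
        final double[] mean;
        int count;

        Centroid(double[] first) {
            this.mean = first.clone();
            this.count = 1;
        }

        // Fold one more point into the running mean without storing the point.
        void add(double[] p) {
            count++;
            for (int i = 0; i < mean.length; i++) {
                mean[i] += (p[i] - mean[i]) / count;
            }
        }
    }

    static double squaredDistance(double[] a, double[] b) {
        double sum = 0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return sum;
    }

    /** Reads every point exactly once; threshold is an assumed distance cutoff. */
    static List<Centroid> cluster(double[][] points, double threshold) {
        List<Centroid> centroids = new ArrayList<>();
        double limit = threshold * threshold;
        for (double[] p : points) {
            Centroid nearest = null;
            double best = Double.POSITIVE_INFINITY;
            for (Centroid c : centroids) {
                double d = squaredDistance(p, c.mean);
                if (d < best) {
                    best = d;
                    nearest = c;
                }
            }
            if (nearest == null || best > limit) {
                centroids.add(new Centroid(p)); // too far from everything: new cluster
            } else {
                nearest.add(p);                 // close enough: update the running mean
            }
        }
        return centroids;
    }

    public static void main(String[] args) {
        double[][] points = {{0, 0}, {0.1, 0}, {5, 5}, {5.1, 5}, {0, 0.1}};
        for (Centroid c : cluster(points, 1.0)) {
            System.out.println(c.count + " points near (" + c.mean[0] + ", " + c.mean[1] + ")");
        }
    }
}
```

The linear scan over centroids keeps the sketch short; a faster nearest-centroid search would replace that inner loop, and the result of such a pass could serve as a reduced input for a conventional k-means run.
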
See https://github.com/tdunning/knn for more info.

On Wed, Sep 12, 2012 at 7:26 PM, Elaine Gan <elaine-...@gmo.jp> wrote:

> Any ways to improve on the mahout kmeans to speed it up?