So Florents, can you say how this works better than 1-of-n coding followed by a simple scaled Euclidean metric?
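For concreteness, here is a minimal sketch of the baseline I have in mind. The function names, the toy records, and the per-attribute weights are illustrative assumptions of mine, not Mahout code: each categorical attribute is expanded into one slot per distinct value, the matching slot is set to that attribute's scale factor, and plain Euclidean distance is taken on the result.

```python
import math


def one_of_n_encode(records, weights=None):
    """Encode categorical records as scaled 1-of-n vectors.

    records: list of equal-length tuples of categorical values.
    weights: optional per-attribute scale factors (hypothetical knob,
             defaults to 1.0 for every attribute).
    """
    n_attrs = len(records[0])
    if weights is None:
        weights = [1.0] * n_attrs
    # Distinct values of each attribute, sorted so the column layout
    # is deterministic.
    vocab = [sorted({row[j] for row in records}) for j in range(n_attrs)]
    encoded = []
    for row in records:
        vec = []
        for j, value in enumerate(row):
            # One slot per distinct value; the matching slot holds the weight.
            vec.extend(weights[j] if value == v else 0.0 for v in vocab[j])
        encoded.append(vec)
    return encoded


def euclidean(a, b):
    """Plain Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Toy data: (colour, size). Size is weighted twice as heavily as colour.
records = [("red", "small"), ("red", "large"), ("blue", "large")]
vectors = one_of_n_encode(records, weights=[1.0, 2.0])

d01 = euclidean(vectors[0], vectors[1])  # records differ only in size
d12 = euclidean(vectors[1], vectors[2])  # records differ only in colour
```

With this encoding, each mismatched attribute j adds 2 * w_j^2 to the squared distance, so any vector-based clusterer such as k-means can run on the encoded data unchanged.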
Beyond that, how would this scale?

On Sun, Jun 2, 2013 at 2:39 PM, Florents Tselai <tse...@dmst.aueb.gr> wrote:

> I've noticed (correct me if I'm wrong) that mahout lacks algorithms
> specialized in clustering data with categorical attributes.
>
> Would the community be interested in the implementation of algorithms like
> ROCK <http://www.cis.upenn.edu/~sudipto/mypapers/categorical.pdf> ?
>
> I'm currently working on this area (applied-research project) and I'd like
> to have my code open-sourced.