Hi list, I was talking to Isabel Drost in December, and we talked about a nice paper from last year's KDD conference that suggests a neat trick that allows doing SGD for matrix factorization in parallel.
She said this would be interesting for some of you here. Here is the paper: http://www.mpi-inf.mpg.de/~rgemulla/publications/gemulla11dsgd.pdf Note that the authors themselves implemented it already in Hadoop. Maybe someone would like to pick this up. I am still trying to find my way around the Mahout/Taste source code, so do not expect anything from me too soon ;-) Best regards, Zeno
