This sounds awesome! If you can get the algorithm working I would be more than happy to help integrate it into the Algorithms Framework (so other people could use it too).
Trevor Grant Data Scientist https://github.com/rawkintrevo http://stackexchange.com/users/3002022/rawkintrevo http://trevorgrant.org *"Fortunate is he, who is able to know the causes of things." -Virgil* On Wed, Mar 29, 2017 at 1:41 PM, Adi Haviv <adiha...@gmail.com> wrote: > I wish I could. i wasn't able to find any solution (with mahout or any > other) that can do kmeans on over 10M sparse vectors. > > happy to connect and collaborate on a solution if you like. please contact > me on a private email (or on linkedin -Adi Haviv). > > Thanks, > Adi > > On Wed, Mar 29, 2017 at 11:41 AM, KHATWANI PARTH BHARAT < > h2016...@pilani.bits-pilani.ac.in> wrote: > > > No,i am trying to write the kmeans from scratch using Mahout DSL's > > Distributed Row Matrix. > > And i am not getting how proceed. Can you help me with that. > > > > > > On Wed, Mar 29, 2017 at 9:04 PM, Adi Haviv <adiha...@gmail.com> wrote: > > > > > Is it working? I never got any of the mahout clustering to work on > eBay's > > > data. > > > > > > On Mar 29, 2017 11:30 AM, "KHATWANI PARTH BHARAT" < > > > h2016...@pilani.bits-pilani.ac.in> wrote: > > > > > > > Sir, > > > > I am trying to write the kmeans clustering algorithm using Mahout > > Samsara > > > > but i am bit confused > > > > about how to leverage Distributed Row Matrix for the same. Can > anybody > > > help > > > > me with same. > > > > > > > > > > > > > > > > > > > > > > > > Thanks > > > > Parth Khatwani > > > > > > > > > > > > > -- > Adi Haviv. >