Hi Devan and Xiangrui, Can you please explain the cost and optimization function of the KNN alogorithim that is being used?
Thank and Regards, Sudipta On Thu, Jan 22, 2015 at 6:59 PM, DEVAN M.S. <msdeva...@gmail.com> wrote: > Thanks Xiangrui Meng will try this. > > And, found this https://github.com/kaushikranjan/knnJoin also. > Will this work with double data ? Can we find out z value of > *Vector(10.3,4.5,3,5)* ? > > > > > > > On Thu, Jan 22, 2015 at 12:25 AM, Xiangrui Meng <men...@gmail.com> wrote: > >> For large datasets, you need hashing in order to compute k-nearest >> neighbors locally. You can start with LSH + k-nearest in Google >> scholar: http://scholar.google.com/scholar?q=lsh+k+nearest -Xiangrui >> >> On Tue, Jan 20, 2015 at 9:55 PM, DEVAN M.S. <msdeva...@gmail.com> wrote: >> > Hi all, >> > >> > Please help me to find out best way for K-nearest neighbor using spark >> for >> > large data sets. >> > >> > > -- Sudipta Banerjee Consultant, Business Analytics and Cloud Based Architecture Call me +919019578099