The problem with k-means is that we have to define the number of clusters up front, which I don't want in this case. So I am thinking of something like hierarchical clustering. Any ideas or suggestions?
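
To make the idea concrete, here is a minimal sketch in plain Python (not Spark) of what I mean: single-linkage agglomerative clustering cut at a similarity threshold, so the number of clusters falls out of the data instead of being chosen in advance. The helper names (normalize, similarity, cluster_names), the 0.8 threshold, and the sample company names are all made up for illustration, and the string similarity from difflib is only one possible choice.

```python
from difflib import SequenceMatcher
from itertools import combinations


def normalize(name):
    # Lowercase, turn punctuation into spaces, collapse repeated whitespace.
    cleaned = "".join(ch if ch.isalnum() or ch.isspace() else " " for ch in name.lower())
    return " ".join(cleaned.split())


def similarity(a, b):
    # Similarity in [0, 1] between the normalized names (assumed metric).
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def cluster_names(names, threshold=0.8):
    # Single-linkage clustering cut at `threshold`: any pair at least this
    # similar ends up in the same cluster. Implemented with union-find.
    parent = list(range(len(names)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i, j in combinations(range(len(names)), 2):
        if similarity(names[i], names[j]) >= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i, name in enumerate(names):
        groups.setdefault(find(i), []).append(name)
    return list(groups.values())


if __name__ == "__main__":
    sample = ["Acme Corp", "ACME Corp.", "Globex Inc", "Globex, Inc.", "Initech"]
    for group in cluster_names(sample):
        print(group)
```

This compares every pair of names, so it is quadratic in the number of names and would need some kind of blocking or candidate generation before it could run at Spark scale. The threshold takes the place of k, and it would still have to be tuned against a labelled sample to judge accuracy.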
On 21 April 2015 at 20:51, Jeetendra Gangele <gangele...@gmail.com> wrote:
> I have a requirement in which I want to match the company name .. and I am
> thinking to solve this using clustering technique.
>
> Can anybody suggest which algo I should Use in Spark and how to evaluate
> the running time and accuracy for this particular problem.
>
> I checked K means looks good.
> Any idea suggestions?