Hi Mahout, I have crawled the text data from some of the URLs. I would like to do k-means clustering in Mahout.
Could you let me know what are the steps involved in k-means and how we prepare input for the k-means algorithm. Another question is do we have any evaluation method to identify the accuracy of my cluster. Thanks, Venkat