Hi all, I am a complete newbie in action ... even though I have gone through the book of the same collection ;) I would like to classify a number of items, each of them being characterized by a number of vectors. I thought it would be a good idea to classify the vectors first. Unfortunately, the number of items keeps growing, so what I have done so far is a small piece of code that constructs the Mahout dense vectors on the fly, setting each vector's name to my item name. As far as I understand, those vectors are kept in memory ... What are the next steps for me? Storing those vectors on disk, I assume :) Then creating some canopies, and then using k-means to create my clusters. Can you guide me through the steps?
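To make the plan concrete, this is roughly the flow I have in mind, written as a small plain-Python sketch rather than Mahout code. Nothing here is Mahout API: `canopy_centers`, `kmeans`, `assign`, the thresholds `t1`/`t2` and the sample items are all made up for illustration, and I am assuming Euclidean distance. The idea is that one canopy pass gives the initial centers (and so the number of clusters), and k-means then refines them.

```python
import math


def distance(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def mean(points):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(points)
    return [sum(column) / n for column in zip(*points)]


def canopy_centers(vectors, t1, t2):
    """One canopy pass; assumes t1 > t2 > 0 (loose and tight thresholds)."""
    remaining = list(vectors)
    centers = []
    while remaining:
        seed = remaining[0]
        # Everything within the loose threshold t1 belongs to this canopy.
        members = [v for v in remaining if distance(seed, v) < t1]
        centers.append(mean(members))
        # Everything within the tight threshold t2 cannot seed another canopy.
        remaining = [v for v in remaining if distance(seed, v) >= t2]
    return centers


def assign(vector, centers):
    """Index of the center closest to the given vector."""
    return min(range(len(centers)), key=lambda i: distance(vector, centers[i]))


def kmeans(vectors, centers, iterations=10):
    """Plain k-means refinement starting from the given centers."""
    for _ in range(iterations):
        groups = [[] for _ in centers]
        for v in vectors:
            groups[assign(v, centers)].append(v)
        # An empty group keeps its previous center.
        centers = [mean(g) if g else c for g, c in zip(groups, centers)]
    return centers


# Hypothetical named items, standing in for my named vectors.
items = {
    "item-a": [1.0, 1.0],
    "item-b": [1.2, 0.8],
    "item-c": [8.0, 8.0],
    "item-d": [8.2, 7.9],
}

vectors = list(items.values())
centers = kmeans(vectors, canopy_centers(vectors, t1=3.0, t2=1.5))
labels = {name: assign(v, centers) for name, v in items.items()}
```

In this toy version everything stays in memory, which is exactly the part I do not know how to scale.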
I also have a few more questions. Can Mahout determine an "optimal" number of clusters? Once a set of clusters exists and new items are added, is it possible to update the existing clusters? Is it possible to add clusters at a lower cost than recreating them all?

Thanks for your help and time.

Regards,
Guillaume
