Hi all,

I am a complete newbie in action ... even though I have gone through the book of 
the same collection ;)
I would like to classify a number of items, each of them being characterized by 
a number of vectors. I thought it would be a good idea to classify the vectors 
first. Unfortunately the number of items keeps growing, so what I have done so 
far is a small piece of code that constructs Mahout dense vectors on the fly, 
setting each vector's name to my item name. As far as I understand, those 
vectors are kept in memory ...
What are the next steps for me?
Storing those vectors on disk, I assume :)
Then creating some canopies, and then using k-means to create my clusters.
Can you guide me through the steps?
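
To make the plan concrete, here is a rough plain-Java sketch of what I 
understand by "canopies first, then k-means". It deliberately uses no Mahout 
classes, so the class name, the method names and the threshold t2 are all my own 
illustrative choices, not Mahout's API. The canopy pass is simplified: it only 
uses the tight threshold to pick seed centers and ignores the looser one.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class CanopyKMeansSketch {

    // Euclidean distance between two equal-length vectors.
    static double distance(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return Math.sqrt(sum);
    }

    // Simplified canopy pass: take the first remaining point as a center,
    // then drop every remaining point closer than t2 to that center.
    static List<double[]> canopyCenters(List<double[]> points, double t2) {
        List<double[]> remaining = new ArrayList<>(points);
        List<double[]> centers = new ArrayList<>();
        while (!remaining.isEmpty()) {
            double[] center = remaining.remove(0);
            centers.add(center.clone());
            remaining.removeIf(p -> distance(center, p) < t2);
        }
        return centers;
    }

    // Index of the center closest to point p.
    static int nearest(List<double[]> centers, double[] p) {
        int best = 0;
        for (int c = 1; c < centers.size(); c++) {
            if (distance(centers.get(c), p) < distance(centers.get(best), p)) {
                best = c;
            }
        }
        return best;
    }

    // Plain k-means iterations, seeded with the canopy centers.
    static List<double[]> kMeans(List<double[]> points, List<double[]> seeds, int iterations) {
        List<double[]> centers = new ArrayList<>();
        for (double[] s : seeds) {
            centers.add(s.clone());
        }
        int dim = points.get(0).length;
        for (int it = 0; it < iterations; it++) {
            double[][] sums = new double[centers.size()][dim];
            int[] counts = new int[centers.size()];
            for (double[] p : points) {
                int c = nearest(centers, p);
                counts[c]++;
                for (int i = 0; i < dim; i++) {
                    sums[c][i] += p[i];
                }
            }
            for (int c = 0; c < centers.size(); c++) {
                if (counts[c] == 0) {
                    continue; // an empty cluster keeps its previous center
                }
                for (int i = 0; i < dim; i++) {
                    centers.get(c)[i] = sums[c][i] / counts[c];
                }
            }
        }
        return centers;
    }

    public static void main(String[] args) {
        // Made-up sample data, only to exercise the two steps.
        List<double[]> points = Arrays.asList(
            new double[]{1.0, 1.0}, new double[]{1.2, 0.8},
            new double[]{8.0, 8.0}, new double[]{8.2, 7.8});
        List<double[]> seeds = canopyCenters(points, 3.0);
        for (double[] c : kMeans(points, seeds, 10)) {
            System.out.println(Arrays.toString(c));
        }
    }
}
```

What I am missing is how to do the same thing with Mahout itself, on vectors 
that live on disk rather than in a Java list.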

I also have a few more questions:
Can Mahout determine an "optimal" number of clusters?
Once a set of clusters exists and new items are added, is it possible to update 
the existing clusters? Is it possible to add clusters at a lower cost than 
recreating them?
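
To illustrate what I mean by updating, here is a small plain-Java sketch of the 
behaviour I have in mind: a new vector either moves its nearest centroid a 
little (running mean) or, if it is too far from every centroid, starts a new 
cluster. Everything here (the class name, the threshold, the update rule) is my 
own assumption and not something I found in Mahout; I do not know whether 
Mahout offers an equivalent.

```java
import java.util.ArrayList;
import java.util.List;

class IncrementalClustersSketch {
    final List<double[]> centroids = new ArrayList<>();
    final List<Integer> counts = new ArrayList<>();
    final double newClusterThreshold; // hypothetical tuning parameter

    IncrementalClustersSketch(double newClusterThreshold) {
        this.newClusterThreshold = newClusterThreshold;
    }

    // Euclidean distance between two equal-length vectors.
    static double distance(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return Math.sqrt(sum);
    }

    // Adds one vector and returns the index of the cluster it ended up in.
    int add(double[] x) {
        int best = -1;
        double bestDist = Double.MAX_VALUE;
        for (int c = 0; c < centroids.size(); c++) {
            double d = distance(centroids.get(c), x);
            if (d < bestDist) {
                bestDist = d;
                best = c;
            }
        }
        // No cluster yet, or the vector is too far from all of them: new cluster.
        if (best < 0 || bestDist > newClusterThreshold) {
            centroids.add(x.clone());
            counts.add(1);
            return centroids.size() - 1;
        }
        // Otherwise shift the nearest centroid towards x (running mean).
        int n = counts.get(best) + 1;
        double[] c = centroids.get(best);
        for (int i = 0; i < c.length; i++) {
            c[i] += (x[i] - c[i]) / n;
        }
        counts.set(best, n);
        return best;
    }

    public static void main(String[] args) {
        // Made-up sample data, only to exercise the update rule.
        IncrementalClustersSketch clusters = new IncrementalClustersSketch(5.0);
        System.out.println(clusters.add(new double[]{0.0, 0.0}));
        System.out.println(clusters.add(new double[]{2.0, 0.0}));
        System.out.println(clusters.add(new double[]{100.0, 100.0}));
    }
}
```

The cost per new item is one distance computation per existing cluster, which 
is what I mean by "lower cost than recreating".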

Thanks for your help and time
Regards
Guillaume
