> On the other hand, if we use 7 clusters, > > > *> k = kmeans(iris[,1:4], centers=7, nstart=10)* > > *> table(iris$Species, k$cluster)* > > cluster > 1 2 3 4 5 6 7 > setosa* 0 0 28 0 22 0 0* > versicolor* 0 7 0 20 0 0 23* > virginica* 12 0 0 1 0 24 13* > > Each cluster is now composed of almost exactly one species. Only cluster > 4 has any impurity and it is 95% composed of just versicolor samples. > @Ted,
How about cluster 7? it seems it is not as a demonstrable improvement, or i don't get something
