Hi all, Simple question re k-means. If I have a data set with columns that are on different scales (say col 1 has var=100 and col2 var=2), will this make a difference to the k-means algorithm? It seems as though it does. If so, should we first standardize the columns of the dataset so that each column is given equal weight?
JJ [[alternative HTML version deleted]] ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.