Hi all,

Simple question re k-means. If I have a data set with columns that are on
different scales (say col 1 has var=100 and col2 var=2), will this make a
difference to the k-means algorithm? It seems as though it does. If so,
should we first standardize the columns of the dataset so that each column
is given equal weight?

JJ

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to