Re: [R] hclust, does order of data matter?

Reshmi Chowdhury Mon, 15 Nov 2010 14:22:16 -0800

Here is the code I am using:

m <- read.csv("data_unsorted.csv",header=TRUE)
m <- na.omit(m)
cs <- hclust(dist(t(m),method="euclidean"),method="complete")
ds <- as.dendrogram(cs)

In this case, m is a 106x40 matrix of doubles.  When I change the order of
the columns, I get different results...

Thanks,
RC

On Mon, Nov 15, 2010 at 2:13 PM, Peter Langfelder <
peter.langfel...@gmail.com> wrote:

> On Mon, Nov 15, 2010 at 2:07 PM, rchowdhury <rchowdh...@alumni.upenn.edu>
> wrote:
> >
> > Hello,
> >
> > I am using the hclust function to cluster some data.  I have two separate
> > files with the same data.  The only difference is the order of the data
> in
> > the file.  For some reason, when I run the two files through the hclust
> > function, I get two completely different results.
> >
> > Does anyone know why this is happening?  Does the order of the data
> matter?
>
> No, order of the data should not matter. However, hclust takes a
> distance structure, not a matrix, so the problem may be in how you
> create the distance. Can you provide an example?
>
> Peter
>

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] hclust, does order of data matter?

Reply via email to