RE: [R] sapply behavior

2004-09-27 Thread Prof Brian Ripley
On Mon, 27 Sep 2004, Liaw, Andy wrote:

> The problem is that temp2 is a data frame, and the function you are
> sapply()ing to returns a row from a data frame.  A data frame is really a
> list, with each variable corresponding to a component.  If you extract a row
> of a data frame, you get another data frame, not a vector, even if all
> variables are the same type.  sapply() can really `simplify' the right way
> if it's given a vector (or matrix).  Consider:
> 
> > str(temp2)
> `data.frame':   6 obs. of  4 variables:
>  $ A1: int  5 2 4 6 1 3
>  $ A2: int  5 2 4 6 1 3
>  $ A3: int  5 2 4 6 1 3
>  $ A4: int  5 2 4 6 1 3
> > temp2 <- as.matrix(temp2)
> > str(temp2)
>  int [1:6, 1:4] 5 2 4 6 1 3 5 2 4 6 ...
>  - attr(*, "dimnames")=List of 2
>   ..$ : chr [1:6] "1" "2" "3" "4" ...
>   ..$ : chr [1:4] "A1" "A2" "A3" "A4"
> > str(sapply(1:6,function(x){xmat<-temp2[temp2[,1]==x,,drop=F]; xmat[1,]}))
>  int [1:4, 1:6] 1 1 1 1 2 2 2 2 3 3 ...
>  - attr(*, "dimnames")=List of 2
>   ..$ : chr [1:4] "A1" "A2" "A3" "A4"
>   ..$ : NULL
> 
> (The is.matrix() function probably just check whether the dim attribute is a
> vector of length 2, and not a data frame (as it says in ?is.matrix).  The
> newtemp2 object you get is a list with 24 components, each component is a
> vector of one integer, and has a dim attribute of c(4, 6).  Not what I would
> call a matrix.)

That *is* a matrix, though, and is useful for lists of length greater than
one.  A matrix in R is just a vector with a dim attribute (and length the
product of the dims's), so as well as any of the atomic vectors it can be
a generic vector aka list.

-- 
Brian D. Ripley,  [EMAIL PROTECTED]
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford, Tel:  +44 1865 272861 (self)
1 South Parks Road, +44 1865 272866 (PA)
Oxford OX1 3TG, UKFax:  +44 1865 272595

__
[EMAIL PROTECTED] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html


Re: [R] sapply behavior

2004-09-27 Thread Gabor Grothendieck
Elizabeth Purdom  stanford.edu> writes:

: 
: Hi,
: 
: I use sapply very frequently, but I have recently noticed a behavior of 
: sapply which I don't understand and have never seen before. Basically, 
: sapply returns what looks like a matrix,  says it a matrix, and appears to 
: let me do matrix things (like transpose). But it is also a list and behaves 
: like a list when I subset it, not a vector (so I can't sort a row for 
: instance). I don't know where this is coming from so as to avoid it, nor 
: how to handle the beast that sapply is returning. I double checked my old 
: version of R and apparently this same thing happens in 1.8.0, though I 
: never experienced it. I had a hard time reproducing it, and I don't know 
: what's setting it off, but the code below seems to do it for me. (I'm using 
: R on Windows XP, either 1.8.0 or 1.9.1)
: 
: Thanks for any help,
: Elizabeth Purdom
: 
: 
:  > temp2<-matrix(sample(1:6,6,replace=F),byrow=F,nrow=6,ncol=4)
:  > colnames(temp2)<-paste("A",as.character(1:4),sep="")
:  > temp2<-as.data.frame(temp2)
:  > 
: newtemp2<-sapply((1:6),function(x){xmat<-temp2[temp2[,1]==x,,drop=F];return
(xmat[1,])})
:  > print(newtemp2) #looks like matrix
: [,1] [,2] [,3] [,4] [,5] [,6]
: A1 123456
: A2 123456
: A3 123456
: A4 123456
:  > is.matrix(newtemp2) #says it's matrix
: [1] TRUE
:  > class(newtemp2)
: [1] "matrix"
:  > is.list(newtemp2) #but also list
: [1] TRUE
:  > newtemp2[,1] #can't subset and get a vector back; same thing happens for 
: rows.
: $A1
: [1] 1
: 
: $A2
: [1] 1
: 
: $A3
: [1] 1
: 
: $A4
: [1] 1
: #other things about it:
:  > names(newtemp2)
: NULL
:  > dimnames(newtemp2)
: [[1]]
: [1] "A1" "A2" "A3" "A4"
: 
: [[2]]
: NULL
:  > dim(newtemp2)
: [1] 4 6
:  > length(newtemp2)
: [1] 24


The problem is that your function is returning a one row data frame
and when sapply tries to simplify the resulting list of 6 data frames 
that gives a list with dimensions rather what you were expecting
which is a vector with dimensions.

Let us call the original anonymous function in your post (i.e. the one
passed to sapply there), f.  We can modify it to produce f2 which is like
f except that we wrap the return expression in c() to turn it into a
vector:

  f2 <- function(x){xmat<-temp2[temp2[,1]==x,,drop=F];return(c(xmat[1,]))}
  sapply(1:6, f2)

If you really do want to return a one row data frame then
use rbind to bind the data frames together rather than sapply:

   do.call("rbind", lapply(1:6, f))

__
[EMAIL PROTECTED] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html


Re: [R] sapply behavior

2004-09-27 Thread Douglas Bates
Elizabeth Purdom wrote:
I use sapply very frequently, but I have recently noticed a behavior of 
sapply which I don't understand and have never seen before. Basically, 
sapply returns what looks like a matrix,  says it a matrix, and appears 
to let me do matrix things (like transpose). But it is also a list and 
behaves like a list when I subset it, not a vector (so I can't sort a 
row for instance). I don't know where this is coming from so as to avoid 
it, nor how to handle the beast that sapply is returning. I double 
checked my old version of R and apparently this same thing happens in 
1.8.0, though I never experienced it. I had a hard time reproducing it, 
and I don't know what's setting it off, but the code below seems to do 
it for me. (I'm using R on Windows XP, either 1.8.0 or 1.9.1)

Thanks for any help,
Elizabeth Purdom
 > temp2<-matrix(sample(1:6,6,replace=F),byrow=F,nrow=6,ncol=4)
 > colnames(temp2)<-paste("A",as.character(1:4),sep="")
 > temp2<-as.data.frame(temp2)
It is this coercion to the data frame that is injecting a list-like 
property into the result.  Try your script without that line and it will 
work as you expect.

 > newtemp2<-sapply((1:6),function(x){xmat<-temp2[temp2[,1]==x,,drop=F];return(xmat[1,])}) 
 > print(newtemp2) #looks like matrix
   [,1] [,2] [,3] [,4] [,5] [,6]
A1 123456
A2 123456
A3 123456
A4 123456
The best thing to do in a situation like this is to use the str function 
to see the details of the structure of the object.

__
[EMAIL PROTECTED] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html


RE: [R] sapply behavior

2004-09-27 Thread Liaw, Andy
The problem is that temp2 is a data frame, and the function you are
sapply()ing to returns a row from a data frame.  A data frame is really a
list, with each variable corresponding to a component.  If you extract a row
of a data frame, you get another data frame, not a vector, even if all
variables are the same type.  sapply() can really `simplify' the right way
if it's given a vector (or matrix).  Consider:

> str(temp2)
`data.frame':   6 obs. of  4 variables:
 $ A1: int  5 2 4 6 1 3
 $ A2: int  5 2 4 6 1 3
 $ A3: int  5 2 4 6 1 3
 $ A4: int  5 2 4 6 1 3
> temp2 <- as.matrix(temp2)
> str(temp2)
 int [1:6, 1:4] 5 2 4 6 1 3 5 2 4 6 ...
 - attr(*, "dimnames")=List of 2
  ..$ : chr [1:6] "1" "2" "3" "4" ...
  ..$ : chr [1:4] "A1" "A2" "A3" "A4"
> str(sapply(1:6,function(x){xmat<-temp2[temp2[,1]==x,,drop=F]; xmat[1,]}))
 int [1:4, 1:6] 1 1 1 1 2 2 2 2 3 3 ...
 - attr(*, "dimnames")=List of 2
  ..$ : chr [1:4] "A1" "A2" "A3" "A4"
  ..$ : NULL

(The is.matrix() function probably just check whether the dim attribute is a
vector of length 2, and not a data frame (as it says in ?is.matrix).  The
newtemp2 object you get is a list with 24 components, each component is a
vector of one integer, and has a dim attribute of c(4, 6).  Not what I would
call a matrix.)

HTH,
Andy


> From: Elizabeth Purdom
> 
> Hi,
> 
> I use sapply very frequently, but I have recently noticed a 
> behavior of 
> sapply which I don't understand and have never seen before. 
> Basically, 
> sapply returns what looks like a matrix,  says it a matrix, 
> and appears to 
> let me do matrix things (like transpose). But it is also a 
> list and behaves 
> like a list when I subset it, not a vector (so I can't sort a row for 
> instance). I don't know where this is coming from so as to 
> avoid it, nor 
> how to handle the beast that sapply is returning. I double 
> checked my old 
> version of R and apparently this same thing happens in 1.8.0, 
> though I 
> never experienced it. I had a hard time reproducing it, and I 
> don't know 
> what's setting it off, but the code below seems to do it for 
> me. (I'm using 
> R on Windows XP, either 1.8.0 or 1.9.1)
> 
> Thanks for any help,
> Elizabeth Purdom
> 
> 
>  > temp2<-matrix(sample(1:6,6,replace=F),byrow=F,nrow=6,ncol=4)
>  > colnames(temp2)<-paste("A",as.character(1:4),sep="")
>  > temp2<-as.data.frame(temp2)
>  > 
> newtemp2<-sapply((1:6),function(x){xmat<-temp2[temp2[,1]==x,,d
> rop=F];return(xmat[1,])})
>  > print(newtemp2) #looks like matrix
> [,1] [,2] [,3] [,4] [,5] [,6]
> A1 123456
> A2 123456
> A3 123456
> A4 123456
>  > is.matrix(newtemp2) #says it's matrix
> [1] TRUE
>  > class(newtemp2)
> [1] "matrix"
>  > is.list(newtemp2) #but also list
> [1] TRUE
>  > newtemp2[,1] #can't subset and get a vector back; same 
> thing happens for 
> rows.
> $A1
> [1] 1
> 
> $A2
> [1] 1
> 
> $A3
> [1] 1
> 
> $A4
> [1] 1
> #other things about it:
>  > names(newtemp2)
> NULL
>  > dimnames(newtemp2)
> [[1]]
> [1] "A1" "A2" "A3" "A4"
> 
> [[2]]
> NULL
>  > dim(newtemp2)
> [1] 4 6
>  > length(newtemp2)
> [1] 24
> 
> __
> [EMAIL PROTECTED] mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide! 
> http://www.R-project.org/posting-guide.html
> 
>

__
[EMAIL PROTECTED] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html


[R] sapply behavior

2004-09-27 Thread Elizabeth Purdom
Hi,
I use sapply very frequently, but I have recently noticed a behavior of 
sapply which I don't understand and have never seen before. Basically, 
sapply returns what looks like a matrix,  says it a matrix, and appears to 
let me do matrix things (like transpose). But it is also a list and behaves 
like a list when I subset it, not a vector (so I can't sort a row for 
instance). I don't know where this is coming from so as to avoid it, nor 
how to handle the beast that sapply is returning. I double checked my old 
version of R and apparently this same thing happens in 1.8.0, though I 
never experienced it. I had a hard time reproducing it, and I don't know 
what's setting it off, but the code below seems to do it for me. (I'm using 
R on Windows XP, either 1.8.0 or 1.9.1)

Thanks for any help,
Elizabeth Purdom
> temp2<-matrix(sample(1:6,6,replace=F),byrow=F,nrow=6,ncol=4)
> colnames(temp2)<-paste("A",as.character(1:4),sep="")
> temp2<-as.data.frame(temp2)
> 
newtemp2<-sapply((1:6),function(x){xmat<-temp2[temp2[,1]==x,,drop=F];return(xmat[1,])})
> print(newtemp2) #looks like matrix
   [,1] [,2] [,3] [,4] [,5] [,6]
A1 123456
A2 123456
A3 123456
A4 123456
> is.matrix(newtemp2) #says it's matrix
[1] TRUE
> class(newtemp2)
[1] "matrix"
> is.list(newtemp2) #but also list
[1] TRUE
> newtemp2[,1] #can't subset and get a vector back; same thing happens for 
rows.
$A1
[1] 1

$A2
[1] 1
$A3
[1] 1
$A4
[1] 1
#other things about it:
> names(newtemp2)
NULL
> dimnames(newtemp2)
[[1]]
[1] "A1" "A2" "A3" "A4"
[[2]]
NULL
> dim(newtemp2)
[1] 4 6
> length(newtemp2)
[1] 24
__
[EMAIL PROTECTED] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html