Hi Tom,

>   I have a dataset consists of duplicated sequences within day for each 
> patient (see below data) and I want to reshape the data with patient as time 
> variable. However the reshape function only takes the first sequence of the 
> replicates and ignores the second. How can I 1) average the duplicates and 2) 
> give the duplicated sequences unique names before reshaping the data ?
>
>   > data
>      patient day  seq           y
>   1       10   1 acdf -0.52416066
>   2       10   1 cdsv  0.62551539
>   3       10   1 dlfg -1.54668047
>   4       10   1 acdf  0.82404978
>   5       10   1 cdsv -1.17459914
>   6       10   2 acdf  0.47238216

You mind find that the functions in the reshape package give you a bit
more flexibility.

# The reshape package expects data like to have
# the value variable named "value"
d2 <- rename(data, c("y" = "value"))

# I think this is the format you want, which will average over the reps
cast(d2, day + seq ~ patient, mean)


Hadley

______________________________________________
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to