On Sat, 26 Jun 2010, Bryan Hanson wrote:

Hello All.  I must be missing the really obvious here:

mm <- function(d, i) median(d[i])
b1 <- boot(gravity$g, mm, R = 1000)
b1
b2 <- boot(gravity$g, mm, R = 1000, strata = gravity$series)
b2

Both b1 and b2 seem to have done (almost) the same thing, but it looks like
the strata argument in b2 has been ignored.  However, str(b1) vs str(b2)
does show that the strata have been noted correctly.  But b2$t is a 1000 x 1
array, not a 1000 x 8 array (gravity$series is a factor with 8 levels).

There is a more complex example in ?boot using the same data set that gives
a result that seems to make sense (2 levels in the factor, so $t has 2
columns).

I either misunderstand the expected behavior or I've missed some punctuation
or syntax detail.

Your punctuation and syntax are OK.
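On the expected behavior: as far as I understand boot(), the strata argument only changes how the indices are drawn (resampling is done within each stratum). The number of columns in $t is set by the length of the vector the statistic returns, not by the number of strata. A minimal sketch, assuming one median per series is what is wanted; median_by_series and b3 are names made up here for illustration and are not part of boot:

```r
library(boot)  # provides boot() and the gravity data

# Hypothetical helper: returns one median per level of series.
# d is the full data frame; i holds the resampled row indices.
median_by_series <- function(d, i) {
  resampled <- d[i, ]
  as.vector(tapply(resampled$g, resampled$series, median))
}

set.seed(1)  # arbitrary seed, only for repeatability
b3 <- boot(gravity, median_by_series, R = 1000, strata = gravity$series)
dim(b3$t)  # rows = replicates, columns = values returned by the statistic
```

Here the statistic returns eight values, so $t has eight columns. Stratifying matters for this statistic: without strata, a resample could miss a series entirely and give NA for its median.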

Note:

# draw a with-replacement resample of x, the same size as x
SISWR <- function(x) sample(x, length(x), replace = TRUE)
# no strata
var(replicate(1000,median(SISWR(gravity$g))))
[1] 0.4588338
# now stratify on series
gsplit <- split(gravity$g,gravity$series)
var(replicate(1000,median(unlist(lapply(gsplit,SISWR)))))
[1] 0.3882272

sqrt(.45) # unstratified: close to the std. error reported by b1
[1] 0.6708204
sqrt(.39) # stratified: close to the std. error reported by b2
[1] 0.6244998
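The same comparison can be made from the boot objects themselves. A sketch, assuming the boot package is installed; the seed and the names idx, orig_counts and same_counts are illustrative choices. It also uses boot.array() to check that the stratified run kept every series at its original size in each resample, i.e. that the strata were not ignored:

```r
library(boot)  # boot(), boot.array(), and the gravity data

mm <- function(d, i) median(d[i])

set.seed(2010)  # arbitrary seed, only for repeatability
b1 <- boot(gravity$g, mm, R = 1000)
b2 <- boot(gravity$g, mm, R = 1000, strata = gravity$series)

# Bootstrap standard errors: the SD of the replicates stored in $t
c(unstratified = sd(b1$t[, 1]), stratified = sd(b2$t[, 1]))

# Index matrix for b2: one row per replicate, one column per observation
idx <- boot.array(b2, indices = TRUE)

# In a stratified bootstrap each replicate draws as many rows from a
# series as the original data had, so the per-series counts must match
orig_counts <- as.vector(table(gravity$series))
same_counts <- apply(idx, 1, function(r) {
  all(as.vector(table(gravity$series[r])) == orig_counts)
})
all(same_counts)
```

If all(same_counts) is TRUE, every replicate in b2 respected the strata, even though b2$t still has a single column (the statistic returns a single median).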


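The effect of stratification depends on how much of the variation lies between strata relative to within them. The ANOVA below suggests that most of the variation in g is within series, so stratifying makes only a modest difference: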

aov(g~series,gravity)
Call:
   aov(formula = g ~ series, data = gravity)

Terms:
                  series Residuals
Sum of Squares  2818.624  8239.376
Deg. of Freedom        7        73

Residual standard error: 10.62394
Estimated effects may be unbalanced
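
To put a number on the relative amount of variation, one option is the share of the total sum of squares that lies between series. A small sketch, assuming the gravity data from the boot package; between_share is just an illustrative name:

```r
library(boot)  # for the gravity data

fit <- aov(g ~ series, data = gravity)

# Sum-of-squares column of the ANOVA table:
# first entry is series, second is Residuals
ss <- summary(fit)[[1]][["Sum Sq"]]

between_share <- ss[1] / sum(ss)
between_share
```

With the sums of squares printed above this is 2818.624 / (2818.624 + 8239.376), roughly 0.25, so about three quarters of the variation is within series.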



HTH,

Chuck


TIA, Bryan

*************
Bryan Hanson
Acting Chair
Professor of Chemistry & Biochemistry
DePauw University, Greencastle IN USA

sessionInfo()
R version 2.11.0 (2010-04-22)
x86_64-apple-darwin9.8.0

locale:
[1] en_US.UTF-8/en_US.UTF-8/C/C/en_US.UTF-8/en_US.UTF-8

attached base packages:
[1] datasets  tools     grid      graphics  grDevices utils     stats
[8] methods   base

other attached packages:
[1] boot_1.2-42        brew_1.0-3         faraway_1.0.4
[4] GGally_0.2         xtable_1.5-6       mvbutils_2.5.1
[7] ggplot2_0.8.7      digest_0.4.2       reshape_0.8.3
[10] proto_0.3-8        ChemoSpec_1.43     R.utils_1.4.0
[13] R.oo_1.7.2         R.methodsS3_1.2.0  rgl_0.91
[16] lattice_0.18-5     mvoutlier_1.4      plyr_0.1.9
[19] RColorBrewer_1.0-2 chemometrics_0.8   som_0.3-5
[22] robustbase_0.5-0-1 rpart_3.1-46       pls_2.1-0
[25] pcaPP_1.8-1        mvtnorm_0.9-9      nnet_7.3-1
[28] mclust_3.4.4       MASS_7.3-5         lars_0.9-7
[31] e1071_1.5-23       class_7.3-2

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Charles C. Berry                            (858) 534-2098
                                            Dept of Family/Preventive Medicine
E mailto:cbe...@tajo.ucsd.edu               UC San Diego
http://famprevmed.ucsd.edu/faculty/cberry/  La Jolla, San Diego 92093-0901
