On Thu, 8 Apr 2010, ONKELINX, Thierry wrote:

Dear Thomas,

Thank you for your informative answer. We used epi.stratasize() to
estimate the required sample size per stratum. Notice in the example
below that it can select a sample size smaller than 2 in the very small
strata. Would you recommend to sample at least two items per stratum or
rather to merge some strata a priori until the sample size is at least
2?

Merging the strata would be best

Or is there a better way to estimate the sample size per stratum?
Note that the stratification only aims to get a good geographical
coverage (the strata a geographical regions). We are not interested in
estimates per stratum.

library(epiR)
N <- c(39, 270, 1060, 1336, 118, 26, 154, 10, 3)
epi.stratasize(strata.n = N, strata.mean = 0.9, epsilon = 0.05, method =
"proportion")
$strata.sample
[1]  2 15 57 72  6  1  8  1  0

$total.sample
[1] 162

The probability of sampling was proportional with the area (larger
polygons are more likely to be selected than smaller ones). So we will
use weights = I(1/Area), as you suggested.

If you are using probability proportional to size and you want to use 
finite-population correctsions, you also need to specify the fpc= argument 
differently. The simplest version is an approximation that uses only the 
marginal sampling probabilities
  svydesign(id=~1, fpc=~p, pps="brewer", strata=~strat
where p is a variable with the actual sampling probability (not just 
proportional to sampling probability).

Also, how did you do the sampling?  It's quite hard to do unequal probability 
sampling without replacement (the R sample() function doesn't actually  do it, 
though the sampling package does).

    -thomas

Thomas Lumley                   Assoc. Professor, Biostatistics
tlum...@u.washington.edu        University of Washington, Seattle

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to