"Robert A. LaBudde" <[EMAIL PROTECTED]> wrote in
news:[EMAIL PROTECTED]: 

> Given a data set with a number of variables plus a response, I'd
> like to draw a randomized subset of the rows such that the marginal
> proportions of each variable in the subset closely match those of
> the full data set, and possibly the two-factor interaction marginal
> proportions for some pairs of variables as well.
> 
> This must be a common problem in data mining, but I don't seem to be
> able to locate the proper library or function for doing this in R.
> 
> Thanks for any help.

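Have you looked at the "sampling" package? I have never used it, but its
strata() function does stratified sampling, which looks like what you
need: stratifying on a variable (or on a pair of variables) keeps its
proportions in the subset close to those in the full data set.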
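
The general idea of proportional stratified sampling can also be sketched
in base R, without any package. This is only an illustration under stated
assumptions, not the sampling package's method: the function name
stratified_subset, its arguments strata_vars and frac, and the toy data
are all made up here. Each observed combination of the stratifying
variables is treated as one stratum, and roughly the same fraction of
rows is drawn from each.

```r
# Proportional stratified sampling in base R (illustrative sketch only).
# stratified_subset(), strata_vars and frac are hypothetical names.
stratified_subset <- function(data, strata_vars, frac) {
  # One stratum per observed combination of the stratifying variables
  groups <- split(seq_len(nrow(data)), data[strata_vars], drop = TRUE)
  picked <- lapply(groups, function(rows) {
    n <- max(1, round(frac * length(rows)))  # at least one row per stratum
    rows[sample.int(length(rows), n)]        # safe for one-row strata too
  })
  # Return the sampled rows in their original order
  data[sort(unlist(picked, use.names = FALSE)), , drop = FALSE]
}

# Toy data: two categorical variables plus a response (made up)
set.seed(1)
d <- data.frame(
  a = sample(c("x", "y"), 1000, replace = TRUE, prob = c(0.7, 0.3)),
  b = sample(c("p", "q", "r"), 1000, replace = TRUE),
  y = rnorm(1000)
)

# Stratify on the a-by-b cells so the two-factor margins are preserved
sub <- stratified_subset(d, c("a", "b"), frac = 0.1)

# Compare marginal proportions in the full data and in the subset
print(prop.table(table(d$a)))
print(prop.table(table(sub$a)))
```

Stratifying on the joint cells of a pair of variables preserves that
pair's two-factor proportions and hence each one-way margin, up to
rounding. With many variables the cells become sparse, and the "at least
one row per stratum" rule will then over-represent rare combinations.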

-- 
David Winsemius

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
