On Mon, Oct 10, 2011 at 4:14 PM, Joshua Wiley <jwiley.ps...@gmail.com> wrote:
> I could be way off base here, but my concern about pre-splitting the data is
> that you will have your data, and a second copy of your data that is something
> like a list where each element contains the portion of the data for that
> split. Good speed-wise, bad memory-wise. My hope with the technique I
> showed (again, I may not have accomplished it) was to have, at any one
> time, only the original data and a copy of the particular elements being
> worked with. Of course this is not an issue if you have plenty of memory.

That's exactly what plyr does behind the scenes.

Hadley

-- 
Assistant Professor / Dobelman Family Junior Chair
Department of Statistics / Rice University
http://had.co.nz/
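
For readers following the thread, here is a minimal base-R sketch of the trade-off described in the quoted message. It is not plyr's actual code, and the data frame `df` and its columns `g` and `x` are made up purely for illustration. The first approach builds a full list-shaped second copy of the data; the second splits only the row indices and extracts one group at a time, so that only the original data and the current piece need to exist at once.

```r
# Hypothetical example data: `df`, `g` and `x` are illustrative names only.
set.seed(1)
df <- data.frame(g = rep(c("a", "b", "c"), each = 4), x = rnorm(12))

# Approach 1: pre-split the data.
# `pieces` is a list holding a second copy of every row of `df`.
pieces <- split(df, df$g)
means_presplit <- vapply(pieces, function(piece) mean(piece$x), numeric(1))

# Approach 2: split only the row indices, then subset inside the loop.
# `idx` holds integers, not data; each `piece` is a temporary copy of one group.
idx <- split(seq_len(nrow(df)), df$g)
means_indexed <- vapply(idx, function(rows) {
  piece <- df[rows, , drop = FALSE]
  mean(piece$x)
}, numeric(1))

# Both approaches give the same per-group result.
stopifnot(isTRUE(all.equal(means_presplit, means_indexed)))
```

Whether the second approach actually saves memory in practice depends on the size of the groups and on when the temporary pieces are garbage collected, so it is worth measuring on real data before relying on it.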