Re: [R] Making tapply code more efficient

2009-03-09 Thread William Dunlap
re Inc - Spotfire Division wdunlap tibco.com -------------- [R] Making tapply code more efficient Doran, Harold HDoran at air.org Mon Mar 9 15:43:47 CET 2009 Previous message: [R] Making tapply code more efficient Next message: [R] rcorr.cens Goodman-Kruskal gamma Messages sorted by: [ date ] [ thr

Re: [R] Making tapply code more efficient

2009-03-09 Thread Doran, Harold
rld works beautifully. > -Original Message- > From: r-help-boun...@r-project.org > [mailto:r-help-boun...@r-project.org] On Behalf Of Doran, Harold > Sent: Monday, March 09, 2009 10:25 AM > To: ONKELINX, Thierry; jholt...@gmail.com > Cc: r-help@r-project.org > Subject: Re: [R] Maki

Re: [R] Making tapply code more efficient

2009-03-09 Thread Doran, Harold
From: ONKELINX, Thierry [mailto:thierry.onkel...@inbo.be] > Sent: Friday, February 27, 2009 10:24 AM > To: Doran, Harold; r-help@r-project.org > Subject: RE: [R] Making tapply code more efficient > > Hi Harold, > > What about this? You one have to make the crosstabulation

Re: [R] Making tapply code more efficient

2009-02-27 Thread jim holtman
On something the size of your data it took about 30 seconds to determine the number of unique teachers per student. > x <- cbind(sample(326397, 800967, TRUE), sample(20, 800967, TRUE)) > # split the data so you have the number of teachers per student > system.time(t.s <- split(x[,2], x[,1])) us

Re: [R] Making tapply code more efficient

2009-02-27 Thread ONKELINX, Thierry
data. ~ John Tukey -Oorspronkelijk bericht- Van: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org] Namens Doran, Harold Verzonden: vrijdag 27 februari 2009 15:47 Aan: r-help@r-project.org Onderwerp: [R] Making tapply code more efficient Previously, I posed the question pasted down bel

[R] Making tapply code more efficient

2009-02-27 Thread Doran, Harold
Previously, I posed the question pasted down below to the list and received some very helpful responses. While the code suggestions provided in response indeed work, they seem to only work with *very* small data sets and so I wanted to follow up and see if anyone had ideas for better efficiency. I