Hi,
Just a practical problem that I have run into here. I have a large data set
where every case (row) represents a person. Each person belongs to a
metropolitan area. I want to aggregate some of the individual results into
metro area statistics. Should be easy... but alas...
The data is sample data, so I want to apply weights. I can do this easily
enough in SPSS, using its apply weights and aggregate functions. The
complication... in addition to mean, s.d., and n, I want a median. SPSS
won't give it to me.
I thought I had the problem solved. Exported the data, imported it into
Minitab. Now, I can get any statistics I want, mean, median, s.d., n, but I
can't figure out how to apply the sample weights in Minitab. I am less
familiar with Minitab, so perhaps I am just missing something simple.
I also use R. I think R has the capacity to do this. Seems like it has
pretty flexible data manipulation options, but for such a large data set
(>1,200,000 census records) R runs into memory problems and freezes (even
with a sample of ~120,000).
Any ideas?
=================================================================
Instructions for joining and leaving this list and remarks about
the problem of INAPPROPRIATE MESSAGES are available at
http://jse.stat.ncsu.edu/
=================================================================