Hi,

Just a practical problem that I have run into here.  I have a large data set
where every case (row) represents a person.  Each person belongs to a
metropolitan area.  I want to aggregate some of the individual results into
metro area statistics.  Should be easy... but alas...

The data is sample data, so I want to apply weights.  I can do this easily
enough in SPSS, using its apply weights and aggregate functions.  The
complication... in addition to mean, s.d., and n, I want a median.  SPSS
won't give it to me.

I thought I had the problem solved.  Exported the data, imported it into
Minitab.  Now, I can get any statistics I want, mean, median, s.d., n, but I
can't figure out how to apply the sample weights in Minitab.  I am less
familiar with Minitab, so perhaps I am just missing something simple.

I also use R.  I think R has the capacity to do this.  Seems like it has
pretty flexible data manipulation options, but for such a large data set
(>1,200,000 census records) R runs into memory problems and freezes (even
with a sample of ~120,000).

Any ideas?



=================================================================
Instructions for joining and leaving this list and remarks about
the problem of INAPPROPRIATE MESSAGES are available at
                  http://jse.stat.ncsu.edu/
=================================================================

Reply via email to