Alexandre:
Sorry not to respond - it's been busy here.
You wrote:
> I tried to test this. In order to test this I tried to plot 3d the
> difference between variances function of two proportion.
> Here is my Maple formula:
> plot3d((10*x+10*y)/20*(1-(10*x+10*y)/20)*(1/10+1/10)-(x*(1-x)/10+y*(1-y)/10)
> ,x
> = 0 .. 1,y = 0 .. 1);
> where x=p1, y=p2, n1=n2=10
You are correct - I clearly either made a mistake some years ago, or
misremembered what I worked out then. Sorry - bad brain day.
Looking at it again, the inequality you present can be deduced easily
from the convexity of the function p(1-p). The pooled variance is
(essentially) the value of the function at ta + (1-t)b, the other is the
linear interpolation. And, clearly, the pooled variance is larger; as
the function is convex up, the linear interpolation is always less.
OK, so what *is* going on here? Checking a dozen or so sources, I
found that indeed both versions are used fairly frequently (BTW, I
myself use the pooled version, and the last few textbooks I've used do
so).
Then I did what I should have done years ago, and I tried a MINITAB
simulation. I saw that for (say) n1=n2=10, p1=p2=0.5, the unpooled
statistic tends to have a somewhat heavy-tailed distribution. This makes
sense: when the sample sizes are small the pooled variance estimator is
computed using a sample size for which the normal approximation works
better.
The advantage of the unpooled statistic is presumably higher power;
hoewever, in most cases, this is illusory. When p1 and p2 are close
together, you do not *get* much extra power. When they are far apart
and have moderate sample sizes you don't *need* extra power. And when
you have small samples it's not clear that you can *trust* the extra
power.
OK, that makes more sense. Sorry about the last attempt. If I'm still
being obtuse, somebody please straighten me!
-Robert Dawson
=================================================================
Instructions for joining and leaving this list and remarks about
the problem of INAPPROPRIATE MESSAGES are available at
http://jse.stat.ncsu.edu/
=================================================================