Re: 2x2 tables in epi. Why Fisher test?

2001-05-11 Thread Juha Puranen

Ronald Bloom wrote:
 
 In sci.stat.consult Elliot Cramer [EMAIL PROTECTED] wrote:
  In sci.stat.consult Ronald Bloom [EMAIL PROTECTED] wrote:
  Herman as usual is absolutely correct; the validity of the Fisher test is
  analagous to the validity of regression tests which are derived
  conditional on x but, since the distribution does not involve x, are valid
  unconditionally even if the x's are random.
 
   If I take your analogy in the direction that leads back to
 the Fisher test, I should be able to paraphrase the above as
 
 the validity of the [Fisher test] which [is] derived conditional
 on [the fixed marginals] but, since the distribution does not
 involve [the fixed marginals], [is] valid unconditionally even
 if the [marginals] are random.
 
  Please clarify what is meant by the distribution does not
 involve [the fixed marginals].  I am not clear on this:
 the Fisher test statistic (hypergeometric upper tail probability)
 certainly *does* depend on the fixed marginals in this
 case -- they appear in every term in that tail sum.


Usual the assumptions for Fishers exact test  are  not true. 
What you can fix  are the row margins, or column margins or grand total
or
Element of row i and column j. 

In these cases the exact Fisher test is biased. 

At least in Survo  (may be in some other programs too) it is possible 
make the test also in these cases.  Look at

http://www.helsinki.fi/survo/q/qu1_03.html


regards 

Juha


-- 
Juha Puranen
Department of Statistics 
P.O.Box 54 (Unioninkatu 37), 00014 University of Helsinki, Finland
http://noppa5.pc.helsinki.fi


=
Instructions for joining and leaving this list and remarks about
the problem of INAPPROPRIATE MESSAGES are available at
  http://jse.stat.ncsu.edu/
=



Re: 2x2 tables in epi. Why Fisher test?

2001-05-10 Thread Herman Rubin

In article 9deiug$l0h$[EMAIL PROTECTED],
Ronald Bloom  [EMAIL PROTECTED] wrote:

Significance tests for 2x2 tables require that the single observed
table be regarded as if it were, (under the null hypothesis of 
uniformity or independence) but a single instance drawn at  
random from a universe of replicates.  Insofar as there are at 
least three well-known distinct such sample spaces that one
might arguably propose as reasonable models of the universe
of replicates, different probability models by which the 
extremity of the observed table, under the null hypothesis,
do arise.

I can even provide more.  But from the standpoint of
classical statistics, it makes little difference.  From
the standpoint of decision theory it does, but then one
would not be doing anything like fixing a significance
level in the first place.

 This has given rise over the years to misunderstandings
between proponents of different small-sample inferential tests
of signifance for 2x2 tables.  But the disputes seem largely to
be due to the failure of the disputants to identify precisely
that particular probability setup which is correct for the
particular problem at hand.  

At least three distinct such ways of regarding a given 2x2 table can 
be distinguished:

1.) both row and column marginals regarded as fixed, and under
the null hypothesis of uniformity, the observed table is treated 
as a random sample from the finite set of permutations of all
2x2 tables satisfying that constraint.  This sample-space model
gives rise to the hypergeometric distribution for the 
probability of the observed table; thus the Fisher Exact test.

The advantage of this one is that an exact test of the
prescribed level can be produced.

2.) The two row (col)marginals are treated as independent; and the
observed table under the null hypothesis is regarded as 
being the result of two independent random samples from 
identical binomial distributions.  The significance test used
in this case is identical to the elementary test for the
difference between two sample proportions.

This is a much more complicated testing situation than you
seem to think.  Because of the nuisance parameters, it is
essentially impossible to come up with a natural test
at the precise level, especially for small samples.

3.) Only the total cell sum T is regarded as fixed.  The 
observed table, under the null hypothesis, is regarded as a 
random draw of four cell values satisfying the constraint
that their total T is specified.  This leads to a 
multinomial distribution.  

Each one of these probability setups 1-3 gives rise to a somewhat
different small-sample inferential test.  In particular, 
the schemes (1),(2),(3) give rise to distributions conditioned
on 3, 2, and 1 fixed parameters respectively.

But these parameters are unknown.  Testing with nuisance
parameters is very definitely not easy, and exact tests
are hard to come by.  Even in other types of problems,
conditional tests are often used.  In fact, in many
practical problems, the sample size itself need not be
fixed.  It is not uncommon to use the number of
observations as if it were a fixed sample size, and it is
easy to give examples where this can be shown not to do
what is wanted.

 Since, for
large cell values, the large-sample approximations to all
of these distributions (apparently?) converge to the
CHi-Squared distribution, it is only in situations with
small cell sizes that the controversy over choice of
probability model is of practical (?) import.

As long as the conditional probabilities are the same,
and one uses one of the scenarios you mentioned, the
distribution of the Fisher exact test given the marginals
is as stated.  Thus the probability that the test at a
given level rejects is precisely the stated level in all
of these cases, assuming that randomized testing is used.

If one uses a decision approach, none of this is correct,
even if the Fisher model happens to be true.
-- 
This address is for information only.  I do not claim that these views
are those of the Statistics Department or of Purdue University.
Herman Rubin, Dept. of Statistics, Purdue Univ., West Lafayette IN47907-1399
[EMAIL PROTECTED] Phone: (765)494-6054   FAX: (765)494-0558


=
Instructions for joining and leaving this list and remarks about
the problem of INAPPROPRIATE MESSAGES are available at
  http://jse.stat.ncsu.edu/
=



Re: 2x2 tables in epi. Why Fisher test?

2001-05-10 Thread Ronald Bloom

In sci.stat.edu Herman Rubin [EMAIL PROTECTED] wrote:

Each one of these probability setups 1-3 gives rise to a somewhat
different small-sample inferential test.  In particular, 
the schemes (1),(2),(3) give rise to distributions conditioned
on 3, 2, and 1 fixed parameters respectively.

 But these parameters are unknown.  Testing with nuisance
 parameters is very definitely not easy, and exact tests
 are hard to come by.  Even in other types of problems,

  I was not here referring to the unknown nuisance parameter
(namely the unknown binomial probability).  In schemes
(1), (2), (3) the 3, 2, and 1 fixed conditioning parameters
are, respectively:  (a) two row marginals and one column
marginal  (b) two independent row marginals  (c) the 
total sum of four cells.   In the conditioning arguments
which yield the signficance tests I alluded to above, 
those  3, 2, or 1 parameters are *known*.  


 As long as the conditional probabilities are the same,

   which conditional probabilities are you referring to?

 and one uses one of the scenarios you mentioned, the
 distribution of the Fisher exact test given the marginals
 is as stated.  Thus the probability that the test at a
 given level rejects is precisely the stated level in all
 of these cases, assuming that randomized testing is used.


 If one uses a decision approach, none of this is correct,
 even if the Fisher model happens to be true.


  I was only addressing the matter   of the logical relationship
between the probability model used in the significance test to  
the implied underlying sample space of 2x2 tables from which
the observed table was drawn.  It seems to me that the
choice of experimental design has some bearing on the choice
of such universe and I was wondering why the Fisher Universe
of permutations with 4 fixed marginals is chosen as the
basis for inferential tests for experimental setups in which
quite plainly only *two* marginals can be regarded as fixed 
(e.g. case-control studies,  so on).  Is there a simple
answer to this question?  (I guess there really is not...)

 -- 
 This address is for information only.  I do not claim that these views
 are those of the Statistics Department or of Purdue University.
 Herman Rubin, Dept. of Statistics, Purdue Univ., West Lafayette IN47907-1399
 [EMAIL PROTECTED] Phone: (765)494-6054   FAX: (765)494-0558


=
Instructions for joining and leaving this list and remarks about
the problem of INAPPROPRIATE MESSAGES are available at
  http://jse.stat.ncsu.edu/
=



Re: 2x2 tables in epi. Why Fisher test?

2001-05-10 Thread Rich Ulrich


 - I offer a suggestion of a reference.

On 10 May 2001 17:25:36 GMT, Ronald Bloom [EMAIL PROTECTED] wrote:

[ snip, much detail ] 
 It has become the custom, in epidemiological reports
 to use always the hypergeometric inference test --
 The Fisher Exact Test -- when treating 2x2 tables 
 arising from all manner of experimental setups -- e.g.
 
 a.) the prospective study
 b.) the cross-sectional study
 3.) the retrospective (or case-control) study
  [ ... ]

I don't know what you are reading, to conclude that this
has become the custom.   Is that a standard for some
journals, now?

I would have thought that the Logistic formulation was
what was winning out, if anything.

My stats-FAQ  has mention of the discussion published in
JRSS (Series B)  in the1980s.  Several statisticians gave 
ambivalent support to Fisher's test.  Yates argued the logic
of the exact test, and he further recommended the  X2 test
computed with his (1935) adjustment factor, as a very accurate 
estimator of Fisher's p-levels.

I suppose that people who hate naked p-levels will have to 
hate Fisher's Exact test, since that is all it gives you.

I like the conventional chisquared test for the 2x2, computed
without Yates's correction --  for pragmatic reasons.  Pragmatically,
it produces a good imitation of what you describe, a randomization
with a fixed N but not fixed margins.  That is ironic, as Yates
points out (cited above) because the test assumes fixed margins
when you derive it.

-- 
Rich Ulrich, [EMAIL PROTECTED]
http://www.pitt.edu/~wpilib/index.html


=
Instructions for joining and leaving this list and remarks about
the problem of INAPPROPRIATE MESSAGES are available at
  http://jse.stat.ncsu.edu/
=



Re: 2x2 tables in epi. Why Fisher test?

2001-05-10 Thread Elliot Cramer

In sci.stat.consult Ronald Bloom [EMAIL PROTECTED] wrote:
Herman as usual is absolutely correct; the validity of the Fisher test is
analagous to the validity of regression tests which are derived
conditional on x but, since the distribution does not involve x, are valid
unconditionally even if the x's are random.


Incidentally, if one randomizes to get an exact p value, the Fisher test
is uniformly most powerful. Herman can tell us if this is for all three
cases.


=
Instructions for joining and leaving this list and remarks about
the problem of INAPPROPRIATE MESSAGES are available at
  http://jse.stat.ncsu.edu/
=



Re: 2x2 tables in epi. Why Fisher test?

2001-05-10 Thread David Duffy

In sci.stat.edu Ronald Bloom [EMAIL PROTECTED] wrote:

 It has become the custom, in epidemiological reports
 to use always the hypergeometric inference test --
 The Fisher Exact Test -- when treating 2x2 tables 
 arising from all manner of experimental setups -- e.g.

Only for tables with small cell sizes (and for combination of multiple
such tables), and only because software is freely available.  I would
have thought it is more likely to be seen used for large sparse 2xK tables,
eg HLA literature. Its shortcomings (conservative under the other setups)
are also well known (I hope!).

David Duffy.


=
Instructions for joining and leaving this list and remarks about
the problem of INAPPROPRIATE MESSAGES are available at
  http://jse.stat.ncsu.edu/
=