---------- Forwarded message ----------
From: Douglas Bonett <[email protected]>
Date: Mon, May 5, 2014 at 10:31 PM
Subject: Re: asymptotic standard error of lambda
To: John Darrington <[email protected]>


For lambda C|R, its variance can be expressed as


(N – A)(A + B – 2C)/(N – B)^3


where N is the total sample size, B is the largest column total, A is the
sum across rows of the largest frequencies within each row. The C term is
the hardest to explain in words – it is the summation of the largest
frequency in each row for *only* those rows where the largest row frequency
is in the same column as the largest column total.  It is easier to show it
in an example.


Here is a 2x3 table from Bishop, Fienberg & Holland (page 388):



       c1     c2       c3

r1    *225*     53      206

r2      3      1       *12*

      *228 *    54      218



A = 225 + 12 = 237

B = 228

C = 225 (since 225 is the only row maximum that occurs in the first column)

VAR[lambda(R|C)] = (500 – 237)(237 + 228 – 2*225)/(500 – 228)^3 = .000196

ASE1 = sqrt(.000196) = .014

SPSS also gives ASE1 = .014




On Mon, May 5, 2014 at 1:24 PM, John Darrington <
[email protected]> wrote:

> On Mon, May 05, 2014 at 11:27:40AM -0700, Ben Pfaff wrote:
>      On Mon, May 5, 2014 at 10:51 AM, John Darrington
>      <[email protected]> wrote:
>      > On Mon, May 05, 2014 at 08:09:16AM -0700, Ben Pfaff wrote:
>      >      I'm sure there is an error in our implementation.  NaN is
> coming from
>      >      the square root of a negative number, as you said.
>      >
>      >      I made another mistake below.  PSPP actually calculates ASE0
> correctly
>      >      for asymmetric lambda (lambda divided by ASE0 is what's
> displayed as
>      >      "Approx. T", which matches that calculated by SPSS for
> asymmetric
>      >      lambda).  It's ASE1, displayed as "Asymp. Std. Error", that
> PSPP gets
>      >      wrong.
>      >
>      > Ahh. I was calculating ASE0.
>      >
>      > ASE1 like you say seems wierd and results in an imaginary number.
>  I can only imagine
>      > that this is a mistake in the SPSS documentation.  Unfortunately I
> haven't been able
>      > to find any other references on how to calculate this value.
>      >
>      > Another issue: if we have T, we should be able to calculate the
> significance.  We just
>      > need to know the degrees of freedom.  I wonder how these are
> calculated?
>      >
>      > Unfortunately the litereature on these values seems to be scarce.
>
>      https://v8doc.sas.com/sashtml/stat/chap28/sect20.htm has a different
> formula,
>      but I don't understand how to interpret r_i|l_i = l.
>
> The text below it says:
>  Also, let li be the unique value of j such that ri=nij, and let l be the
> unique value of j such that r = n·j.
>
> I interpret this to mean that r_i is summed for all i where the condition
> l_i == l is true.
>
>
>
> --
> PGP Public key ID: 1024D/2DE827B3
> fingerprint = 8797 A26D 0854 2EAB 0285  A290 8A67 719C 2DE8 27B3
> See http://sks-keyservers.net or any PGP keyserver for public key.
>
>
> _______________________________________________
> pspp-dev mailing list
> [email protected]
> https://lists.gnu.org/mailman/listinfo/pspp-dev
>
>
_______________________________________________
pspp-dev mailing list
[email protected]
https://lists.gnu.org/mailman/listinfo/pspp-dev

Reply via email to