Rich,

Thanks very much for your insightful criticism.

You are absolutely correct if the arc from Coffee_Triggers_Headache 
is interpreted causally, which is the natural interpretation given 
that I was talking about a causal graph.  So let me apologize for 
creating confusion.  However, there is a hidden factor causal model 
that makes the causal claims I described.

Make Hidden_Factor cause John_Drinks_Coffee with probability (almost) 
100%.  Also make Hidden_Factor cause Hidden_Trigger with probability 
equal to the noisy-or trigger probability for coffee.  Then make 
Coffee_Triggers_Headache have probability 100% if John_Drinks_Coffee 
and Hidden_Trigger are both true, and 0% otherwise.  Then make 
Other_Triggers_Headache have probability:
   - 100% if Hidden_Trigger and Other are both true;
   - the noisy-or trigger probability for other if Mystery_Trigger
     is false and Other is true; and
   - 0% otherwise.
Then set the prior probability of Mystery_Factor equal to the prior 
probability that John drinks coffee.

This model is equivalent to the Noisy-Or and makes the causal claims 
I described in my previous email.  If we turn off John's coffee 
drinking and the headache, but set Hidden_Factor to its conditional 
probability given Headache and John_Drinks_Coffee, the probability of 
Headache is much higher than the counterfactual probability for the 
standard Noisy-or.

You are also right that the example I gave does not contradict 
Charles' claim.  I didn't intend to argue against his claim.  My 
intent was simply to note that causal claims are stronger than claims 
about conditional probabilities on observed random variables. 
Sometimes we can verify causal claims by experimental manipulation, 
but even so, our analysis always depends on auxiliary assumptions 
about the causal mechanism.  We never can know what would have 
happened if events had unfolded differently from how they unfolded.

Sorry for the confusion.

Kathy

>Kathy,
>
>It does not seem that you are saying what I think you intend to say. 
>Consider this sentence:
>
>   For example, we could draw an arc from
>>Coffee_Triggers_Headache to Other_Triggers_Headache, and change the
>>probability to 100% that Other_Triggers_Headache is true when
>>Coffee_Triggers_Headache is true, but leave everything else as it is.
>>Check it out.  This doesn't change the joint distribution on (Coffee,
>>Other_causes, Headache).  But it makes different counterfactual
>>claims.  The claim this alternate model makes is that if John drinks
>>coffee, and there is also some other cause of a headache present, and
>>the coffee triggers a headache, then so does the other cause, so that
>>if you removed the coffee, there would still be a headache.
>
>If you removed the coffee, coffee would no longer trigger the 
>headache (even if its inhibitor was not present), which means 
>coffee-trigger-headache would no longer causes 
>other-triggers-headache. So why would there still be a headache?
>
>
>Now in what follows you talk about a common cause of drink-coffee 
>and other-trigger. This model is consistent with your claim that the 
>headache would still be there if we removed coffee. However, it is 
>not the model you describe above.
>
>>This
>>might occur, for example, if whatever factor causes John to
>>experience a desire to drink coffee also causes these other factors
>>to trigger a headache.  Does this seem farfetched?  It is not ruled
>>out by the observables!  In fact, it makes EXACTLY the same
>>statistical claims about observables that the other model does.
>Can you clarify,
>
>Also, although all this in interesting, it does not seem to 
>invalidate Charle's argument, which only concerns the observed 
>probability distributions.
>
>
>Rich


Reply via email to