Re: [ccp4bb] Published derivation of mFo-DFc formula?

Randy Read Fri, 09 Jan 2009 04:26:47 -0800

Hi Ian,

Indeed, I didn't have time to reply during the festive season.

Just to give appropriate credit, the 2mFo-DFc coefficients foracentrics in SIGMAA were derived by analogy to arguments made by PeterMain, but taking account of the effect of errors in the partialmodel. According to that argument, one doesn't expect model bias inthe mFo coefficients for centrics, which is why they have differentcoefficients.

In the original version of SIGMAA, the difference map coefficientswere mFo-Fc, not mFo-DFc. At that time, I was thinking that mFo givesthe best (lowest rms error, though biased) representation of the truedensity, while Fc represents the model so the difference between themshould show most clearly how you need to change the model. A year ortwo later, I decided that it was more appropriate to smear out themodel density according to its uncertainty, hence the addition of theD factor. There were two more advantages to the D factor. First, ifthe model is complete garbage, both m and D are zero so the differencemap is flat. (If you don't know anything about the true phases, youcan't make a map showing you how to change it!) Second, it turns outthat if Fo is not on the same scale as Fc, the D term absorbs thenecessary scale factor correction. (Before I added the D term,Eleanor Dodson had written a comment in the source code of the CCP4version of SIGMAA, saying something along the lines "Randy seems to beassuming that the data are

on absolute scale.  What on earth is he thinking?")

These arguments for the difference map coefficients are based simplyon intuition about what makes sense to show how the model shouldchange. So I was very pleased, when we were working on refinementlikelihood targets, to see that they can also be justified in terms oflog-likelihood-gradient maps. If you take the derivative of the log-likelihood functions with respect to Fc, you get


f(|s|)(mFo-DFc)

where f(|s|) is a resolution-dependent function given by

2D/sigma-delta^2 for acentrics, and
 D/sigma-delta^2 for centrics,

where sigma-delta^2 is the variance from the Rice function.

The D/sigma-delta^2 part would give a map that is the convolution of amap computed with the coefficients you suggest (i.e. 2mFo-2DFc foracentrics, mFo-DFc for centrics) and some shape function, which mightsharpen the map if D/sigma-delta^2 increases with resolution or smearit out, if it decreases with resolution.

Anyway, at the least the factor of two for acentrics should beincluded in the various programs that compute difference mapcoefficients. Someone should probably look at the effect of theconvolution with the resolution-dependent part. You will getdifferent results if you consider the effect of coordinate error to bepart of the model (e.g. D is taken up in the model partly byincreasing B-factors -- if you take the derivative with respect toDFc, the factor D is missing from f(|s|)) or if you work in terms of E-values. It's possible that the LLG map computed when the likelihoodis expressed in terms of E-values would be optimal in terms of beingsharpened as much as can be justified by the level of phase error atdifferent resolutions.


Thanks for stimulating the discussion about this point!

Regards,

Randy Read

On 8 Jan 2009, at 11:43, Ian Tickle wrote:

All - I didn't get a single response to my posting last week
(https://www.jiscmail.ac.uk/cgi-bin/webadmin?A2=ind0812&L=CCP4BB&T=0&O=D
&X=512817322E87355F7F&Y=i.tickle%40astex-therapeutics.com&P=266420)
concerning the formulae that are widely used for the 'minimally-biased'
Fourier and difference Fourier coefficients.  It probably didn't help
that I posted it in the middle of the festive season! - but still
somewhat surprising since I imagine everyone here is involved withmapsat one time or another, and has an interest in getting the densitythat
shows best what if any further modifications need to be made to the
current model.  Anyway now that people have hopefully returned to work
from the rigours of the CCP4 Study Weekend I thought I'd post it again
and see if I can provoke some discussion this time. I won't postall my
calculations again, just a summary of my conclusions.
First, I think I can now prove my conjecture that the optimaldifference
Fourier coefficient dF is given for both acentrics and centrics by:

        dF = Fm - DFc

where Fm is the 'minimally-biased' Fourier coefficient derived by Read
(AC 1986,A42,140):

        Fm(acen) = 2mFo - DFc
        Fm(cen)  =  mFo
I'm satisfied now that my alternative conjecture, that dF = Fm - Fc,is
probably wrong.  Also I can see that there might be an argument to put
DFc in the FC (FC_ALL) column in place of Fc as appears to becurrently
done by REFMAC, but not by SIGMAA (but I'd still like to see some
discussion of that).

So here's a summary comparison of theory with what is my understanding
is actually implemented in software, and with the inconsistencies
highlighted (>...<):

        Source       Coefficient   Acentrics        Centrics
        ======       ===========   =========        ========

        THEORY(Read)     Fm        2mFo - DFc       mFo
          ..  (me)       dF        2(mFo-DFc)       mFo - DFc

        SIGMAA           Fm        2mFo - DFc       mFo
                         dF       > mFo - DFc <     mFo - DFc
                      Fc            Fc              Fc

        REFMAC           Fm        2mFo - DFc    > 2mFo - DFc <
                         dF       > mFo - DFc <     mFo - DFc
                      Fc         > DFc <         > DFc <

Even if you don't accept my suggestion for the acentric dF coefficient
there are clearly some significant inconsistencies between the
coefficients output by SIGMAA & REFMAC which it would be nice to
resolve!

Cheers

-- Ian


Disclaimer
This communication is confidential and may contain privilegedinformation intended solely for the named addressee(s). It may notbe used or disclosed except for the purpose for which it has beensent. If you are not the intended recipient you must not review,use, disclose, copy, distribute or take any action in reliance uponit. If you have received this communication in error, please notifyAstex Therapeutics Ltd by emailing i.tic...@astex-therapeutics.comand destroy all copies of the message and any attached documents.Astex Therapeutics Ltd monitors, controls and protects all itsmessaging traffic in compliance with its corporate email policy. TheCompany accepts no liability or responsibility for any onwardtransmission or use of emails and attachments having left the AstexTherapeutics domain. Unless expressly stated, opinions in thismessage are those of the individual sender and not of AstexTherapeutics Ltd. The recipient should check this email and anyattachments for the presence of computer viruses. Astex TherapeuticsLtd accepts no liability for damage caused by any virus transmittedby this email. E-mail is susceptible to data corruption,interception, unauthorized amendment, and tampering, AstexTherapeutics Ltd only send and receive e-mails on the basis that theCompany is not liable for any such alteration or any consequencesthereof.Astex Therapeutics Ltd., Registered in England at 436 CambridgeScience Park, Cambridge CB4 0QA under number 3751674


------
Randy J. Read
Department of Haematology, University of Cambridge
Cambridge Institute for Medical Research      Tel: + 44 1223 336500
Wellcome Trust/MRC Building                   Fax: + 44 1223 336827
Hills Road                                    E-mail: rj...@cam.ac.uk

Cambridge CB2 0XY, U.K. www-structmed.cimr.cam.ac.uk

Re: [ccp4bb] Published derivation of mFo-DFc formula?

Reply via email to