Re: [ccp4bb]: maximum likelihood question

George M. Sheldrick Mon, 05 Sep 2005 00:05:22 -0700

***  For details on how to be removed from this list visit the  ***
***          CCP4 home page http://www.ccp4.ac.uk         ***

I am now completely confused by the ML vs. LS discussion and hope that Iam not the only person with this problem!

To clarify the issue, here is a simple clear-cut example of considerableinterest to small-molecule crystallographers: what is the best way todetermine the 'absolute configuration' using weak anomalous scattering(e.g. for an organic compound containing only C, H, N and O).

The conventional wisdom is that a so-called Flack parameter x is refinedas one of the LEAST-SQUARES parameters, where Iobs(hkl) is fitted byIcalc(hkl) + x.Icalc(-h-k-l) with weights based on sigma(Iobs). Complexscattering factors are employed to calculate the intensity Icalc. Thisis similar to the way twins are refined.

When x is 0 (with a 'small' esd from the full-matrix LS) the absoluteconfiguration is correct, when it is 1 (with a 'small' esd) it is wrong.If there are no errors in the model then only these two values of x arepossible (racemic twinning, which would permit x to take any value inthe range 0 to 1, is rare but not unknown; however in some case it canbe ruled out by chemical evidence). Clearly the estimated esd of x isjust as important as the actual value of x.

The following 'ML' approach was proposed in discussions at the ComputingSchool in Siena (contributions from Rob Hooft, Simon Parsons and DavidWatkin are acknowledged, but I'm sure that there were others). We definethe quantity Qobs = [Iobs(hkl)-Iobs(-h-k-l)]/[Iobs(hkl)+Iobs(-h-k-l)]and a corresponding Qcalc. This should cancel some systematic errors.The esd of Qobs can be calculated by standard methods from the knownesds of Iobs(hkl) and Iobs(-h-k-l). Then we estimate the log likelihood(LL) of a particular value of x by -0.5[(Qobs-Qcalc)/esd(Qobs)]^2 summedover all reflections.

So my question is: should I estimate the value of x and its esd byintegrating over -infinity < x < +infinity (in practice this range canbe truncated to say -5 to +5) or should I integrate over 0 =< x =< 1 orshould I use only the values of the LL at x = 0 and x = 1? Preliminarytests suggest that these methods give appreciably lower esds for x thanthe LS approach. If I only consider the LL for x=0 and x=1 I have asimple way of calculating the probability that x is 0 rather than 1(i.e. the probability that the absolute configuration of the model iscorrect). Note that because of correlations between x and the atomicpositional parameters in polar space groups it might be necessary torefine the structure to convergence for x=0 and x=1 to get the Icalcvalues for this.

I would appreciate clarification from the ML experts, with a view toputting it into my programs (which are still used by some small moleculecrystallographers).


George

Re: [ccp4bb]: maximum likelihood question

Reply via email to