Dear Tim,

Interesting discussion, and I agree with your last description of the issues.  
When I started playing with this for oxymyoglobin [J. Mol. Biol. 142, 531-544 
(1980)]
it seemed immediately apparent (i.e. by thinking about it before starting to 
write programs) that the simple Babinet approach was deficient since the 
solvent was not just the inverse of the protein.  This is Tim's point about not 
using Fc in the Babinet method because protein density is not flat (but you 
could use a flat protein model Fm).  As pointed out in this discussion, there 
is not much difference between the two at 20A resolution anyway, but what did 
become apparent when I started doing actual calculations was that the solvent 
effect was noticeable even at moderate resolutions.  It seemed to me then that 
the obvious was to go was a flat solvent to generate Fs.  The issues then were 
where to put the protein-solvent boundary, and how sharp to make it (i.e. a B 
factor).

I also noticed that when the mask was made, small cavities in the protein would 
generate bits of "solvent" in the interior of the protein that should not be 
there (another issue alluded to in the discussion).  In those days I was 
looking at maps/masks plotted on paper rather than graphics, and I solved the 
cavity problem manually by adding atoms in the cavities in the pdb file I used 
to make the mask until I couldn't see any more cavities left in a plot.  This 
is not, of course, an automatic procedure, and that would need a bit of 
thought.  The determination of the border width between protein and solvent, 
and the B factor, were just optimised by trial and error, running several 
values, plotting R factors and fitting a function to them (parabola I think) to 
help find a minimum.  Again the issue for automating this requires finding 
suitable parameters with derivatives.

Having done all this in a simple-minded way, I was very impressed by the effect 
on the refinement, and on of the figures in the JMB paper shows how dramatic it 
was and how far up the resolution range the effect was felt.  At this stage I 
should have programmed it properly, but the oxymyglobin structure was done, I 
had to move jobs and projects and good intentions fell by the wayside (mea 
culpa).  Luckily more public spirited people picked up some of the ideas and 
improved them, but the protein cavity issue is still there it seems.

Obviously I would like to add my vote for a proper flat (or nearly?) solvent 
mask model as being the right apporach.

Simon



---------------------------------------------------------------------------
| Simon E.V. Phillips                                                     |
---------------------------------------------------------------------------
| Director, Research Complex at Harwell (RCaH)                            |
| Rutherford Appleton Laboratory                                          |
| Harwell Science and Innovation Campus                                   |
| Didcot                                                                  |
| Oxon OX11 0FA                                                           |
| United Kingdom                                                          |
| Email: simon.phill...@rc-harwell.ac.uk                                  |
| Tel:   +44 (0)1235 567701                                               |
|        +44 (0)1235 567700 (sec)                                         |
|        +44 (0)7884 436011 (mobile)                                      |
| www.rc-harwell.ac.uk                                                    |
---------------------------------------------------------------------------
| Astbury Centre for Structural Molecular Biology                         |
| Institute of Molecular and Cellular Biology                             |
| University of LEEDS                                                     |
| LEEDS LS2 9JT                                                           |
| United Kingdom                                                          |
| Email: s.e.v.phill...@leeds.ac.uk                                       |
| Tel:   +44 (0)113 343 3027                                              |
| WWW:   http://www.astbury.leeds.ac.uk/People/staffpage.php?StaffID=SEVP |
---------------------------------------------------------------------------
-----Original Message-----
From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of Tim Fenn
Sent: 23 October 2010 21:14
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Babinet solvent correction [WAS: R-free flag problem]

On Sat, 23 Oct 2010 10:05:15 -0700
Pavel Afonine <pafon...@gmail.com> wrote:

> Hi Tim,
>
>  ...but I hope this answers the question:
> Babinet's vs. the flat model?  Use them together!  ;)
>
>
> thanks a lot for your reply.
>
> Could you please explain the *****physical***** meaning of using both
> models together?

I can try!  Typically, we model the bulk solvent using a real space mask that 
is set to 1 in the bulk solvent region and 0 in the protein.
This gets Fourier transformed, symmetrized and added in to the scattering 
factors from the molecule (Equation 1 in the paper, page 6 in your 
presentation):

Ftot = Fc + ks*Fs*exp(-Bs*s^2/4)

which works great and is how things are usually coded in most macromolecular 
software, no problems or arguments there.  However, we can come from the 
opposite - but equivalent! - direction of Babinet's principle, which tells us 
the bulk solvent can also be modeled by inverting everything: set the bulk 
solvent region to 0 and the protein region to 1 in the real space mask, apply a 
Fourier transform to that and then invert the phase:

Ftot = Fc - ks*Fm*exp(-Bs*s^2/4)

(I'm using Fm to distinguish it from Fs, due to the inversion of 0's and 1's in 
the real space mask)  This is equation 2 in the paper.

So we're still using the flat model to compute Fm, and we're using Babinet's 
principle to add it in to the structure factors - although its better described 
as adding the inverse (thus the minus sign in the second equation) of the 
complement (Fm rather than Fs). These two equations are exactly equivalent, 
without any loss of generality. So, I would argue the flat model and Babinet's 
are very much congruous.  Also take a look at the description/discussion in the 
paper regarding Figure
2 (which helped me think about things at first).

The big difference is that Babinet's is usually applied as:

Ftot = Fc - ks*Fc*exp(-Bs*s^2/4)

which, I would argue, isn't quite right - the bulk solvent doesn't scatter like 
protein, but it does get the shape right.  Which I think is why Fokine and 
Urzhumtsev point out that at high resolution this form would start to show 
disagreement with the data.  I haven't looked at this explicitly though, so we 
still haven't answered that question!
We didn't want to spend much time on it in the paper, our main goal was to try 
out the differentiable models we describe.  The Babinet trick was a convenient 
way to make coding easier.

Anyway, I hope this helps explain it a bit more, and again: sorry for the 
long-windedness.

Regards,
Tim

--
---------------------------------------------------------

        Tim Fenn
        f...@stanford.edu
        Stanford University, School of Medicine
        James H. Clark Center
        318 Campus Drive, Room E300
        Stanford, CA  94305-5432
        Phone:  (650) 736-1714
        FAX:  (650) 736-1961

---------------------------------------------------------

Reply via email to