Re: [ccp4bb] First images of proteins and viruses caught with an X-ray laser

James Holton Fri, 11 Feb 2011 08:35:49 -0800

The indexing ambiguities do not include anomalous pair confusion because there is no way to rotate the lattice to make every h,k,l overlap with -h,-k,-l. I.E. you can't rotate your left hand to superimpose it on your right. The only way to mix those up is to change the sign of some detector geometry parameter (I.E. looking in a mirror).
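
A minimal sketch of the handedness argument, assuming re-indexing operations are written as 3x3 integer matrices acting on (h,k,l). The k,h,-l operator is the one discussed later in this message; the names det3, apply_op, TWIN_OP and INVERSION are hypothetical and chosen only for illustration. For signed permutation matrices like these, a determinant of +1 marks a proper rotation, while the inversion that sends h,k,l to -h,-k,-l has determinant -1 and so cannot be reached by rotating the lattice.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def apply_op(m, hkl):
    """Apply a 3x3 re-indexing matrix to a Miller index triple."""
    return tuple(sum(m[r][c] * hkl[c] for c in range(3)) for r in range(3))


# (h,k,l) -> (k,h,-l): the indexing ambiguity discussed in the text.
TWIN_OP = [[0, 1, 0], [1, 0, 0], [0, 0, -1]]

# (h,k,l) -> (-h,-k,-l): swaps the two members of an anomalous pair.
INVERSION = [[-1, 0, 0], [0, -1, 0], [0, 0, -1]]

if __name__ == "__main__":
    print("twin op on (1,2,3):", apply_op(TWIN_OP, (1, 2, 3)))
    print("det(twin op)   =", det3(TWIN_OP))    # +1: a proper rotation
    print("det(inversion) =", det3(INVERSION))  # -1: needs a mirror
```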

That said, anomalous differences tend to be very weak and noisy in all but the most exotic cases of macromolecular diffraction. Twinning makes this worse because you are (to a first-order approximation) averaging DANO(h,k,l) with DANO(k,h,-l) and the result will tend to be closer to zero than either one taken individually. However, the biggest source of error in LCLS datasets at the moment is partiality. Basically, you only get one shot per crystal, you can't rotate it appreciably in the 70 fs exposure time, the beam is a laser so there is essentially no divergence or dispersion, and the crystals are so small as to be one mosaic domain each, so there is no "mosaic spread". The "3D profile" of the spots is therefore dominated by the finite size of the crystal itself (Scherrer broadening). We were actually worried for a while that we wouldn't see any spots at all at LCLS!
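
A toy illustration of why averaging DANO(h,k,l) with DANO(k,h,-l) pulls the result toward zero. The assumptions are loud ones: the two anomalous differences are modelled as independent, zero-mean Gaussian values, and the twin fraction is taken as 0.5. Neither comes from the actual data, and simulate_twinned_dano is a hypothetical helper.

```python
import random


def rms(values):
    """Root-mean-square of a list of numbers."""
    return (sum(v * v for v in values) / len(values)) ** 0.5


def simulate_twinned_dano(n_pairs=10000, sigma=1.0, twin_fraction=0.5, seed=0):
    """Return (rms of untwinned DANO, rms of twin-averaged DANO).

    Assumes DANO(h,k,l) and DANO(k,h,-l) are independent Gaussians,
    which is an idealisation made only for this sketch.
    """
    rng = random.Random(seed)
    dano_a = [rng.gauss(0.0, sigma) for _ in range(n_pairs)]
    dano_b = [rng.gauss(0.0, sigma) for _ in range(n_pairs)]
    mixed = [(1.0 - twin_fraction) * a + twin_fraction * b
             for a, b in zip(dano_a, dano_b)]
    return rms(dano_a), rms(mixed)


if __name__ == "__main__":
    single, twinned = simulate_twinned_dano()
    print(f"rms DANO untwinned: {single:.3f}")
    print(f"rms DANO twinned:   {twinned:.3f}")
```

Under these assumptions the twinned signal shrinks by roughly a factor of 1/sqrt(2); real anomalous differences are not independent Gaussians, so this is only a rough guide.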

So, everything is a partial, and we currently don't have postrefinement software that can model the shape of each crystal and give us a partiality. At least, not in a reasonable amount of time. If we spent 30 s on each of the 3 million images, we would still be processing them for a few more years. So, for the first run, it was decided to just average out the partiality errors. For example, unknown partiality means that each spot is measured with 100% error (at best), but if you have 700 of them, then the expected error of the average is ~3%. John Spence called this a "Monte Carlo integration", and it turned out to be a really good idea. We measured the error of the average by splitting the images into two heaps and comparing the merged datasets that resulted from each heap. I proposed calling this "R-internal" for internal agreement, since a traditional Rmerge does not really apply. However, I admit that for the PDB deposition I entered R-internal as "Rmerge". Technically, R-internal is exactly what an Rmerge used to be: the R-factor between data from different crystals.
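
The arithmetic above can be sketched as follows. The 1/sqrt(N) scaling assumes independent errors, which gives a few percent for N = 700, and the processing-time estimate simply multiplies out the figures quoted. The split-half R-factor is only one plausible reading of "R-internal": the exact formula is not given in this message, so the expression sum|I1 - I2| / sum((I1 + I2)/2) is an assumption, as is splitting the observations of each reflection at random rather than splitting whole images into two heaps.

```python
import math
import random


def expected_relative_error(per_spot_error, n_obs):
    """Error of the mean of n_obs independent measurements."""
    return per_spot_error / math.sqrt(n_obs)


def processing_years(n_images, seconds_per_image):
    """Serial processing time in years."""
    return n_images * seconds_per_image / (365.25 * 24 * 3600)


def split_half_r(observations, seed=0):
    """Assumed 'R-internal': agreement between two random half-datasets.

    observations maps an hkl tuple to a list of partial intensities.
    """
    rng = random.Random(seed)
    numerator = 0.0
    denominator = 0.0
    for values in observations.values():
        shuffled = list(values)
        rng.shuffle(shuffled)
        half = len(shuffled) // 2
        if half == 0:
            continue  # need at least one observation in each heap
        i1 = sum(shuffled[:half]) / half
        i2 = sum(shuffled[half:]) / (len(shuffled) - half)
        numerator += abs(i1 - i2)
        denominator += 0.5 * (i1 + i2)
    return numerator / denominator


if __name__ == "__main__":
    print(f"error of the mean, N=700: {expected_relative_error(1.0, 700):.1%}")
    print(f"3 million images at 30 s: {processing_years(3_000_000, 30):.2f} years")
```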

Personally, I think "the way" to crack this "twin problem" is to scale all the data and look at the partial intensity histograms for each spot. In situations where the "true" values of h,k,l and k,h,-l have radically different intensities, there will be a bimodal distribution, and that will allow us to re-index the ~700 images that contained a spot from one of those two hkls. Which group to flip (the bright ones or the dim ones) is an interesting question, but probably the dim ones, since they are the least consistent with the average intensity. Might need to try both. After re-merging and re-scaling, there will be another hkl with the strongest bimodal distribution, and then you iterate. That's the idea anyway.
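
One step of this idea might look like the sketch below. It is not an implementation that exists anywhere in this thread: the one-dimensional two-means split, the bimodality score, and all function names are assumptions made for illustration, and with ~100% partiality noise a real histogram would be far messier than the toy data suggests. A full procedure would also apply the k,h,-l re-indexing to every spot on each flagged image, then re-merge, re-scale and repeat.

```python
from statistics import mean, pstdev


def two_means_split(values, iterations=50):
    """1D two-cluster k-means. Returns (threshold, low_mean, high_mean)."""
    lo, hi = min(values), max(values)
    for _ in range(iterations):
        if lo == hi:
            break
        threshold = 0.5 * (lo + hi)
        new_lo = mean(v for v in values if v <= threshold)
        new_hi = mean(v for v in values if v > threshold)
        if new_lo == lo and new_hi == hi:
            break  # converged
        lo, hi = new_lo, new_hi
    return 0.5 * (lo + hi), lo, hi


def bimodality_score(values):
    """Cluster separation divided by summed within-cluster spread."""
    threshold, lo, hi = two_means_split(values)
    if lo == hi:
        return 0.0
    low = [v for v in values if v <= threshold]
    high = [v for v in values if v > threshold]
    spread = pstdev(low) + pstdev(high)
    return (hi - lo) / spread if spread > 0 else float("inf")


def images_to_flip(observations):
    """observations: list of (image_id, intensity) for one hkl.

    Returns the image ids in the dim cluster, the first candidates
    for re-indexing suggested in the text.
    """
    threshold, _, _ = two_means_split([i for _, i in observations])
    return [image_id for image_id, i in observations if i <= threshold]


if __name__ == "__main__":
    obs = [("img1", 9.0), ("img2", 100.0), ("img3", 10.0), ("img4", 95.0),
           ("img5", 11.0), ("img6", 105.0), ("img7", 10.0), ("img8", 100.0)]
    print("score:", round(bimodality_score([i for _, i in obs]), 1))
    print("flip candidates:", images_to_flip(obs))
```

Ranking reflections by such a score and starting with the highest would match the "strongest bimodal distribution" ordering described above; any cutoff on the score would have to be tuned against real data.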


-James Holton
MAD Scientist

On 2/10/2011 6:32 AM, Jacob Keller wrote:

Would it be true that the anomalous differences could not be measured
in these types of datasets, because one would not know which
Friedel/Bijvoet reflection one is measuring in a given frame? Perhaps,
given anomalous signal, there would be a way to tease out which
orientation one was looking at from the correlations of the
signs/magnitudes of anomalous-scattering-induced deviations from the
mean intensities (derived from the whole dataset) for all of the
reflections observed in each frame? I guess this might also detwin the
data?

JPK

On Thu, Feb 10, 2011 at 7:17 AM, Anastassis Perrakis <a.perra...@nki.nl> wrote:

Anyway, I thought that was a cool idea, but like so many other cool
things, it had to be cut from the Nature paper.  Admittedly, the problem has
not actually been solved yet.  This is why we used REFMAC in TWIN mode.

Is that a hint on the:

a. wisdom of the editor
b. wisdom of 'the third referee'
c. wisdom of the dogma 'five years of eight eight lifes in 2000 words'
d. All of the above

;-)

A.
