Dear Enrico, Frank and colleagues,

     I am glad to have suggested that everyone's views on this issue should
be aired out on this BB rather than sent off-list to an IUCr committee
member: this is much more interactive and thought-provoking. 

     There would seem to be clear biases in some of the positions - for
instance, the statement that we overvalue individual structures and that
there is value only in their ensemble has to be seen to be coming from
someone in a structural genomics centre ;-) . However, as Wladek pointed
out, when an investigator's project is crucially dependent on a result
embodied in a deposited structure, it would be of the greatest value to that
investigator to be able to double-check how reliable some features of that
structure (especially its ligands) actually are.

     On the other hand Enrico, as a specialist of crystallisation and
modelling, sees value only in improving those contributors to the task of
structure determination. This is forgetting (1) an essential capability of
crystallography: that, through experimental phasing, it can show you what a
protein looks like even if you have never seen nor modelled one before,
through the wondrous process of producing model-free electron-density maps;
and (2) an essential aspect of the task of structure determination: that it
doesn't aim at producing a model with perfect geometry, but one that best
explains the measured data and neither under- nor over-interprets them (I
realise, though, that Enrico's statement "Data just introduces experimental
errors into what would otherwise be a perfect structure" is likely to be
tongue-in-cheek ...). 

     When it comes to making explicit the advantages of archiving at least
the raw images that yielded the data against which a deposited PDB entry was
refined, many good reasons have been given, but I feel that 

     (1) there is an over-emphasis on the preservation of diffuse scattering
that has a tendency to give this archiving a nuance of "blue-skies" research
and thus to detract from its practical urgency; time will come for diffuse
scattering to be fully appreciated, but at the moment its mention acts as a
bit of a distraction, if not a turn-off in this context for people who not
not love it already;

     (2) as far as I see it, the highest future benefit of having archived
raw images will result from being able to reprocess datasets from samples
containing multiple lattices ("non-merohedral twinning"). Numerous
structures are determined and refined against data obtained by integrating
only the spots from the major lattice, without rejecting those that are
corrupted by overlap by a spot from a minor lattice. This leads to
systematic errors in these data that may only be incompletely taken out by
outlier rejection at the merging stage, and will create noise or confusing
residual features in difference maps, if not false features in the main map
and therefore its interpretation by the model. In my opinion it will be the
development of methods for dealing with overlapped lattices and for the
proper treatment of such data in scaling and refinement (as is already
possible with small molecules) that will bring about the major possibility
of substantially improving deposited results by reprocessing the raw images
co-deposited with them;

     (3) there is also the more immediate possibility of better removing ice
rings, or ligand powder rings, from images, than by having to throw away
certain thin shells of merged data in the structure factor file.

     I see the case for raw image deposition as absolutely compelling,
especially in view of the auto-catalytic process through which their
availability will speed up the development of precisely the new methods and
software to extract better data from them and better refine models against
them. The impact of structure factor deposition on the development of better
refinement programs is there to prove that this paradigm of a chain reaction
makes total sense.

     Various arguments tend to be fired off as decoys - "get better
crystals", why not "get a better post-doc"? - but they are unhelpful in the
way they prolong procrastination when what we need is to bite the bullet.
The IUCr Forum that John Helliwell pointed at already contains draft plans
for a pilot run of a reasonable scheme.


     With best wishes,
     
          Gerard.

--
On Tue, Oct 18, 2011 at 06:19:27PM +0200, Enrico Stura wrote:
> Dear Peter,
>
> How many crystallographers does it take to transform bad data into good 
> data?
> None, you need a modeller. Only a modeller can give you a structure with 
> perfect
> geometry. Data just introduces experimental errors into what would 
> otherwise be a perfect
> structure.
>
> If you have good data do you need crystallographers?
> ...
>
> Of course there all the cases in between. That ... you are right, is the 
> other half of the story.
>
> From a biological point of view, only borderline cases make "cents" ($+€) 
> to store.
> The experimenter in consultation with a beamline scientist at an SR 
> facility is the best
> small commitee suitable to evaluate what is worth keeping. I am sure that 
> the images
> that are worth storing for a long long time would fit on a few Tb at a 
> reasonable cost.
> Storing everything would make it harder to find something worth improving 
> in the future.
>
> Enrico.
>
>
> On Tue, 18 Oct 2011 17:12:42 +0200, Peter Keller 
> <pkel...@globalphasing.com> wrote:
>
>> Dear Enrico,
>>
>> Please don't get me wrong: what you are saying is not incorrect, but it
>> is only half the story.
>>
>> On Tue, 2011-10-18 at 15:13 +0200, Enrico Stura wrote:
>>> With improving techniques, we should always be making progress!
>>
>> Yes, of course!
>>
>>> If we are trying to answer a biological question that is really 
>>> important,
>>> we would be better off
>>> improving the purification, the crystallization, the cryo-conditions
>>
>> You have left X-ray crystallography out of this list. It is a technique
>> like the others, and can also be improved :-)
>>
>> It may be true that the number of crystallographers that are working on
>> improving instrumental methodology and software is small compared to the
>> number working on improving wet-lab techniques, but that number is not
>> zero, and the contribution is significant. The rest of you benefit from
>> that work!
>>
>>> instead of having to rely on
>>> processing old images with new software.
>>>
>>> I have 10 years  worth of images. I have reprocessed very few of them and
>>> never made any
>>> sensational progress using the new software. Poor diffraction is poor
>>> diffraction.
>>
>> Maybe so, but certain types of datasets are useful for methods and
>> software development, even if no new biological insights could be gained
>> by reprocessing them. These datasets are often hard to get hold of in
>> practice, especially when they are in someone's lab on a tape that
>> no-one has a reader for any more.
>>
>> Obtaining protein, growing crystals and collecting new data in such a
>> way that the interesting features of those datasets are reproduced can
>> be much much harder than curating the images would be. This is
>> especially true for software-oriented people like us who don't have
>> regular access to wet-lab facilities.
>>
>>> Money can be better spent buying a wine cellar, storage works for wine.
>>
>> Images have already been lost that ought to have been kept. The
>> questions are: how to select the datasets that are potentially of value,
>> and how to make sure that they don't disappear.
>>
>> Regards,
>> Peter.
>>
>
>
> -- 
> Enrico A. Stura D.Phil. (Oxon) ,    Tel: 33 (0)1 69 08 4302 Office
> Room 19, Bat.152,                   Tel: 33 (0)1 69 08 9449    Lab
> LTMB, SIMOPRO, IBiTec-S, CE Saclay, 91191 Gif-sur-Yvette,   FRANCE
> http://www-dsv.cea.fr/en/institutes/institute-of-biology-and-technology-saclay-ibitec-s/unites-de-recherche/department-of-molecular-engineering-of-proteins-simopro/molecular-toxinology-and-biotechnology-laboratory-ltmb/crystallogenesis-e.-stura
> http://www.chem.gla.ac.uk/protein/mirror/stura/index2.html
> e-mail: est...@cea.fr                             Fax: 33 (0)1 69 08 90 71

-- 

     ===============================================================
     *                                                             *
     * Gerard Bricogne                     g...@globalphasing.com  *
     *                                                             *
     * Global Phasing Ltd.                                         *
     * Sheraton House, Castle Park         Tel: +44-(0)1223-353033 *
     * Cambridge CB3 0AX, UK               Fax: +44-(0)1223-366889 *
     *                                                             *
     ===============================================================

Reply via email to