Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-17 Thread Ian Tickle
I thought about this and I think the only workable solution that will
keep everyone happy and maintain the status quo as far as possible is
to get an extra data item added to mmCIF for the parameter combination
multiplicity*occupancy (I believe this parameter is called the 'site
population' in Shel-X).

So in addition to '_atom_site.occupancy' (which is in the core CIF
dictionary) we would have (say) '_atom_site.population' specifically
for mmCIF.  That way programs that created mmCIF files for deposition
could use either parameter without ambiguity, and we don't need to
change any existing programs, except for those few that do CIF - PDB
conversion.  Existing macromolecular CIF files would have to be
updated to show the new data item, that's all.  This assumes of course
that that really is the data they actually contain, maybe some don't,
that would need to be checked: I suspect the PDB doesn't verify which
parameter is actually deposited.  Uncertainty about what the
'occupancy' column actually contains in the case of special positions
is what we want to resolve.  A bit untidy I know, but I can't see any
other way that doesn't involve major upheavals.

We would also need to get the PDB Contents Guide changed so that it
refers to 'site population' instead of occupancy.  That way no
programs reading/writing PDB files (except mine!) would need to be
changed.

Does anyone have a better suggestion?

-- Ian

On Thu, Dec 16, 2010 at 9:10 PM, Dale Tronrud det...@uoxray.uoregon.edu wrote:
 On 12/16/10 06:47, Ian Tickle wrote:

 For the sake of argument let's say that 0.02 Ang is too big to be
 rounding error.  So if you see that big a shift then the intention of
 the refinement program (or rather the programmer) which allowed such a
 value to be appear in the output should be that it's real.  If the
 intention of the user was that the atom is actually on axis then the
 program should not have allowed such a large shift, since it will be
 interpreted as 'much bigger than rounding error' and therefore
 'significantly off-axis'.

   I would certainly hope that no one believes that the precision of
 the parameters in a PDB file are significant to the level of round-off
 error!  It's bad enough that a small number of people take the three
 decimal points of precision in the PDB file seriously.  When a person
 places an atom in a model they aren't stating a believe that that is
 the EXACT location of the atom, only that they believe the center of
 the locus of all equivalent atoms in the crystal falls near that spot.
 If it's 0.02 A from a special position (and the SU of the position is
 larger than that) then it might be on the special position and it might
 not.

   If I come across one of your models and you have an atom exactly on
 a special position (assuming you're able to do that with three decimal
 points in a PDB file) I'd still assume that you only intend that there
 is an atom in the vicinity of that point and it might be exactly on
 the axis but it might be a little off.  All structural models are fuzzy.

 Dale Tronrud



Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-16 Thread Ian Tickle
Dale

 The reward of the
 full calculation is that all the complications you describe disappear.
 An atom that sits 0.001 A from a special position is not unstable
 in the least.

That's indeed a very interesting observation, I have to admit that I
didn't think that would be achievable.  But there must still be some
threshold of distance at which even that fails?  Presumably within
rounding error?  Or are you saying (I assume you aren't!) that you can
even refine all co-ordinates of an atom exactly on a special position?
 Say the x and z co-ordinates of an atom at (0,y,0) in monoclinic?
Presumably the atom would have to be given a random push one way or
the other (random number generators are generally not a feature of
crystallographic refinement programs, with the obvious exception of
simulated annealing!)?

?  I always avoid programing tests of a == b for real numbers
 because the round-off errors will always bite you at some point.
 This means that a test of an atom exactly on a special position
 can't be done reliably in floating point math.

Obviously common sense has to be applied here and tests for strict
floating-point equality studiously avoided.  But this is very easily
remedied, my optimisation programs are full of tests like
IF(ABS(X-Y).LT.1E-6) THEN ... and I'm certain so are yours (assuming
of course you still program in Fortran!).  This implies that in the
case that an atom is off-axis and disordered you have to take care not
to place it within say a few multiples of rounding error of the axis,
since then it might be indeed be confused with one 'on' the special
position.  However if someone claims that an atom sits within say
10*rounding error of an axis as distinct from being on the axis, then
a) there's no way that can be proved, and b) it would be
indistinguishable from being on the s.p. and the difference in terms
of structure factors and maps would be insignificant anyway, so it may
as well be on-axis.

I think this is how the Oxford CRYSTALS software (
http://www.xtl.ox.ac.uk/crystals.html ), which has been around for at
least 30 years, deals with this issue, so I can't accept that it can't
be made to work, even if I haven't got all the precise details
straight of how it's done in practice.

   Your preferred assumption is that any atom near enough to
 a special position is really on the special position and should
 have an occupancy of one.  My assumption is that no atom is every
 EXACTLY on the special position and if they are close enough to
 their symmetry image to forbid coexistence the occupancy should
 be 1/n.  I think either assumption is reasonable but, of course,
 prefer mine for what I consider practical reasons.  It helps that
 I have to code to make mine work.

Whichever way it's done is only a matter of convention (clearly both
ways work just as well), however I would reiterate that my main
concern here is that convention and practice appear to have parted
company in this particular instance!

Cheers

-- IAn


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-16 Thread Ian Tickle
 I think this is how the Oxford CRYSTALS software (
 http://www.xtl.ox.ac.uk/crystals.html ), which has been around for at
 least 30 years, deals with this issue, so I can't accept that it can't
 be made to work, even if I haven't got all the precise details
 straight of how it's done in practice.

PS just to point out that CRYSTALS is now (since 2009) open source, so
anyone can download it and find out for themselves how it's done!

Cheers

-- Ian


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-16 Thread Harry Powell
Or you could write and ask the author, who is always willing to help (sorry, 
David!)...

On 16 Dec 2010, at 11:24, Ian Tickle wrote:

 I think this is how the Oxford CRYSTALS software (
 http://www.xtl.ox.ac.uk/crystals.html ), which has been around for at
 least 30 years, deals with this issue, so I can't accept that it can't
 be made to work, even if I haven't got all the precise details
 straight of how it's done in practice.
 
 PS just to point out that CRYSTALS is now (since 2009) open source, so
 anyone can download it and find out for themselves how it's done!
 
 Cheers
 
 -- Ian

Harry
--
Dr Harry Powell, MRC Laboratory of Molecular Biology, MRC Centre, Hills Road, 
Cambridge, CB2 0QH


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-16 Thread Herman . Schreuder
Dear Ian,

You still have an arbitrary threshold: at high resolution you see two 
disordered atoms off-axis and at low resolution you see one ordered atom 
on-axis. However, somewhere in between you or the program has to decide whether 
you still see two atoms or if the data (resolution) does not warrant such a 
statement and you switch to the one-atom model. As George Sheldrick confirmed, 
there is a discontinuous transition between the two, which does not correspond 
to the physical reality. There is no quantum transition or something when the 
atom get closer than a certain limit to a crystallographic symmetry element. 
The atom does not care, its position is just determined by the local force 
fields and if those force fields have two local minima close together, the atom 
will be disordered.

The decision to switch from a model where the atom is added once with full 
occupancy to the fourier transform calculation, or whether the atom is added 
twice with half occupancy is an arbitrary decision, made by the programmer or 
the user of the program.

Cheers,
Herman  

-Original Message-
From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of Ian Tickle
Sent: Wednesday, December 15, 2010 6:57 PM
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

That's my whole point, it's not an arbitrary threshold, it's determined 
completely by what the data are capable of telling you about the structure, 
depending on the resolution.  Either you have sufficient resolution to be able 
to say that the atom is disordered off the s.p. or you don't and you have no 
choice but to constrain it to the s.p., whether it's actually disordered or not.

In any case there is no discontinuous change in occupancy at all, I never 
suggested that there should be.  Say at high resolution you see
2 disordered atoms off-axis each with 1/2 occupancy, so total occupancy = 1.  
At lower resolution you see 1 ordered atom on-axis with occupancy 1 - so no 
change in total occupancy.  It makes absolutely no difference if instead if you 
store multiplicity*occupancy in the file, the total occupancy is still 1.
However multiplicity*occupancy is not conserved so will change discontinuously 
(off-axis total = 1, on-axis total = 1/2); occupancy is conserved (it 
represents real atoms after all!).

I have no issue with Shel-X if it's writing out the occupancy for deposition 
(or at least making best efforts to do so).  What it does for intermediate 
files is the user's own data conversion problem if s/he decides to switch 
between different programs.

Cheers

-- Ian

On Wed, Dec 15, 2010 at 5:20 PM, George M. Sheldrick 
gshe...@shelx.uni-ac.gwdg.de wrote:

 I agree with Herman. It is simply not acceptable to have a sudden 
 discontinuous change in effective occupancy at some arbitrary point 
 as a disordered atom approaches a special position. Anyway, whatever 
 the CIF people decide, I will not introduce an incompatibility between 
 different versions of SHELX. When SHELXL produces a small molecule CIF 
 for depostion, it of course attempts to generate the occupancy 
 according to the CIF definition. Not too surprisingly, there are a few 
 complicated cases of 'nearly special positions' where the program gets this 
 wrong.
 This is probably the most serious known 'bug' in SHELXL, but is 
 proving rather difficult to eliminate completely.

 George

 Prof. George M. Sheldrick FRS
 Dept. Structural Chemistry,
 University of Goettingen,
 Tammannstr. 4,
 D37077 Goettingen, Germany
 Tel. +49-551-39-3021 or -3068
 Fax. +49-551-39-22582


 On Wed, 15 Dec 2010, Ian Tickle wrote:

 Hi Herman

 What makes an atom on a special position is that it is literally ON 
 the s.p.: it can't be 'almost on' the s.p. because then if you tried 
 to refine the co-ordinates perpendicular to the axis you would find 
 that the matrix would be singular or at least so badly conditioned 
 that the solution would be poorly defined.  The only solution to that 
 problem is to constrain (i.e. fix) these co-ordinates to be exactly 
 on the axis and not attempt to refine them.  The data are telling you 
 that you have insufficient resolution so you are not justified in 
 placing the atom very close to the axis; the best you can do is place 
 the atom with unit occupancy exactly _on_ the axis.  It's only once 
 the atom is a 'significant' distance (i.e. relative to the 
 resolution) away from the axis that these co-ordinates can be 
 independently refined.  Then the data are telling you that the atom is 
 disordered.
 If you collected higher resolution data you might well be able to 
 detect  successfully refine disordered atoms closer to the axis than 
 with low resolution data.  So it has nothing to do with the 
 programmer setting an arbitrary threshold.  This would have to be 
 some complicated function of atom type, occupancy, B factor, 
 resolution, data quality etc to work properly anyway so I doubt

Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-16 Thread Ian Tickle
 You still have an arbitrary threshold: at high resolution you see two 
 disordered atoms off-axis and at low resolution you see one ordered atom 
 on-axis. However, somewhere in between you or the program has to decide 
 whether you still see two atoms or if the data (resolution) does not warrant 
 such a statement and you switch to the one-atom model.

Switching between interpretations happens all the time as higher
resolution data is obtained!  Let's say at low resolution you see
density apparently for one copy of a side-chain (i.e. the density is
not of sufficient resolution to warrant interpreting it as disordered
two half side-chains) and you fit that.  To keep it simple I'm
assuming it's on a general, not a special position.  Then you collect
high-resolution data and now you see that the same side-chain is
disordered.  Rephrasing your statement: somewhere in between you or
the program has to decide whether you still see one (ordered, with
occupancy=1) side-chain or if the data (resolution) warrants such a
statement and you switch to the two side-chain (disordered, now with
sum occupancy=1) model.

 As George Sheldrick confirmed, there is a discontinuous transition between 
 the two, which does not correspond to the physical reality. There is no 
 quantum transition or something when the atom get closer than a certain 
 limit to a crystallographic symmetry element. The atom does not care, its 
 position is just determined by the local force fields and if those force 
 fields have two local minima close together, the atom will be disordered.

I'm sorry I don't see this discontinuity that you are referring to at
all (I think you have forgotten to include the symmetry copy), and I'm
certainly not claiming there is any quantum transition.  Let's start
with a disordered (1/2 occupancy) atom off a 2-fold axis and see what
happens to the electron density as it approaches and finally sits on
the 2-fold.  Here are the electron densities (this would obviously
look at lot better graphed - my apologies!):

 1   6  10  6  1   *   1  6  10  6  1

Now move the atom closer in steps to the axis so it overlaps more and
more with its symmetry copy:

   *

 1   6  10  6  2  6  10  6  1

   1   6  10  7 7  10  6  1

1  6  11  12  11  6  1

   1  7  16  16  7  1

2  12  20  12  2

  *

On the final step the fully overlapped atom has twice the occupancy
(i.e. 1 instead of 1/2) as the original as evidenced by a peak height
of 20 units, compared with 10.  In which step did the discontinuity
occur?  Clearly we could make the steps as small as we like, and you
would see a smooth transition from 2 1/2 atoms to 1 whole one.


 The decision to switch from a model where the atom is added once with full 
 occupancy to the fourier transform calculation, or whether the atom is added 
 twice with half occupancy is an arbitrary decision, made by the programmer or 
 the user of the program.

I completely agree, both ways of doing it work equally well and it's
all down to convention.  As I pointed out to Dale, the way I'm
describing does work in practice, as evidenced by the fact that
CRYSTALS which does it the way I describe, has been doing it this way
for the last 40 years.  So I can't accept that it can't work in
practice when plainly it does!

This issue here is purely one of divergence of agreed convention (CIF,
mmCIF  PDB) and practice.

Cheers

-- Ian


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-16 Thread Jon Wright

On 16/12/2010 12:24, Ian Tickle wrote:

I think this is how the Oxford CRYSTALS software (
http://www.xtl.ox.ac.uk/crystals.html ), which has been around for at
least 30 years, deals with this issue, so I can't accept that it can't
be made to work, even if I haven't got all the precise details
straight of how it's done in practice.


PS just to point out that CRYSTALS is now (since 2009) open source, so
anyone can download it and find out for themselves how it's done!


Do you have a link? I was looking and didn't find it

Thanks

Jon


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-16 Thread Ian Tickle
Hi Jon, try this:

http://www.xtl.ox.ac.uk/download.html

-- Ian

On Thu, Dec 16, 2010 at 1:14 PM, Jon Wright wri...@esrf.fr wrote:
 On 16/12/2010 12:24, Ian Tickle wrote:

 I think this is how the Oxford CRYSTALS software (
 http://www.xtl.ox.ac.uk/crystals.html ), which has been around for at
 least 30 years, deals with this issue, so I can't accept that it can't
 be made to work, even if I haven't got all the precise details
 straight of how it's done in practice.

 PS just to point out that CRYSTALS is now (since 2009) open source, so
 anyone can download it and find out for themselves how it's done!

 Do you have a link? I was looking and didn't find it

 Thanks

 Jon




Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-16 Thread Herman . Schreuder
Dear Ian,
Of course there is no dicontinuity, but you create one the moment you decide 
that certain symmetry operators no longer apply to certain atoms. Confusion 
arises e.g. when you download a pdb file of fairly high resolution and find a 
water molecule with an occupancy of e.g. 0.45 at 0.02 Å from a symmetry axis. 
Is it a special water suffering from some rounding errors with a total 
occupancy of 0.45, or a rotationally disordered water (perhaps refined by an 
overzealous crystallographer) with a combined occupancy of 0.90? 

This in my eyes unneccessary distinction between special and non-special 
positions does create confusion and unnecessarily complicates the programming 
of programs working with coordinates files. If certain refinement program have 
difficulties with atoms close to symmetry axes, one could always add the 
approprioate constraints of restraints.

Cheers,
Herman 

 

-Original Message-
From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of Ian Tickle
Sent: Thursday, December 16, 2010 1:31 PM
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

 You still have an arbitrary threshold: at high resolution you see two 
 disordered atoms off-axis and at low resolution you see one ordered atom 
 on-axis. However, somewhere in between you or the program has to decide 
 whether you still see two atoms or if the data (resolution) does not warrant 
 such a statement and you switch to the one-atom model.

Switching between interpretations happens all the time as higher resolution 
data is obtained!  Let's say at low resolution you see density apparently for 
one copy of a side-chain (i.e. the density is not of sufficient resolution to 
warrant interpreting it as disordered two half side-chains) and you fit that.  
To keep it simple I'm assuming it's on a general, not a special position.  Then 
you collect high-resolution data and now you see that the same side-chain is 
disordered.  Rephrasing your statement: somewhere in between you or the 
program has to decide whether you still see one (ordered, with
occupancy=1) side-chain or if the data (resolution) warrants such a statement 
and you switch to the two side-chain (disordered, now with sum occupancy=1) 
model.

 As George Sheldrick confirmed, there is a discontinuous transition between 
 the two, which does not correspond to the physical reality. There is no 
 quantum transition or something when the atom get closer than a certain 
 limit to a crystallographic symmetry element. The atom does not care, its 
 position is just determined by the local force fields and if those force 
 fields have two local minima close together, the atom will be disordered.

I'm sorry I don't see this discontinuity that you are referring to at all (I 
think you have forgotten to include the symmetry copy), and I'm certainly not 
claiming there is any quantum transition.  Let's start with a disordered (1/2 
occupancy) atom off a 2-fold axis and see what happens to the electron density 
as it approaches and finally sits on the 2-fold.  Here are the electron 
densities (this would obviously look at lot better graphed - my apologies!):

 1   6  10  6  1   *   1  6  10  6  1

Now move the atom closer in steps to the axis so it overlaps more and more with 
its symmetry copy:

   *

 1   6  10  6  2  6  10  6  1

   1   6  10  7 7  10  6  1

1  6  11  12  11  6  1

   1  7  16  16  7  1

2  12  20  12  2

  *

On the final step the fully overlapped atom has twice the occupancy (i.e. 1 
instead of 1/2) as the original as evidenced by a peak height of 20 units, 
compared with 10.  In which step did the discontinuity occur?  Clearly we could 
make the steps as small as we like, and you would see a smooth transition from 
2 1/2 atoms to 1 whole one.


 The decision to switch from a model where the atom is added once with full 
 occupancy to the fourier transform calculation, or whether the atom is added 
 twice with half occupancy is an arbitrary decision, made by the programmer or 
 the user of the program.

I completely agree, both ways of doing it work equally well and it's all down 
to convention.  As I pointed out to Dale, the way I'm describing does work in 
practice, as evidenced by the fact that CRYSTALS which does it the way I 
describe, has been doing it this way for the last 40 years.  So I can't accept 
that it can't work in practice when plainly it does!

This issue here is purely one of divergence of agreed convention (CIF, mmCIF  
PDB) and practice.

Cheers

-- Ian


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-16 Thread Ian Tickle
Ooops sorry that's only the executable, I guess to have to contact
David for the source.

-- Ian

On Thu, Dec 16, 2010 at 1:14 PM, Jon Wright wri...@esrf.fr wrote:
 On 16/12/2010 12:24, Ian Tickle wrote:

 I think this is how the Oxford CRYSTALS software (
 http://www.xtl.ox.ac.uk/crystals.html ), which has been around for at
 least 30 years, deals with this issue, so I can't accept that it can't
 be made to work, even if I haven't got all the precise details
 straight of how it's done in practice.

 PS just to point out that CRYSTALS is now (since 2009) open source, so
 anyone can download it and find out for themselves how it's done!

 Do you have a link? I was looking and didn't find it

 Thanks

 Jon




Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-16 Thread Ian Tickle
On Thu, Dec 16, 2010 at 1:18 PM,  herman.schreu...@sanofi-aventis.com wrote:
 Of course there is no dicontinuity, but you create one the moment you decide 
 that certain symmetry operators no longer apply to certain atoms.

Change 'atoms' in that statement to 'reflections', or to 'map grid
points', then read it again.  Does the (-h,k,-l) operator apply to the
(0,k,0) reflections or the (-x,y,-z) operator to the grid points
(0,y,0)?  Do you multiply the intensities of the (0,k,0) reflections
or the electron densities of the (0,y,0) points by 0.5 because they
are on the axis in order to make it easier to program, or do you
decide that will confuse the user, and instead deal with the
multiplicity factor internally in the program?

 Confusion arises e.g. when you download a pdb file of fairly high resolution 
 and find a water molecule with an occupancy of e.g. 0.45 at 0.02 Å from a 
 symmetry axis. Is it a special water suffering from some rounding errors with 
 a total occupancy of 0.45, or a rotationally disordered water (perhaps 
 refined by an overzealous crystallographer) with a combined occupancy of 0.90?

For the sake of argument let's say that 0.02 Ang is too big to be
rounding error.  So if you see that big a shift then the intention of
the refinement program (or rather the programmer) which allowed such a
value to be appear in the output should be that it's real.  If the
intention of the user was that the atom is actually on axis then the
program should not have allowed such a large shift, since it will be
interpreted as 'much bigger than rounding error' and therefore
'significantly off-axis'.  Of course even if the intention of the user
was that splitting the atom across the 2-fold is meaningful, the user
may still have got it wrong if the data doesn't justify splitting the
atom like that.  The important point is how the program's actions are
interpreted by other programs and users - that's why we have
conventions.

Single precision floating point rounding error is ~ 1 in 10^7, so even
if we allow 100 times that with a 100 Ang axis it's still only 0.001
Ang.  Despite Dale being able to successful refine a 1/2 atom 0.001
Ang from an axis I don't think he can claim that moving an atom by
0.001 Ang makes any difference above rounding error to the structure
factors.  If the data can't detect the difference, then the user
should not be claiming that there is one, and programs should not
allow atoms to be set as close as 0.001 Ang, (or whatever the
consensus is a suitable multiple of the rounding error) from an axis:
it just makes no sense.  You just have to decide what is the smallest
realistic shift that the best data could possibly detect under optimal
conditions, agree that programs can't set an off-axis shift below
that, and set the rounding error test lower than that (say by an order
of magnitude for safety).

As I said CRYSTALS must do something like this and it has worked fine
for the last 40 years on small molecules where clearly the resolution
is going to allow you to detect much smaller off-axis shifts than for
macromolecules, so the choice of rounding threshold will be even more
critical.

 This in my eyes unneccessary distinction between special and non-special 
 positions does create confusion and unnecessarily complicates the programming 
 of programs working with coordinates files.

I suspect the reason that the CIF committee decided that it was
necessary to adopt this convention is that it makes transparent the
connection between occupancy, the chemical formula and static
disorder, i.e. if you see partial occupancy then you know its
disordered; occupancy=1 simply and clearly signifies that a whole atom
is present.  This connection is obfuscated by multiplying by the point
multiplicity, which is really just an algorithmic issue for the
programmer to deal with internally (exactly as for reflections and map
grid points), and shouldn't 'leak out' into the user's domain.

It's a very minor complication compared with everything else that goes
on in a refinement program.  If you're dealing with reflections or
maps you still have to program the point multiplicity, you can't avoid
it.  So it's not as though you have to write additional subroutines
specifically for atoms.

Cheers

-- Ian


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-16 Thread Dale Tronrud
On 12/16/10 03:06, Ian Tickle wrote:
 Dale
 
 The reward of the
 full calculation is that all the complications you describe disappear.
 An atom that sits 0.001 A from a special position is not unstable
 in the least.
 
 That's indeed a very interesting observation, I have to admit that I
 didn't think that would be achievable.  But there must still be some
 threshold of distance at which even that fails?  Presumably within
 rounding error?  Or are you saying (I assume you aren't!) that you can
 even refine all co-ordinates of an atom exactly on a special position?
  Say the x and z co-ordinates of an atom at (0,y,0) in monoclinic?
 Presumably the atom would have to be given a random push one way or
 the other (random number generators are generally not a feature of
 crystallographic refinement programs, with the obvious exception of
 simulated annealing!)?
 

   To be frank, I wrote this code about 15 years ago, it works, and
I've not given any thought to atoms on special positions since.  I'll
have to go back to my notes and code to dig up the exact method.
Anyone with a copy of TNT can look up the code.  I am, however, not
in the least concerned about what happens when an atom falls exactly
on a special position because I just don't think that any part of a
protein model can be considered exact.  If I have a model with two
atoms, of occ=1/2 each, sitting 0.0001 A apart - it fits the density
and I think everyone knows what that model means, or at least they
should. If you decide to shove them each 0.5 A and call them a
single atom with occ=1, your model will fit the density just as well
and I have no problem with that either.

   By the way, the refinement issue has nothing to do with special
positions.  The instability you observe occurs any time you build two
atoms into the same bit of density.  If your model has two atoms, at
a general position, with exactly the same coordinates the Normal
matrix will have a singularity.  The problem doesn't come up much
because we normally choose not to build such models.  It can be an
issue in models with disorder where different conformations interpenetrate
each other but the stereochemical restraints usually come to the
rescue then.

 ?  I always avoid programing tests of a == b for real numbers
 because the round-off errors will always bite you at some point.
 This means that a test of an atom exactly on a special position
 can't be done reliably in floating point math.
 
 Obviously common sense has to be applied here and tests for strict
 floating-point equality studiously avoided.  But this is very easily
 remedied, my optimisation programs are full of tests like
 IF(ABS(X-Y).LT.1E-6) THEN ... and I'm certain so are yours (assuming
 of course you still program in Fortran!).  This implies that in the
 case that an atom is off-axis and disordered you have to take care not
 to place it within say a few multiples of rounding error of the axis,
 since then it might be indeed be confused with one 'on' the special
 position.  However if someone claims that an atom sits within say
 10*rounding error of an axis as distinct from being on the axis, then
 a) there's no way that can be proved, and b) it would be
 indistinguishable from being on the s.p. and the difference in terms
 of structure factors and maps would be insignificant anyway, so it may
 as well be on-axis.

   If the difference is insignificant, it may as well be off-axis.  I
guess if the difference is insignificant it just comes down to personal
preferences.

Dale Tronrud

 
 I think this is how the Oxford CRYSTALS software (
 http://www.xtl.ox.ac.uk/crystals.html ), which has been around for at
 least 30 years, deals with this issue, so I can't accept that it can't
 be made to work, even if I haven't got all the precise details
 straight of how it's done in practice.
 
   Your preferred assumption is that any atom near enough to
 a special position is really on the special position and should
 have an occupancy of one.  My assumption is that no atom is every
 EXACTLY on the special position and if they are close enough to
 their symmetry image to forbid coexistence the occupancy should
 be 1/n.  I think either assumption is reasonable but, of course,
 prefer mine for what I consider practical reasons.  It helps that
 I have to code to make mine work.
 
 Whichever way it's done is only a matter of convention (clearly both
 ways work just as well), however I would reiterate that my main
 concern here is that convention and practice appear to have parted
 company in this particular instance!
 
 Cheers
 
 -- IAn


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-16 Thread Dale Tronrud
On 12/16/10 06:47, Ian Tickle wrote:
 
 For the sake of argument let's say that 0.02 Ang is too big to be
 rounding error.  So if you see that big a shift then the intention of
 the refinement program (or rather the programmer) which allowed such a
 value to be appear in the output should be that it's real.  If the
 intention of the user was that the atom is actually on axis then the
 program should not have allowed such a large shift, since it will be
 interpreted as 'much bigger than rounding error' and therefore
 'significantly off-axis'. 

   I would certainly hope that no one believes that the precision of
the parameters in a PDB file are significant to the level of round-off
error!  It's bad enough that a small number of people take the three
decimal points of precision in the PDB file seriously.  When a person
places an atom in a model they aren't stating a believe that that is
the EXACT location of the atom, only that they believe the center of
the locus of all equivalent atoms in the crystal falls near that spot.
If it's 0.02 A from a special position (and the SU of the position is
larger than that) then it might be on the special position and it might
not.

   If I come across one of your models and you have an atom exactly on
a special position (assuming you're able to do that with three decimal
points in a PDB file) I'd still assume that you only intend that there
is an atom in the vicinity of that point and it might be exactly on
the axis but it might be a little off.  All structural models are fuzzy.

Dale Tronrud


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-15 Thread Ian Tickle
Dear George

Is applying the multiplicity factor to the occupancy internally in the
program such a issue anyway?  It need only be done once per atom on
input (i.e. you multiply each input occupancy by the multiplicity to
get the combined multiplicity*occupancy value that you would have
reading in directly in the current version), and then once per atom
again on output, reversing the process.  There shouldn't be any need
to change anything in the inner atom/reflection loop where obviously
it would indeed have slowed things down.

I can see though that the backwards-compatibility issue is more
serious.  However I suspect it will affect only a small proportion of
cases (though I accept that the fact that it may affect any at all may
be sufficient grounds for you to reject it!).  If the input value
exceeds the multiplicity we can say that it's definitely an occupancy
(otherwise clearly the occupancy would be  1).  If it's less there's
an ambiguity for sure; however then it's more likely to be the
multiplicity*occupancy (so the occupancy is nearer to 1), on the
grounds that small occupancies are less likely to be observed, because
the effect on diffraction will be less significant.  I accept that
second-guessing the user's intentions in this way is not ideal!  I
wonder how often fractional occupancies are observed at special
positions anyway?

Regards

-- Ian

On Fri, Dec 10, 2010 at 11:28 PM, George M. Sheldrick
gshe...@shelx.uni-ac.gwdg.de wrote:
 SHELXL also expects that the occupancy of a fully occupied atom on a
 threefold axis should be set at 1/3, and will generate this automatically
 if necessary. It will also generate automatically the necessary
 constraints for the x, y and z parameters (and for the Uij if the atom is
 anisotropic). It is essential that this is done correctly if a full-matrix
 refinement is being performed (e.g. to get esd estimates), otherwise
 the refinement can explode. The user may change or switch off the
 tolerance for detecting whether an atom is on a special position
 (with the SPEC instruction). Setting the occupancy to a fraction avoided
 a complicated IF construction inside a loop and 35 years ago computers
 were so slow! I can't change it now because I have to preserve upwards
 compatibility. Unfortunately the CIF committee decided to use the other
 definition (i.e. the Zn on the threefold axis has an occupancy of 1.0)
 and this has caused considerable confusion in the small molecule world
 ever since; atoms are frequently encountered on special positions in
 inorganic and mineral structures.

 George

 Prof. George M. Sheldrick FRS
 Dept. Structural Chemistry,
 University of Goettingen,
 Tammannstr. 4,
 D37077 Goettingen, Germany
 Tel. +49-551-39-3021 or -3068
 Fax. +49-551-39-22582


 On Fri, 10 Dec 2010, Ed Pozharski wrote:

 On Fri, 2010-12-10 at 21:53 +, Ian Tickle wrote:
  Hmmm - but shouldn't the occupancy of the Zn be 1.00 if it's on the
  special position

 Shouldn't 1/3 be better for programming purposes?  If you set occupancy
 to 1.0, then you should specify that symmetry operators do not apply for
 these atoms, making Fc calculation a bit more cumbersome.

 If definition of the asu content is you get full content of the unit
 cell after applying symmetry operators, then occupancy *must* be 1/3,
 right?

 The first zinc and the water are on special position, but because they
 are not excluded from positional refinement (perhaps they should be),
 they will drift a bit.  CNS has distance cutoff for treating atoms as
 special positions, if it jumps over the limit during, say, simulated
 annealing, it  will cause problems.  Perhaps PROLSQ did something
 similar.  It is a good question if it's better to fix these in place or
 let them wobble a bit to account for some potential disorder.  While I
 see the formal argument that it should be nailed to three-fold axes, it
 is also true that this is a mathematical compromise to simplify modeling
 that does not reflect physical reality (i.e. you don't have three
 partially occupied zinc ions, it's just one).  In any event, given that
 this is a 1.5A structure, (-0.002 0.004) is statistically speaking the
 same as (0 0).

 Cheers,

 Ed.

 --
 I'd jump in myself, if I weren't so good at whistling.
                                Julian, King of Lemurs





Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-15 Thread George M. Sheldrick

Dear Ian,

Of course I could convert the occupancy on reading the atom in and convert 
it back agains on reading it out. This is not quite so trivial as it 
sounds because I need to set a threshold as to how close the atom has 
to be to a special position to be treated as special, and take care 
that rounding errors have the same effect on input and output and that 
the coordinates have not moved in or out of the special zone in the 
meantime.

As it stands in SHELX, an atom that is near to a twofold will have an 
occupancy of 0.5 whether it is disordered close to a special position 
or whether it is really special, so this is never a problem.  

SHELXL is mainly used for small molecules that frequently have atoms on
speical positions, and disordered solvent molecules approximately on
sppecial positions are also very common (for example in centrosymmetric 
space groups toluene usually lies on the center of symmetry). Occupancies
are often tied to free variables which would also complicate any changes
to the code. And in any case, SHELX has been upwards compatible for the 
last 35 years and I wish it to remain that way.

Best wishes, George

Prof. George M. Sheldrick FRS
Dept. Structural Chemistry,
University of Goettingen,
Tammannstr. 4,
D37077 Goettingen, Germany
Tel. +49-551-39-3021 or -3068
Fax. +49-551-39-22582


On Wed, 15 Dec 2010, Ian Tickle wrote:

 Dear George
 
 Is applying the multiplicity factor to the occupancy internally in the
 program such a issue anyway?  It need only be done once per atom on
 input (i.e. you multiply each input occupancy by the multiplicity to
 get the combined multiplicity*occupancy value that you would have
 reading in directly in the current version), and then once per atom
 again on output, reversing the process.  There shouldn't be any need
 to change anything in the inner atom/reflection loop where obviously
 it would indeed have slowed things down.
 
 I can see though that the backwards-compatibility issue is more
 serious.  However I suspect it will affect only a small proportion of
 cases (though I accept that the fact that it may affect any at all may
 be sufficient grounds for you to reject it!).  If the input value
 exceeds the multiplicity we can say that it's definitely an occupancy
 (otherwise clearly the occupancy would be  1).  If it's less there's
 an ambiguity for sure; however then it's more likely to be the
 multiplicity*occupancy (so the occupancy is nearer to 1), on the
 grounds that small occupancies are less likely to be observed, because
 the effect on diffraction will be less significant.  I accept that
 second-guessing the user's intentions in this way is not ideal!  I
 wonder how often fractional occupancies are observed at special
 positions anyway?
 
 Regards
 
 -- Ian
 
 On Fri, Dec 10, 2010 at 11:28 PM, George M. Sheldrick
 gshe...@shelx.uni-ac.gwdg.de wrote:
  SHELXL also expects that the occupancy of a fully occupied atom on a
  threefold axis should be set at 1/3, and will generate this automatically
  if necessary. It will also generate automatically the necessary
  constraints for the x, y and z parameters (and for the Uij if the atom is
  anisotropic). It is essential that this is done correctly if a full-matrix
  refinement is being performed (e.g. to get esd estimates), otherwise
  the refinement can explode. The user may change or switch off the
  tolerance for detecting whether an atom is on a special position
  (with the SPEC instruction). Setting the occupancy to a fraction avoided
  a complicated IF construction inside a loop and 35 years ago computers
  were so slow! I can't change it now because I have to preserve upwards
  compatibility. Unfortunately the CIF committee decided to use the other
  definition (i.e. the Zn on the threefold axis has an occupancy of 1.0)
  and this has caused considerable confusion in the small molecule world
  ever since; atoms are frequently encountered on special positions in
  inorganic and mineral structures.
 
  George
 
  Prof. George M. Sheldrick FRS
  Dept. Structural Chemistry,
  University of Goettingen,
  Tammannstr. 4,
  D37077 Goettingen, Germany
  Tel. +49-551-39-3021 or -3068
  Fax. +49-551-39-22582
 
 
  On Fri, 10 Dec 2010, Ed Pozharski wrote:
 
  On Fri, 2010-12-10 at 21:53 +, Ian Tickle wrote:
   Hmmm - but shouldn't the occupancy of the Zn be 1.00 if it's on the
   special position
 
  Shouldn't 1/3 be better for programming purposes?  If you set occupancy
  to 1.0, then you should specify that symmetry operators do not apply for
  these atoms, making Fc calculation a bit more cumbersome.
 
  If definition of the asu content is you get full content of the unit
  cell after applying symmetry operators, then occupancy *must* be 1/3,
  right?
 
  The first zinc and the water are on special position, but because they
  are not excluded from positional refinement (perhaps they should be),
  they will drift a bit.  CNS has distance cutoff for treating atoms as
  special 

Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-15 Thread Ian Tickle
Dear George

I would say that an atom has fractional occupancy (but unit
multiplicity) unless it's exactly on the special position (though I
can foresee problems with rounding of decimal places for an atom say
at x=1/3), so that effectively once the atom is fixed exactly on the
s.p. the symmetry copies coalesce into a single atom with unit
occupancy (but fractional multiplicity).  This is at least one
advantage of having co-ordinates stored as fractional - it would
probably be more tricky with orthogonalised co-ordinates.  Presumably
once an input atom has satisfied the condition of being 'sufficiently
close' to a s.p. to be considered as 'on' the s.p. then the
constraints fix the co-ordinates exactly on the special position and
henceforth it's forcibly prevented from moving off it?  In any case if
an atom is very close to its symmetry copy you are going to have
matrix conditioning problems for the co-ordinates perpendicular to the
axis of symmetry (or mirror plane), so then you have no choice but to
disallow co-ordinate shifts of the atom which would take it off the
special position?

Cheers

-- Ian

On Wed, Dec 15, 2010 at 11:42 AM, George M. Sheldrick
gshe...@shelx.uni-ac.gwdg.de wrote:

 Dear Ian,

 Of course I could convert the occupancy on reading the atom in and convert
 it back agains on reading it out. This is not quite so trivial as it
 sounds because I need to set a threshold as to how close the atom has
 to be to a special position to be treated as special, and take care
 that rounding errors have the same effect on input and output and that
 the coordinates have not moved in or out of the special zone in the
 meantime.

 As it stands in SHELX, an atom that is near to a twofold will have an
 occupancy of 0.5 whether it is disordered close to a special position
 or whether it is really special, so this is never a problem.

 SHELXL is mainly used for small molecules that frequently have atoms on
 speical positions, and disordered solvent molecules approximately on
 sppecial positions are also very common (for example in centrosymmetric
 space groups toluene usually lies on the center of symmetry). Occupancies
 are often tied to free variables which would also complicate any changes
 to the code. And in any case, SHELX has been upwards compatible for the
 last 35 years and I wish it to remain that way.

 Best wishes, George

 Prof. George M. Sheldrick FRS
 Dept. Structural Chemistry,
 University of Goettingen,
 Tammannstr. 4,
 D37077 Goettingen, Germany
 Tel. +49-551-39-3021 or -3068
 Fax. +49-551-39-22582


 On Wed, 15 Dec 2010, Ian Tickle wrote:

 Dear George

 Is applying the multiplicity factor to the occupancy internally in the
 program such a issue anyway?  It need only be done once per atom on
 input (i.e. you multiply each input occupancy by the multiplicity to
 get the combined multiplicity*occupancy value that you would have
 reading in directly in the current version), and then once per atom
 again on output, reversing the process.  There shouldn't be any need
 to change anything in the inner atom/reflection loop where obviously
 it would indeed have slowed things down.

 I can see though that the backwards-compatibility issue is more
 serious.  However I suspect it will affect only a small proportion of
 cases (though I accept that the fact that it may affect any at all may
 be sufficient grounds for you to reject it!).  If the input value
 exceeds the multiplicity we can say that it's definitely an occupancy
 (otherwise clearly the occupancy would be  1).  If it's less there's
 an ambiguity for sure; however then it's more likely to be the
 multiplicity*occupancy (so the occupancy is nearer to 1), on the
 grounds that small occupancies are less likely to be observed, because
 the effect on diffraction will be less significant.  I accept that
 second-guessing the user's intentions in this way is not ideal!  I
 wonder how often fractional occupancies are observed at special
 positions anyway?

 Regards

 -- Ian

 On Fri, Dec 10, 2010 at 11:28 PM, George M. Sheldrick
 gshe...@shelx.uni-ac.gwdg.de wrote:
  SHELXL also expects that the occupancy of a fully occupied atom on a
  threefold axis should be set at 1/3, and will generate this automatically
  if necessary. It will also generate automatically the necessary
  constraints for the x, y and z parameters (and for the Uij if the atom is
  anisotropic). It is essential that this is done correctly if a full-matrix
  refinement is being performed (e.g. to get esd estimates), otherwise
  the refinement can explode. The user may change or switch off the
  tolerance for detecting whether an atom is on a special position
  (with the SPEC instruction). Setting the occupancy to a fraction avoided
  a complicated IF construction inside a loop and 35 years ago computers
  were so slow! I can't change it now because I have to preserve upwards
  compatibility. Unfortunately the CIF committee decided to use the other
  definition (i.e. the Zn on the 

Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-15 Thread George M. Sheldrick

Dear Ian,

Yes. Once an atom has been identified as on a special position because it 
is within a specied tolerance, SHELXL applies the appropriate contraints
to both the coordinates and the Uij so there is no danger of the atom 
wandering off the special position. Usually, when an atom it very close to
a special position but not actually on it, it is part of a disordered 
solvent molecule and will be prevented from misbehaving by distance and
Uij restraints imposed by the user; in such a case the user usually also 
switches off the special position check for that disordered molecule 
(SPEC -1) to avoid atoms being idealized onto the special position by the 
program. For solvent molecules disordered on special positions it is
also necessary to ignore symmetry equivalent atoms when generating 
idealized hydrogen atoms etc. (PART -N in SHELXL). This is all routine 
practice in small molecule crystallography. I agree that the use of 
orthogonal rather than crystal coordinates can obscur the situation, 
e.g. for an atom on a threefold axis.

Best wishes, George

Prof. George M. Sheldrick FRS
Dept. Structural Chemistry,
University of Goettingen,
Tammannstr. 4,
D37077 Goettingen, Germany
Tel. +49-551-39-3021 or -3068
Fax. +49-551-39-22582


On Wed, 15 Dec 2010, Ian Tickle wrote:

 Dear George
 
 I would say that an atom has fractional occupancy (but unit
 multiplicity) unless it's exactly on the special position (though I
 can foresee problems with rounding of decimal places for an atom say
 at x=1/3), so that effectively once the atom is fixed exactly on the
 s.p. the symmetry copies coalesce into a single atom with unit
 occupancy (but fractional multiplicity).  This is at least one
 advantage of having co-ordinates stored as fractional - it would
 probably be more tricky with orthogonalised co-ordinates.  Presumably
 once an input atom has satisfied the condition of being 'sufficiently
 close' to a s.p. to be considered as 'on' the s.p. then the
 constraints fix the co-ordinates exactly on the special position and
 henceforth it's forcibly prevented from moving off it?  In any case if
 an atom is very close to its symmetry copy you are going to have
 matrix conditioning problems for the co-ordinates perpendicular to the
 axis of symmetry (or mirror plane), so then you have no choice but to
 disallow co-ordinate shifts of the atom which would take it off the
 special position?
 
 Cheers
 
 -- Ian
 
 On Wed, Dec 15, 2010 at 11:42 AM, George M. Sheldrick
 gshe...@shelx.uni-ac.gwdg.de wrote:
 
  Dear Ian,
 
  Of course I could convert the occupancy on reading the atom in and convert
  it back agains on reading it out. This is not quite so trivial as it
  sounds because I need to set a threshold as to how close the atom has
  to be to a special position to be treated as special, and take care
  that rounding errors have the same effect on input and output and that
  the coordinates have not moved in or out of the special zone in the
  meantime.
 
  As it stands in SHELX, an atom that is near to a twofold will have an
  occupancy of 0.5 whether it is disordered close to a special position
  or whether it is really special, so this is never a problem.
 
  SHELXL is mainly used for small molecules that frequently have atoms on
  speical positions, and disordered solvent molecules approximately on
  sppecial positions are also very common (for example in centrosymmetric
  space groups toluene usually lies on the center of symmetry). Occupancies
  are often tied to free variables which would also complicate any changes
  to the code. And in any case, SHELX has been upwards compatible for the
  last 35 years and I wish it to remain that way.
 
  Best wishes, George
 
  Prof. George M. Sheldrick FRS
  Dept. Structural Chemistry,
  University of Goettingen,
  Tammannstr. 4,
  D37077 Goettingen, Germany
  Tel. +49-551-39-3021 or -3068
  Fax. +49-551-39-22582
 
 
  On Wed, 15 Dec 2010, Ian Tickle wrote:
 
  Dear George
 
  Is applying the multiplicity factor to the occupancy internally in the
  program such a issue anyway?  It need only be done once per atom on
  input (i.e. you multiply each input occupancy by the multiplicity to
  get the combined multiplicity*occupancy value that you would have
  reading in directly in the current version), and then once per atom
  again on output, reversing the process.  There shouldn't be any need
  to change anything in the inner atom/reflection loop where obviously
  it would indeed have slowed things down.
 
  I can see though that the backwards-compatibility issue is more
  serious.  However I suspect it will affect only a small proportion of
  cases (though I accept that the fact that it may affect any at all may
  be sufficient grounds for you to reject it!).  If the input value
  exceeds the multiplicity we can say that it's definitely an occupancy
  (otherwise clearly the occupancy would be  1).  If it's less there's
  an ambiguity for sure; however then it's more likely to 

Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-15 Thread Ian Tickle
Dear George

I notice that the Oxford CRYSTALS program, which is what I used when I
did small-molecule crystallography and which is still quite popular
among the small-molecule people (maybe not as much as Shel-X!), uses
the CIF convention:

OCC= This parameter defines the site occupancy EXCLUDING special
position effects (i.e. is the 'chemical occupancy'). The default is
1.0.  Special position effects are computed by CRYSTALS and multiplied
onto this parameter.

(from http://www.xtl.ox.ac.uk/crystalsmanual-atomic.html )

Also the mmCIF specification on this is the same CIF one (hardly
surprising I guess since it's derived from it):

_atom_site.occupancy  The fraction of the atom type present at this site.
The sum of the occupancies of all the atom types at this site may not
significantly exceed 1.0 unless it is a dummy site.

(from 
http://mmcif.pdb.org/dictionaries/mmcif_std.dic/Items/_atom_site.occupancy.html
)

which doesn't say so specifically, but it's implied since if the
multiplicity is included then the maximum value of the sum is the
multiplicity, not 1.0.

So there's a real possibility of user - and programmer - confusion
here!  I must say that until I looked at the 4INS file I had assumed
that the PDB occupancy was what it claimed to be, i.e. the real
'chemical' occupancy not the multiplicity-fudged one.

Cheers

-- Ian

On Wed, Dec 15, 2010 at 1:53 PM, George M. Sheldrick
gshe...@shelx.uni-ac.gwdg.de wrote:

 Dear Ian,

 Yes. Once an atom has been identified as on a special position because it
 is within a specied tolerance, SHELXL applies the appropriate contraints
 to both the coordinates and the Uij so there is no danger of the atom
 wandering off the special position. Usually, when an atom it very close to
 a special position but not actually on it, it is part of a disordered
 solvent molecule and will be prevented from misbehaving by distance and
 Uij restraints imposed by the user; in such a case the user usually also
 switches off the special position check for that disordered molecule
 (SPEC -1) to avoid atoms being idealized onto the special position by the
 program. For solvent molecules disordered on special positions it is
 also necessary to ignore symmetry equivalent atoms when generating
 idealized hydrogen atoms etc. (PART -N in SHELXL). This is all routine
 practice in small molecule crystallography. I agree that the use of
 orthogonal rather than crystal coordinates can obscur the situation,
 e.g. for an atom on a threefold axis.

 Best wishes, George

 Prof. George M. Sheldrick FRS
 Dept. Structural Chemistry,
 University of Goettingen,
 Tammannstr. 4,
 D37077 Goettingen, Germany
 Tel. +49-551-39-3021 or -3068
 Fax. +49-551-39-22582


 On Wed, 15 Dec 2010, Ian Tickle wrote:

 Dear George

 I would say that an atom has fractional occupancy (but unit
 multiplicity) unless it's exactly on the special position (though I
 can foresee problems with rounding of decimal places for an atom say
 at x=1/3), so that effectively once the atom is fixed exactly on the
 s.p. the symmetry copies coalesce into a single atom with unit
 occupancy (but fractional multiplicity).  This is at least one
 advantage of having co-ordinates stored as fractional - it would
 probably be more tricky with orthogonalised co-ordinates.  Presumably
 once an input atom has satisfied the condition of being 'sufficiently
 close' to a s.p. to be considered as 'on' the s.p. then the
 constraints fix the co-ordinates exactly on the special position and
 henceforth it's forcibly prevented from moving off it?  In any case if
 an atom is very close to its symmetry copy you are going to have
 matrix conditioning problems for the co-ordinates perpendicular to the
 axis of symmetry (or mirror plane), so then you have no choice but to
 disallow co-ordinate shifts of the atom which would take it off the
 special position?

 Cheers

 -- Ian

 On Wed, Dec 15, 2010 at 11:42 AM, George M. Sheldrick
 gshe...@shelx.uni-ac.gwdg.de wrote:
 
  Dear Ian,
 
  Of course I could convert the occupancy on reading the atom in and convert
  it back agains on reading it out. This is not quite so trivial as it
  sounds because I need to set a threshold as to how close the atom has
  to be to a special position to be treated as special, and take care
  that rounding errors have the same effect on input and output and that
  the coordinates have not moved in or out of the special zone in the
  meantime.
 
  As it stands in SHELX, an atom that is near to a twofold will have an
  occupancy of 0.5 whether it is disordered close to a special position
  or whether it is really special, so this is never a problem.
 
  SHELXL is mainly used for small molecules that frequently have atoms on
  speical positions, and disordered solvent molecules approximately on
  sppecial positions are also very common (for example in centrosymmetric
  space groups toluene usually lies on the center of symmetry). Occupancies
  are often tied to free variables which 

Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-15 Thread Herman . Schreuder
Dear Ian,

In my view, the confusion arises by NOT including the multiplicity into the 
occupancy. If we make the gedanken experiment and look at a range of crystal 
structures with a rotationally disordered water molecule near a symmetry axis 
(they do exist!) then as long as the water molecule is sufficiently far from 
the axis, it is clear that the occupancy should be 1/2 or 1/3 or whatever is 
the multiplicity. However, as the molecule approaches the axis at a certain 
moment at a certain treshold set by the programmer of the refinement program, 
the molecule suddenly becomes special and the occupancy is set to 1.0. So 
depending on rounding errors, different thresholds etc. different programs may 
make different decisions on whether a water is special or not.

For me, this is confusing.

Best regards,
Herman 

-Original Message-
From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of Ian Tickle
Sent: Wednesday, December 15, 2010 3:47 PM
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

Dear George

I notice that the Oxford CRYSTALS program, which is what I used when I did 
small-molecule crystallography and which is still quite popular among the 
small-molecule people (maybe not as much as Shel-X!), uses the CIF convention:

OCC= This parameter defines the site occupancy EXCLUDING special position 
effects (i.e. is the 'chemical occupancy'). The default is 1.0.  Special 
position effects are computed by CRYSTALS and multiplied onto this parameter.

(from http://www.xtl.ox.ac.uk/crystalsmanual-atomic.html )

Also the mmCIF specification on this is the same CIF one (hardly surprising I 
guess since it's derived from it):

_atom_site.occupancy  The fraction of the atom type present at this site.
The sum of the occupancies of all the atom types at this site may not 
significantly exceed 1.0 unless it is a dummy site.

(from 
http://mmcif.pdb.org/dictionaries/mmcif_std.dic/Items/_atom_site.occupancy.html
)

which doesn't say so specifically, but it's implied since if the multiplicity 
is included then the maximum value of the sum is the multiplicity, not 1.0.

So there's a real possibility of user - and programmer - confusion here!  I 
must say that until I looked at the 4INS file I had assumed that the PDB 
occupancy was what it claimed to be, i.e. the real 'chemical' occupancy not the 
multiplicity-fudged one.

Cheers

-- Ian

On Wed, Dec 15, 2010 at 1:53 PM, George M. Sheldrick 
gshe...@shelx.uni-ac.gwdg.de wrote:

 Dear Ian,

 Yes. Once an atom has been identified as on a special position because 
 it is within a specied tolerance, SHELXL applies the appropriate 
 contraints to both the coordinates and the Uij so there is no danger 
 of the atom wandering off the special position. Usually, when an atom 
 it very close to a special position but not actually on it, it is part 
 of a disordered solvent molecule and will be prevented from 
 misbehaving by distance and Uij restraints imposed by the user; in 
 such a case the user usually also switches off the special position 
 check for that disordered molecule (SPEC -1) to avoid atoms being 
 idealized onto the special position by the program. For solvent 
 molecules disordered on special positions it is also necessary to 
 ignore symmetry equivalent atoms when generating idealized hydrogen 
 atoms etc. (PART -N in SHELXL). This is all routine practice in small 
 molecule crystallography. I agree that the use of orthogonal rather 
 than crystal coordinates can obscur the situation, e.g. for an atom on a 
 threefold axis.

 Best wishes, George

 Prof. George M. Sheldrick FRS
 Dept. Structural Chemistry,
 University of Goettingen,
 Tammannstr. 4,
 D37077 Goettingen, Germany
 Tel. +49-551-39-3021 or -3068
 Fax. +49-551-39-22582


 On Wed, 15 Dec 2010, Ian Tickle wrote:

 Dear George

 I would say that an atom has fractional occupancy (but unit
 multiplicity) unless it's exactly on the special position (though I 
 can foresee problems with rounding of decimal places for an atom say 
 at x=1/3), so that effectively once the atom is fixed exactly on the 
 s.p. the symmetry copies coalesce into a single atom with unit 
 occupancy (but fractional multiplicity).  This is at least one 
 advantage of having co-ordinates stored as fractional - it would 
 probably be more tricky with orthogonalised co-ordinates.  Presumably 
 once an input atom has satisfied the condition of being 'sufficiently 
 close' to a s.p. to be considered as 'on' the s.p. then the 
 constraints fix the co-ordinates exactly on the special position and 
 henceforth it's forcibly prevented from moving off it?  In any case 
 if an atom is very close to its symmetry copy you are going to have 
 matrix conditioning problems for the co-ordinates perpendicular to 
 the axis of symmetry (or mirror plane), so then you have no choice 
 but to disallow co-ordinate shifts of the atom which would take it 
 off the special position

Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-15 Thread Ian Tickle
Hi Herman

What makes an atom on a special position is that it is literally ON
the s.p.: it can't be 'almost on' the s.p. because then if you tried
to refine the co-ordinates perpendicular to the axis you would find
that the matrix would be singular or at least so badly conditioned
that the solution would be poorly defined.  The only solution to that
problem is to constrain (i.e. fix) these co-ordinates to be exactly on
the axis and not attempt to refine them.  The data are telling you
that you have insufficient resolution so you are not justified in
placing the atom very close to the axis; the best you can do is place
the atom with unit occupancy exactly _on_ the axis.  It's only once
the atom is a 'significant' distance (i.e. relative to the resolution)
away from the axis that these co-ordinates can be independently
refined.  Then the data are telling you that the atom is disordered.
If you collected higher resolution data you might well be able to
detect  successfully refine disordered atoms closer to the axis than
with low resolution data.  So it has nothing to do with the programmer
setting an arbitrary threshold.  This would have to be some
complicated function of atom type, occupancy, B factor, resolution,
data quality etc to work properly anyway so I doubt that it would be
feasible.  Instead it's determined completely by what the data are
capable of telling you about the structure, as indeed it should be.

My main concern was the conflict between some program implementations
and the PDB and mmCIF format descriptions on this issue.  For example
the PDB documentation says that the ATOM record contains the occupancy
(where this is defined in the CIF/mmCIF documentation).  If it had
intended that it should contain multiplicity*occupancy instead then
presumably it would have said so.

Cheers

-- Ian

On Wed, Dec 15, 2010 at 4:01 PM,  herman.schreu...@sanofi-aventis.com wrote:
 Dear Ian,

 In my view, the confusion arises by NOT including the multiplicity into the 
 occupancy. If we make the gedanken experiment and look at a range of crystal 
 structures with a rotationally disordered water molecule near a symmetry axis 
 (they do exist!) then as long as the water molecule is sufficiently far from 
 the axis, it is clear that the occupancy should be 1/2 or 1/3 or whatever is 
 the multiplicity. However, as the molecule approaches the axis at a certain 
 moment at a certain treshold set by the programmer of the refinement program, 
 the molecule suddenly becomes special and the occupancy is set to 1.0. So 
 depending on rounding errors, different thresholds etc. different programs 
 may make different decisions on whether a water is special or not.

 For me, this is confusing.

 Best regards,
 Herman

 -Original Message-
 From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of Ian 
 Tickle
 Sent: Wednesday, December 15, 2010 3:47 PM
 To: CCP4BB@JISCMAIL.AC.UK
 Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

 Dear George

 I notice that the Oxford CRYSTALS program, which is what I used when I did 
 small-molecule crystallography and which is still quite popular among the 
 small-molecule people (maybe not as much as Shel-X!), uses the CIF convention:

 OCC= This parameter defines the site occupancy EXCLUDING special position 
 effects (i.e. is the 'chemical occupancy'). The default is 1.0.  Special 
 position effects are computed by CRYSTALS and multiplied onto this parameter.

 (from http://www.xtl.ox.ac.uk/crystalsmanual-atomic.html )

 Also the mmCIF specification on this is the same CIF one (hardly surprising I 
 guess since it's derived from it):

 _atom_site.occupancy  The fraction of the atom type present at this site.
 The sum of the occupancies of all the atom types at this site may not 
 significantly exceed 1.0 unless it is a dummy site.

 (from 
 http://mmcif.pdb.org/dictionaries/mmcif_std.dic/Items/_atom_site.occupancy.html
 )

 which doesn't say so specifically, but it's implied since if the multiplicity 
 is included then the maximum value of the sum is the multiplicity, not 1.0.

 So there's a real possibility of user - and programmer - confusion here!  I 
 must say that until I looked at the 4INS file I had assumed that the PDB 
 occupancy was what it claimed to be, i.e. the real 'chemical' occupancy not 
 the multiplicity-fudged one.

 Cheers

 -- Ian

 On Wed, Dec 15, 2010 at 1:53 PM, George M. Sheldrick 
 gshe...@shelx.uni-ac.gwdg.de wrote:

 Dear Ian,

 Yes. Once an atom has been identified as on a special position because
 it is within a specied tolerance, SHELXL applies the appropriate
 contraints to both the coordinates and the Uij so there is no danger
 of the atom wandering off the special position. Usually, when an atom
 it very close to a special position but not actually on it, it is part
 of a disordered solvent molecule and will be prevented from
 misbehaving by distance and Uij restraints imposed by the user; in
 such a case

Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-15 Thread George M. Sheldrick

I agree with Herman. It is simply not acceptable to have a sudden 
discontinuous change in effective occupancy at some arbitrary point 
as a disordered atom approaches a special position. Anyway, whatever the
CIF people decide, I will not introduce an incompatibility between
different versions of SHELX. When SHELXL produces a small molecule CIF 
for depostion, it of course attempts to generate the occupancy according 
to the CIF definition. Not too surprisingly, there are a few complicated 
cases of 'nearly special positions' where the program gets this wrong.
This is probably the most serious known 'bug' in SHELXL, but is proving 
rather difficult to eliminate completely.

George 

Prof. George M. Sheldrick FRS
Dept. Structural Chemistry,
University of Goettingen,
Tammannstr. 4,
D37077 Goettingen, Germany
Tel. +49-551-39-3021 or -3068
Fax. +49-551-39-22582


On Wed, 15 Dec 2010, Ian Tickle wrote:

 Hi Herman
 
 What makes an atom on a special position is that it is literally ON
 the s.p.: it can't be 'almost on' the s.p. because then if you tried
 to refine the co-ordinates perpendicular to the axis you would find
 that the matrix would be singular or at least so badly conditioned
 that the solution would be poorly defined.  The only solution to that
 problem is to constrain (i.e. fix) these co-ordinates to be exactly on
 the axis and not attempt to refine them.  The data are telling you
 that you have insufficient resolution so you are not justified in
 placing the atom very close to the axis; the best you can do is place
 the atom with unit occupancy exactly _on_ the axis.  It's only once
 the atom is a 'significant' distance (i.e. relative to the resolution)
 away from the axis that these co-ordinates can be independently
 refined.  Then the data are telling you that the atom is disordered.
 If you collected higher resolution data you might well be able to
 detect  successfully refine disordered atoms closer to the axis than
 with low resolution data.  So it has nothing to do with the programmer
 setting an arbitrary threshold.  This would have to be some
 complicated function of atom type, occupancy, B factor, resolution,
 data quality etc to work properly anyway so I doubt that it would be
 feasible.  Instead it's determined completely by what the data are
 capable of telling you about the structure, as indeed it should be.
 
 My main concern was the conflict between some program implementations
 and the PDB and mmCIF format descriptions on this issue.  For example
 the PDB documentation says that the ATOM record contains the occupancy
 (where this is defined in the CIF/mmCIF documentation).  If it had
 intended that it should contain multiplicity*occupancy instead then
 presumably it would have said so.
 
 Cheers
 
 -- Ian
 
 On Wed, Dec 15, 2010 at 4:01 PM,  herman.schreu...@sanofi-aventis.com wrote:
  Dear Ian,
 
  In my view, the confusion arises by NOT including the multiplicity into the 
  occupancy. If we make the gedanken experiment and look at a range of 
  crystal structures with a rotationally disordered water molecule near a 
  symmetry axis (they do exist!) then as long as the water molecule is 
  sufficiently far from the axis, it is clear that the occupancy should be 
  1/2 or 1/3 or whatever is the multiplicity. However, as the molecule 
  approaches the axis at a certain moment at a certain treshold set by the 
  programmer of the refinement program, the molecule suddenly becomes special 
  and the occupancy is set to 1.0. So depending on rounding errors, different 
  thresholds etc. different programs may make different decisions on whether 
  a water is special or not.
 
  For me, this is confusing.
 
  Best regards,
  Herman
 
  -Original Message-
  From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of Ian 
  Tickle
  Sent: Wednesday, December 15, 2010 3:47 PM
  To: CCP4BB@JISCMAIL.AC.UK
  Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms
 
  Dear George
 
  I notice that the Oxford CRYSTALS program, which is what I used when I did 
  small-molecule crystallography and which is still quite popular among the 
  small-molecule people (maybe not as much as Shel-X!), uses the CIF 
  convention:
 
  OCC= This parameter defines the site occupancy EXCLUDING special position 
  effects (i.e. is the 'chemical occupancy'). The default is 1.0.  Special 
  position effects are computed by CRYSTALS and multiplied onto this 
  parameter.
 
  (from http://www.xtl.ox.ac.uk/crystalsmanual-atomic.html )
 
  Also the mmCIF specification on this is the same CIF one (hardly surprising 
  I guess since it's derived from it):
 
  _atom_site.occupancy  The fraction of the atom type present at this site.
  The sum of the occupancies of all the atom types at this site may not 
  significantly exceed 1.0 unless it is a dummy site.
 
  (from 
  http://mmcif.pdb.org/dictionaries/mmcif_std.dic/Items/_atom_site.occupancy.html
  )
 
  which doesn't say so

Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-15 Thread Dale Tronrud
:47 PM
 To: CCP4BB@JISCMAIL.AC.UK
 Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

 Dear George

 I notice that the Oxford CRYSTALS program, which is what I used when I did 
 small-molecule crystallography and which is still quite popular among the 
 small-molecule people (maybe not as much as Shel-X!), uses the CIF 
 convention:

 OCC= This parameter defines the site occupancy EXCLUDING special position 
 effects (i.e. is the 'chemical occupancy'). The default is 1.0.  Special 
 position effects are computed by CRYSTALS and multiplied onto this parameter.

 (from http://www.xtl.ox.ac.uk/crystalsmanual-atomic.html )

 Also the mmCIF specification on this is the same CIF one (hardly surprising 
 I guess since it's derived from it):

 _atom_site.occupancy  The fraction of the atom type present at this site.
 The sum of the occupancies of all the atom types at this site may not 
 significantly exceed 1.0 unless it is a dummy site.

 (from 
 http://mmcif.pdb.org/dictionaries/mmcif_std.dic/Items/_atom_site.occupancy.html
 )

 which doesn't say so specifically, but it's implied since if the 
 multiplicity is included then the maximum value of the sum is the 
 multiplicity, not 1.0.

 So there's a real possibility of user - and programmer - confusion here!  I 
 must say that until I looked at the 4INS file I had assumed that the PDB 
 occupancy was what it claimed to be, i.e. the real 'chemical' occupancy not 
 the multiplicity-fudged one.

 Cheers

 -- Ian

 On Wed, Dec 15, 2010 at 1:53 PM, George M. Sheldrick 
 gshe...@shelx.uni-ac.gwdg.de wrote:

 Dear Ian,

 Yes. Once an atom has been identified as on a special position because
 it is within a specied tolerance, SHELXL applies the appropriate
 contraints to both the coordinates and the Uij so there is no danger
 of the atom wandering off the special position. Usually, when an atom
 it very close to a special position but not actually on it, it is part
 of a disordered solvent molecule and will be prevented from
 misbehaving by distance and Uij restraints imposed by the user; in
 such a case the user usually also switches off the special position
 check for that disordered molecule (SPEC -1) to avoid atoms being
 idealized onto the special position by the program. For solvent
 molecules disordered on special positions it is also necessary to
 ignore symmetry equivalent atoms when generating idealized hydrogen
 atoms etc. (PART -N in SHELXL). This is all routine practice in small
 molecule crystallography. I agree that the use of orthogonal rather
 than crystal coordinates can obscur the situation, e.g. for an atom on a 
 threefold axis.

 Best wishes, George

 Prof. George M. Sheldrick FRS
 Dept. Structural Chemistry,
 University of Goettingen,
 Tammannstr. 4,
 D37077 Goettingen, Germany
 Tel. +49-551-39-3021 or -3068
 Fax. +49-551-39-22582


 On Wed, 15 Dec 2010, Ian Tickle wrote:

 Dear George

 I would say that an atom has fractional occupancy (but unit
 multiplicity) unless it's exactly on the special position (though I
 can foresee problems with rounding of decimal places for an atom say
 at x=1/3), so that effectively once the atom is fixed exactly on the
 s.p. the symmetry copies coalesce into a single atom with unit
 occupancy (but fractional multiplicity).  This is at least one
 advantage of having co-ordinates stored as fractional - it would
 probably be more tricky with orthogonalised co-ordinates.  Presumably
 once an input atom has satisfied the condition of being 'sufficiently
 close' to a s.p. to be considered as 'on' the s.p. then the
 constraints fix the co-ordinates exactly on the special position and
 henceforth it's forcibly prevented from moving off it?  In any case
 if an atom is very close to its symmetry copy you are going to have
 matrix conditioning problems for the co-ordinates perpendicular to
 the axis of symmetry (or mirror plane), so then you have no choice
 but to disallow co-ordinate shifts of the atom which would take it
 off the special position?

 Cheers

 -- Ian

 On Wed, Dec 15, 2010 at 11:42 AM, George M. Sheldrick
 gshe...@shelx.uni-ac.gwdg.de wrote:

 Dear Ian,

 Of course I could convert the occupancy on reading the atom in and
 convert it back agains on reading it out. This is not quite so
 trivial as it sounds because I need to set a threshold as to how
 close the atom has to be to a special position to be treated as
 special, and take care that rounding errors have the same effect on
 input and output and that the coordinates have not moved in or out
 of the special zone in the meantime.

 As it stands in SHELX, an atom that is near to a twofold will have
 an occupancy of 0.5 whether it is disordered close to a special
 position or whether it is really special, so this is never a problem.

 SHELXL is mainly used for small molecules that frequently have
 atoms on speical positions, and disordered solvent molecules
 approximately on sppecial positions are also very common (for
 example

Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-15 Thread Ian Tickle
 decisions 
  on whether a water is special or not.
 
  For me, this is confusing.
 
  Best regards,
  Herman
 
  -Original Message-
  From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of Ian 
  Tickle
  Sent: Wednesday, December 15, 2010 3:47 PM
  To: CCP4BB@JISCMAIL.AC.UK
  Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms
 
  Dear George
 
  I notice that the Oxford CRYSTALS program, which is what I used when I did 
  small-molecule crystallography and which is still quite popular among the 
  small-molecule people (maybe not as much as Shel-X!), uses the CIF 
  convention:
 
  OCC= This parameter defines the site occupancy EXCLUDING special position 
  effects (i.e. is the 'chemical occupancy'). The default is 1.0.  Special 
  position effects are computed by CRYSTALS and multiplied onto this 
  parameter.
 
  (from http://www.xtl.ox.ac.uk/crystalsmanual-atomic.html )
 
  Also the mmCIF specification on this is the same CIF one (hardly 
  surprising I guess since it's derived from it):
 
  _atom_site.occupancy  The fraction of the atom type present at this site.
  The sum of the occupancies of all the atom types at this site may not 
  significantly exceed 1.0 unless it is a dummy site.
 
  (from 
  http://mmcif.pdb.org/dictionaries/mmcif_std.dic/Items/_atom_site.occupancy.html
  )
 
  which doesn't say so specifically, but it's implied since if the 
  multiplicity is included then the maximum value of the sum is the 
  multiplicity, not 1.0.
 
  So there's a real possibility of user - and programmer - confusion here!  
  I must say that until I looked at the 4INS file I had assumed that the PDB 
  occupancy was what it claimed to be, i.e. the real 'chemical' occupancy 
  not the multiplicity-fudged one.
 
  Cheers
 
  -- Ian
 
  On Wed, Dec 15, 2010 at 1:53 PM, George M. Sheldrick 
  gshe...@shelx.uni-ac.gwdg.de wrote:
 
  Dear Ian,
 
  Yes. Once an atom has been identified as on a special position because
  it is within a specied tolerance, SHELXL applies the appropriate
  contraints to both the coordinates and the Uij so there is no danger
  of the atom wandering off the special position. Usually, when an atom
  it very close to a special position but not actually on it, it is part
  of a disordered solvent molecule and will be prevented from
  misbehaving by distance and Uij restraints imposed by the user; in
  such a case the user usually also switches off the special position
  check for that disordered molecule (SPEC -1) to avoid atoms being
  idealized onto the special position by the program. For solvent
  molecules disordered on special positions it is also necessary to
  ignore symmetry equivalent atoms when generating idealized hydrogen
  atoms etc. (PART -N in SHELXL). This is all routine practice in small
  molecule crystallography. I agree that the use of orthogonal rather
  than crystal coordinates can obscur the situation, e.g. for an atom on a 
  threefold axis.
 
  Best wishes, George
 
  Prof. George M. Sheldrick FRS
  Dept. Structural Chemistry,
  University of Goettingen,
  Tammannstr. 4,
  D37077 Goettingen, Germany
  Tel. +49-551-39-3021 or -3068
  Fax. +49-551-39-22582
 
 
  On Wed, 15 Dec 2010, Ian Tickle wrote:
 
  Dear George
 
  I would say that an atom has fractional occupancy (but unit
  multiplicity) unless it's exactly on the special position (though I
  can foresee problems with rounding of decimal places for an atom say
  at x=1/3), so that effectively once the atom is fixed exactly on the
  s.p. the symmetry copies coalesce into a single atom with unit
  occupancy (but fractional multiplicity).  This is at least one
  advantage of having co-ordinates stored as fractional - it would
  probably be more tricky with orthogonalised co-ordinates.  Presumably
  once an input atom has satisfied the condition of being 'sufficiently
  close' to a s.p. to be considered as 'on' the s.p. then the
  constraints fix the co-ordinates exactly on the special position and
  henceforth it's forcibly prevented from moving off it?  In any case
  if an atom is very close to its symmetry copy you are going to have
  matrix conditioning problems for the co-ordinates perpendicular to
  the axis of symmetry (or mirror plane), so then you have no choice
  but to disallow co-ordinate shifts of the atom which would take it
  off the special position?
 
  Cheers
 
  -- Ian
 
  On Wed, Dec 15, 2010 at 11:42 AM, George M. Sheldrick
  gshe...@shelx.uni-ac.gwdg.de wrote:
  
   Dear Ian,
  
   Of course I could convert the occupancy on reading the atom in and
   convert it back agains on reading it out. This is not quite so
   trivial as it sounds because I need to set a threshold as to how
   close the atom has to be to a special position to be treated as
   special, and take care that rounding errors have the same effect on
   input and output and that the coordinates have not moved in or out
   of the special zone in the meantime.
  
   As it stands

Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-13 Thread Ian Tickle
   Personally I find it disturbing to have the occupancy of B  31
 set to 0.33 and that of D  31 set to 1.00 simply because of an
 insignificant shift in the position of the atom.

 Dale Tronrud

H2O on or near a 3-fold must be disordered because obviously H2O only
has 2-fold symmetry.  It could of course be H3O+ which does have
3-fold symmetry and which therefore could be ordered.  We can
therefore choose between 2 possibilities: either it's ordered _on_ the
3-fold with site occupancy=1 (site multiplicity=1/3), or it's
disordered _off_ the 3-fold with occupancy=1/3 (site multiplicity=1).

I completely agree with you that the deviation of 'B 31' from the
special position is plainly not significant: certainly there is no way
such a small shift could be detected (or indeed refined!), so we may
as well assume that its symmetry does allow it to sit on the s.p..  So
like 'D 31' it should be set exactly on the s.p. and its occupancy set
to 1.  Disorder should be indicated by fractional occupancy only if
the molecular symmetry clearly demands it (and in this case it's not
clear), or the disorder is observable in the map and it's possible to
sensibly refine it as disordered.

The difficulty comes if a refinement program refines an atom near the
3-fold exactly onto the 3-fold, and this atom either must be
disordered because of molecular symmetry as in the H2O case, or the
disorder is observable in the map.  An atom which is just off a 3-fold
would be likely to generate singularities in the matrix if you tried
to refine its x  y co-ordinates anyway (you could of course fix them
manually and not refine them, but then how do you decide what values
to fix them at?).  In that case it would be better to fix the atom
exactly on the 3-fold and set the occupancy to 1, since then it can
still be considered in reality to be the sum of 3 exactly superposed
disordered atoms each with occupancy 1/3, which must however be
treated in the refinement as a single atom with occupancy=1.

Cheers

-- Ian


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-11 Thread Eleanor Dodson
Of course metal ions, So4 etc often lie on special positions - the 
insulin hexamer is generated around Zn atoms on the 3-fold axis.

Eleanor

On 12/09/2010 01:29 PM, Ian Tickle wrote:

Of course it's always possible for an asymmetric molecule (or part of
a molecule, such as a side-chain) to lie on or near a symmetry axis,
provided it's rotationally disordered with occupancy 0.5 (assuming a
2-fold).  In other words half the molecules are randomly distributed
over half of the asymmetric units in one orientation and the other
half are in the other a.u.s in the symmetry-related orientation, so
the copies never clash.  The occupancy must be near 0.5 because if it
deviated much from that you would start to see breakdown of the
symmetry of the diffraction pattern (with higher Rmerge etc), and you
would likely conclude that the space group is actually a sub-group of
the original one without the symmetry axis.  Obviously it will depend
on the quality of the data, the resolution and the scattering power of
the disorder part whether you are able in practice to detect such a
breakdown of symmetry.

The question here though is whether an atom (say the CG of the ASP) of
the rotationally disordered molecule/part-molecule in such a situation
necessarily lies _on_ a special position.  It would be pure
coincidence if it did, for the simple reason that there's absolutely
no reason why it should do so.  In other words, because it has
occupancy 0.5 (obviously it must have the same occupancy as the atoms
that it's covalently bonded to, assuming there's no other disorder
present), it must be disordered and so doesn't have to obey the bulk
symmetry.  In fact, it would be equally 'happy' slightly displaced
from the special position.  There will be no significant minimum in
the internal energy of the system for a disordered atom on a special
position, because the reason it's disordered is that there are no
strong interactions with the surrounding atoms, which would favour one
possible orientation over the other.

This is quite different from the original question posed by Gloria
where (I assume) we have a molecule on a special position where there
is no rotational disorder (it may still have occupancy disorder, i.e.
it may only be present in a fraction of the a.u.s).  Here clearly the
occupancy may be  0.5 for a 2-fold and so no clashes with the
symmetry mate(s) are permitted (assuming of course that the space
group is correct!).  In this case the molecule itself must therefore
possess at least the symmetry of the special position, e.g. H2O or SO4
for a 2-fold, as Ralf says.  Being ordered (or at least not as
disordered) such a molecule must have strong interactions with its
neighbours, so any shift off the special position would likely result
in an increase in internal energy.

Cheers

-- Ian

On Thu, Dec 9, 2010 at 11:26 AM,herman.schreu...@sanofi-aventis.com  wrote:

Hi Ralf  Gloria,

It is of course all a matter of definition, but it happens now and again
that an asymmetric ligand is lying on top of a twofold axis. This is
usually modeled by fitting the ligand in two orientations at half
occupancy. In one of the proteins I am working on there is the
carboxylic acid group of an Asp sitting on a 2-fold axis. I have modeled
the Asp with 2 alternative conformations: in conformation A, the Asp
side chain would clash with itself over the 2-fold axis, thus if one
protein molecule has the Asp in conformation A, the twofold related
protein molecule must have the Asp in conformation B (or some other
conformation).

I do not know whether you would call this a Wyckoff position, but side
chains of proteins do sit on top of crystallographic symmetry axes.

Best,
Herman

-Original Message-
From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of
Ralf W. Grosse-Kunstleve
Sent: Thursday, December 09, 2010 3:47 AM
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

Hi Gloria,

My hobby is space group symmetry.
My interest phenix development.


so I can't imagine a protein  crystallographer would ever need to
apply the modulation function to a  protein atom that happened to be
on one.


That's true. Protein residues don't have internal symmetry, therefore
they are not compatible with crystallographic special positions.
(Wyckoff positions are enumerations of classes of special positions.)

In the PDB molecules with internal symmetry are really rare, except for
H2O and SO4. But these contribute so little to the total scattering that
it isn't important to handle them in a special way. So Wyckoff positions
remain foreign in the macromolecular context.

Ralf




- Original Message 

From: Gloria Borgstahlgborgst...@gmail.com
To: CCP4BB@JISCMAIL.AC.UK
Sent: Wed, December 8, 2010 12:16:54 PM
Subject: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

I've gotten some interesting responses, that I will summarize for the
group  later, but I thought I should clarify why

Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Herman . Schreuder
Dear Ian,
what I see are two minima, symmetrically aranged around the special position. 
One minimum is closer the molecule A, the other closer to the symmetry-related 
molecule B and waters will randomly be either in minimum A, or in minimum B. 
Crystal packing prevents the water from moving the protein molecules such that 
it can make perfect hydrogen bonds with both protein at the same time.

Cheers,
Herman

-Original Message-
From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of Ian Tickle
Sent: Thursday, December 09, 2010 6:36 PM
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

The forces acting on an atom at a special position must have the same symmetry 
on average as the special position, so the atom will be in equilibrium, and 
there will at first sight be no net force (i.e. zero energy gradient) to push 
the atom away from the special position.
However there are 2 solutions to this: metastable equilibrium where the energy 
is a maximum, and stable equilibrium where it's a minimum.
In the metastable case it will be like a pencil balanced on its point where a 
small disturbance, such as from random thermal motion, will be sufficient to 
move it off the special position to an asymmetric position of lower energy thus 
breaking the symmetry, and of course the initial displacement will be 
completely random leading to the disorder you observe.  In the case of stable 
equilibrium the atom will simply respond to the disturbance by executing random 
thermal motion centred on the special position.  Which case you see in practice 
will obviously depend on the precise arrangement of forces acting on the atom.

Cheers

-- Ian

On Thu, Dec 9, 2010 at 4:12 PM,  herman.schreu...@sanofi-aventis.com wrote:
 Even with the famous waters on true Wyckoff positions, I usually 
 observe an elongated or even partly split density, suggesting that the 
 water is disordered, being sometimes closer to one monomer, sometimes 
 closer to the symmetry-related monomer. Since the position of proteins 
 in a crystal is in general not determined by a single water-mediated 
 hydrogen bond, the water will in general not be able to make perfect 
 hydrogen bonds to both symmetry-related monomers at the same time. I 
 think therefore that even waters should generally be considered to be 
 disordered and only in exceptional cases will occupy true Wyckoff 
 positions.

 Best,
 Herman

 -Original Message-
 From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of 
 Ian Tickle
 Sent: Thursday, December 09, 2010 3:35 PM
 To: CCP4BB@JISCMAIL.AC.UK
 Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein 
 atoms

 cases out there (and so far I have heard of a disulfide bond on a 
 2-fold connecting two homodimers).

 I'm slightly puzzled by this example.  If the S-S bond is on the 
 special position, then the rest of the molecule can't have 2-fold 
 symmetry, so would have to be rotationally disordered with occupancy =
 0.5 to avoid clashing with its symmetry mate:

               *
     X -- C  *
             \ *
              S
               |
              S
              * \
              *  C -- X
              *

 where the *'s indicate the 2-fold axis (i.e. vertically in the plane 
 of the page).  In this case, for the reasons I gave in my previous 
 post there's no reason for the disordered S atoms to be exactly on the 
 2-fold; it would be pure coincidence if they were.  If you mean 
 instead that the 2-fold is _perpendicular_ to the S-S bond (i.e.
 coming straight out of the page in the diagram), the molecule does 
 indeed have 2-fold symmetry and can be ordered with occupancy = 1, but 
 then the S atoms are not on special positions, so this would not be an 
 example of protein atoms _on_ a special position.

 One could imagine an example, say where the same side-chain on each 
 monomer is cross-linked (e.g. LYS with glutaraldehyde), forming the
 homodimer:

    X -- C -- N = C -- C -- C -- C -- C = N -- C -- X

 Here the central C atom could be on a 2-fold (i.e. axis perpendicular 
 to the page) special position without rotational disorder.  I've no 
 idea whether such a structure actually exists!

 Cheers

 -- Ian



Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Colin Nave
Does one regard the metal atom in a metalloprotein as being part of the protein?

If so, a shared metal could occupy a special position in a dimer for example. 

In Acta Cryst. (2008). D64, 257-263 Metals in proteins: correlation between 
the metal-ion type, coordination number and the amino-acid residues involved in 
the coordination I. Dokmanic, M. Sikic and S. Tomic ( 
http://scripts.iucr.org/cgi-bin/paper?S090744490706595X ) it says there are 25 
cases of metal atoms in special positions. 

Also Acta Cryst. (2002). D58, 29-38 The 2.6 Å resolution structure of 
Rhodobacter capsulatus bacterioferritin with metal-free dinuclear site and heme 
iron in a crystallographic `special position' D. Cobessi, L.-S. Huang, M. Ban, 
N. G. Pon, F. Daldal and E. A. Berry ( 
http://scripts.iucr.org/cgi-bin/paper?S0907444901017267 ) though the 'special 
position' is justifiably in quotation marks in this example as disorder is 
present.

Colin


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Edward A. Berry

Colin Nave wrote:
..



Also Acta Cryst. (2002). D58, 29-38 The 2.6 Å resolution structure of Rhodobacter 
capsulatus bacterioferritin with metal-free dinuclear site and heme iron in a 
crystallographic `special position' D. Cobessi, L.-S. Huang, M. Ban, N. G. Pon, F. 
Daldal and E. A. Berry ( http://scripts.iucr.org/cgi-bin/paper?S0907444901017267 ) though 
the 'special position' is justifiably in quotation marks in this example as disorder is 
present.

Colin



Yes, this was an example of Ian's rotationally disordered with occupancy 0.5
possibility.

Heme has a pseudo-two-fold axis passing through the plane of the ring,
which is violated only by the position of the methyl and vinyl substituents
on either side: rotating through the pseudo-2-fold superimposes methyl on
vinyl and vice versa on either side. This axis was sitting on a
crystallographic 2-fold.

We did trial refinements in a lower symmetry space group so the
axis became NCS, and we refined with the heme in one orientation or
the other. In either case, difference maps showed positive density
for another atom beyond the methyl, and negative density on CB of
the vinyl. So we concluded the heme was oriented both ways, and
in the average satisfied the symmetry of the higher space group.
Putting it with half occupancy in the asymmetric unit allowed
crystallographic symmetry to generate the rotated half.

Looking at it from the heme's point of view, this makes sense.
Its environment is perfectly symmetrical, so there is no way
for it to choose one orientation over the other and both are
adopted equally.  (The two-fold is intrinsic to the biological unit,
not generated by crystal packing).

I think the heme iron was right on the crystallographic axis.
When refining in the lower space-group with two .5 occupancy
non-interacting hemes, the propionates (which actually satisfy the
covalent symmetry) in the two hemes took on different conformations,
not sure if this was significant or just taking liberties where
freedom is given.

eab


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Ian Tickle
Good point Colin!  2-Zn insulin is of course a classic example of
this, where the two independent Zn2+ ions both sit on the
crystallographic 3-fold in R3.  It doesn't matter whether you count
the metal ion as part of the protein or not: if I understand Gloria's
original question correctly, all that matters is that the atom/ion is
present in the crystal structure.

In fact here are some extracts from the PDB entry (4INS):

REMARK 375 ZNZN B  31  LIES ON A SPECIAL POSITION.
REMARK 375 ZNZN D  31  LIES ON A SPECIAL POSITION.
REMARK 375  HOH B 251  LIES ON A SPECIAL POSITION.
REMARK 375  HOH D  44  LIES ON A SPECIAL POSITION.
REMARK 375  HOH D 134  LIES ON A SPECIAL POSITION.
REMARK 375  HOH D 215  LIES ON A SPECIAL POSITION.
REMARK 375  HOH D 269  LIES ON A SPECIAL POSITION.

HETATM  835 ZNZN B  31  -0.002  -0.004   7.891  0.33 10.40  ZN
HETATM  836 ZNZN D  31   0.000   0.000  -8.039  0.33 11.00  ZN
HETATM  885  O   HOH B 251  -0.023  -0.033  11.206  0.33 21.05   O
etc

Hmmm - but shouldn't the occupancy of the Zn be 1.00 if it's on the
special position (assuming it's not disordered), though the first Zn
above and the water do appear to be disordered since they're not
actually on the special position.  Fractional occupancy always implies
some kind of disorder: occupancy = 1/3 of an atom on a special
position would imply occupancy disorder, i.e. it's randomly present in
only 1/3 of the unit cells.

-- Ian


On Fri, Dec 10, 2010 at 1:11 PM, Colin Nave colin.n...@diamond.ac.uk wrote:
 Does one regard the metal atom in a metalloprotein as being part of the 
 protein?

 If so, a shared metal could occupy a special position in a dimer for example.

 In Acta Cryst. (2008). D64, 257-263 Metals in proteins: correlation between 
 the metal-ion type, coordination number and the amino-acid residues involved 
 in the coordination I. Dokmanic, M. Sikic and S. Tomic ( 
 http://scripts.iucr.org/cgi-bin/paper?S090744490706595X ) it says there are 
 25 cases of metal atoms in special positions.

 Also Acta Cryst. (2002). D58, 29-38 The 2.6 Å resolution structure of 
 Rhodobacter capsulatus bacterioferritin with metal-free dinuclear site and 
 heme iron in a crystallographic `special position' D. Cobessi, L.-S. Huang, 
 M. Ban, N. G. Pon, F. Daldal and E. A. Berry ( 
 http://scripts.iucr.org/cgi-bin/paper?S0907444901017267 ) though the 'special 
 position' is justifiably in quotation marks in this example as disorder is 
 present.

 Colin



Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Dale Tronrud
   The proper occupancy for an atom on a special position depends on
how one defines the meaning of the number in that column.  In the
past, refinement programs, at least I know mine did, simply expanded
all atoms in the coordinate file by the symmetry operators to determine
the contents of the unit cell.  With that operation the occupancy of
the atoms on special positions had to be reduced.  It is certainly
true that there are 1/3 the number of atoms in the unit cell represented
by ZN D  31 than, for example, the CA of residue 50.

   Most modern refinement programs try to handle this automatically,
since users proved unreliable at detecting this condition and modifying
their coordinate files.  They use the interpretation that the site
is fully occupied but there are only 1/3 the number of these sites
than sites at general positions.

   Personally I find it disturbing to have the occupancy of B  31
set to 0.33 and that of D  31 set to 1.00 simply because of an
insignificant shift in the position of the atom.

Dale Tronrud

On 12/10/10 13:53, Ian Tickle wrote:
 Good point Colin!  2-Zn insulin is of course a classic example of
 this, where the two independent Zn2+ ions both sit on the
 crystallographic 3-fold in R3.  It doesn't matter whether you count
 the metal ion as part of the protein or not: if I understand Gloria's
 original question correctly, all that matters is that the atom/ion is
 present in the crystal structure.
 
 In fact here are some extracts from the PDB entry (4INS):
 
 REMARK 375 ZNZN B  31  LIES ON A SPECIAL POSITION.
 REMARK 375 ZNZN D  31  LIES ON A SPECIAL POSITION.
 REMARK 375  HOH B 251  LIES ON A SPECIAL POSITION.
 REMARK 375  HOH D  44  LIES ON A SPECIAL POSITION.
 REMARK 375  HOH D 134  LIES ON A SPECIAL POSITION.
 REMARK 375  HOH D 215  LIES ON A SPECIAL POSITION.
 REMARK 375  HOH D 269  LIES ON A SPECIAL POSITION.
 
 HETATM  835 ZNZN B  31  -0.002  -0.004   7.891  0.33 10.40  ZN
 HETATM  836 ZNZN D  31   0.000   0.000  -8.039  0.33 11.00  ZN
 HETATM  885  O   HOH B 251  -0.023  -0.033  11.206  0.33 21.05   O
 etc
 
 Hmmm - but shouldn't the occupancy of the Zn be 1.00 if it's on the
 special position (assuming it's not disordered), though the first Zn
 above and the water do appear to be disordered since they're not
 actually on the special position.  Fractional occupancy always implies
 some kind of disorder: occupancy = 1/3 of an atom on a special
 position would imply occupancy disorder, i.e. it's randomly present in
 only 1/3 of the unit cells.
 
 -- Ian
 
 
 On Fri, Dec 10, 2010 at 1:11 PM, Colin Nave colin.n...@diamond.ac.uk wrote:
 Does one regard the metal atom in a metalloprotein as being part of the 
 protein?

 If so, a shared metal could occupy a special position in a dimer for example.

 In Acta Cryst. (2008). D64, 257-263 Metals in proteins: correlation between 
 the metal-ion type, coordination number and the amino-acid residues involved 
 in the coordination I. Dokmanic, M. Sikic and S. Tomic ( 
 http://scripts.iucr.org/cgi-bin/paper?S090744490706595X ) it says there are 
 25 cases of metal atoms in special positions.

 Also Acta Cryst. (2002). D58, 29-38 The 2.6 Å resolution structure of 
 Rhodobacter capsulatus bacterioferritin with metal-free dinuclear site and 
 heme iron in a crystallographic `special position' D. Cobessi, L.-S. Huang, 
 M. Ban, N. G. Pon, F. Daldal and E. A. Berry ( 
 http://scripts.iucr.org/cgi-bin/paper?S0907444901017267 ) though the 
 'special position' is justifiably in quotation marks in this example as 
 disorder is present.

 Colin



Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Ian Tickle
No that surely can't be right.  Application of a symmetry operator to
a point on a special position which is unchanged by the operator
doesn't generate a symmetry copy of the point, because there is no
symmetry copy of such a point!  For general Wyckoff positions it does,
for sure.  If you look at the entry for R3 (hexagonal or rhombohedral
setting, it doesn't matter which) on the Wyckoff position website I
indicated earlier, you'll see a column labelled 'multiplicity' which
is 3 for the general position and 1 for the special position.  This
means that the symmetry operator generates 3 symmetry copies of a
general position (including the original), but only 1 copy (i.e. only
the original) of the special positions on the 3-fold.

That's precisely why such points are called special positions - i.e.
you have to treat them specially!  If what you describe is what the
refinement program or whatever is doing then it would imply there's a
programming error, i.e. the program is not treating special positions
as special, it's treating them instead as general positions.  As I
said whenever you see fractional occupancy reported it should imply
some kind of disorder, i.e. the atom exists on that site in only the
indicated fraction of the unit cells in the lattice.

-- Ian

On Fri, Dec 10, 2010 at 10:02 PM, Sue Roberts s...@email.arizona.edu wrote:
 Hi Ian

 No disorder is involved.

 The occupancy of an (fully occupied) atom on an n-fold rotation axis is 1/n
 If a two-fold, 1/2
 If a three-fold, 1/3

 When you sum over all the atoms in the unit cell, application of the symmetry 
 operations to atoms lying on the rotation axis generates atoms with unchanged 
 coordinates. Hence to generate a fully occupied atom on a n-fold symmetry 
 axis, the original occupancy has to be 1/n.

 Sue

 On Dec 10, 2010, at 2:53 PM, Ian Tickle wrote:

 Good point Colin!  2-Zn insulin is of course a classic example of
 this, where the two independent Zn2+ ions both sit on the
 crystallographic 3-fold in R3.  It doesn't matter whether you count
 the metal ion as part of the protein or not: if I understand Gloria's
 original question correctly, all that matters is that the atom/ion is
 present in the crystal structure.

 In fact here are some extracts from the PDB entry (4INS):

 REMARK 375 ZN    ZN B  31  LIES ON A SPECIAL POSITION.
 REMARK 375 ZN    ZN D  31  LIES ON A SPECIAL POSITION.
 REMARK 375      HOH B 251  LIES ON A SPECIAL POSITION.
 REMARK 375      HOH D  44  LIES ON A SPECIAL POSITION.
 REMARK 375      HOH D 134  LIES ON A SPECIAL POSITION.
 REMARK 375      HOH D 215  LIES ON A SPECIAL POSITION.
 REMARK 375      HOH D 269  LIES ON A SPECIAL POSITION.

 HETATM  835 ZN    ZN B  31      -0.002  -0.004   7.891  0.33 10.40          
 ZN
 HETATM  836 ZN    ZN D  31       0.000   0.000  -8.039  0.33 11.00          
 ZN
 HETATM  885  O   HOH B 251      -0.023  -0.033  11.206  0.33 21.05           
 O
 etc

 Hmmm - but shouldn't the occupancy of the Zn be 1.00 if it's on the
 special position (assuming it's not disordered), though the first Zn
 above and the water do appear to be disordered since they're not
 actually on the special position.  Fractional occupancy always implies
 some kind of disorder: occupancy = 1/3 of an atom on a special
 position would imply occupancy disorder, i.e. it's randomly present in
 only 1/3 of the unit cells.

 -- Ian


 On Fri, Dec 10, 2010 at 1:11 PM, Colin Nave colin.n...@diamond.ac.uk wrote:
 Does one regard the metal atom in a metalloprotein as being part of the 
 protein?

 If so, a shared metal could occupy a special position in a dimer for 
 example.

 In Acta Cryst. (2008). D64, 257-263 Metals in proteins: correlation 
 between the metal-ion type, coordination number and the amino-acid residues 
 involved in the coordination I. Dokmanic, M. Sikic and S. Tomic ( 
 http://scripts.iucr.org/cgi-bin/paper?S090744490706595X ) it says there are 
 25 cases of metal atoms in special positions.

 Also Acta Cryst. (2002). D58, 29-38 The 2.6 Å resolution structure of 
 Rhodobacter capsulatus bacterioferritin with metal-free dinuclear site and 
 heme iron in a crystallographic `special position' D. Cobessi, L.-S. 
 Huang, M. Ban, N. G. Pon, F. Daldal and E. A. Berry ( 
 http://scripts.iucr.org/cgi-bin/paper?S0907444901017267 ) though the 
 'special position' is justifiably in quotation marks in this example as 
 disorder is present.

 Colin



 Dr. Sue A. Roberts
 Dept. of Chemistry and Biochemistry
 University of Arizona
 1041 E. Lowell St.,  Tucson, AZ 85721
 Phone: 520 621 8171
 s...@email.arizona.edu
 http://www.biochem.arizona.edu/xray












Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Ed Pozharski
On Fri, 2010-12-10 at 22:40 +, Ian Tickle wrote:
 Application of a symmetry operator to
 a point on a special position which is unchanged by the operator
 doesn't generate a symmetry copy of the point, because there is no
 symmetry copy of such a point! 

Why not?  Symmetry-related copy may be considered just that - copy
produced by a symmetry operator.  What advantage do you gain by
restricting it to only those that result in physically different point?

There is one excellent point that you made in some past exchanges - that
deposited structures are mathematical models and they do not always have
to make strict physical sense.  This seems to be just such case - one
does not have three zincs in that spot with each being gone two thirds
of the time, but making the model physically meaningful would require
perhaps dropping to P1 and implementing NCS to enforce crystal symmetry.
So compromising on the physical meaning of the atom record seems like a
small price to pay.  This feels odd - I seem to be arguing your
side :)

Cheers,

Ed.

 

-- 
I'd jump in myself, if I weren't so good at whistling.
   Julian, King of Lemurs


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Boaz Shaanan
An excerpt from the Shelx manual (which I think is a good reference for proper 
handling of atoms on special positions in refinement of small or large 
molecules): 

4.3 Special position constraints
Constraints for the coordinates and 
anisotropic displacement
parameters for atoms on special positions are generated automatically by
the program for ALL special positions in ALL space groups, in 
conventional
settings or otherwise. For upwards compatibility with SHELX-76, free 
variables
may still be used for this, but it is better to leave it to the program.
If the occupancy is not input, the program will fix it at the 
appropriate
value for a special position. If the user applies (correct or incorrect)
special position constraints using free variables etc., the program 
assumes
this has been done with intent and reports but does not apply the 
correct
constraints; accidental application of wrong special position 
constraints
is one of the easiest ways to cause a refinement to 'blow up' !


Also, please check the book of Shmueli and Weiss, Introduction to 
crystallographic statistics p. 5, last paragraph  and the next page where a 
rigorous treatment of of contribution to the structure factor of atoms in 
general and special positions in the unit cell is given (eq. 1.2.10).

Thus, occupancy of an atom on a 3-fold would be 1/3, 1/2 on a 2-fold, 
1/4 on a 4-fold and so forth.  

 Cheers,

   Boaz


- Original Message -
From: Ian Tickle ianj...@gmail.com
Date: Saturday, December 11, 2010 0:41
Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms
To: CCP4BB@JISCMAIL.AC.UK

 No that surely can't be right.  Application of a symmetry 
 operator to
 a point on a special position which is unchanged by the operator
 doesn't generate a symmetry copy of the point, because there is no
 symmetry copy of such a point!  For general Wyckoff 
 positions it does,
 for sure.  If you look at the entry for R3 (hexagonal or 
 rhombohedralsetting, it doesn't matter which) on the Wyckoff 
 position website I
 indicated earlier, you'll see a column labelled 'multiplicity' which
 is 3 for the general position and 1 for the special 
 position.  This
 means that the symmetry operator generates 3 symmetry copies of a
 general position (including the original), but only 1 copy (i.e. only
 the original) of the special positions on the 3-fold.
 
 That's precisely why such points are called special positions -
 i.e.
 you have to treat them specially!  If what you describe is 
 what the
 refinement program or whatever is doing then it would imply 
 there's a
 programming error, i.e. the program is not treating special positions
 as special, it's treating them instead as general 
 positions.  As I
 said whenever you see fractional occupancy reported it should imply
 some kind of disorder, i.e. the atom exists on that site in only the
 indicated fraction of the unit cells in the lattice.
 
 -- Ian
 
 On Fri, Dec 10, 2010 at 10:02 PM, Sue Roberts 
 s...@email.arizona.edu wrote:
  Hi Ian
 
  No disorder is involved.
 
  The occupancy of an (fully occupied) atom on an n-fold 
 rotation axis is 1/n
  If a two-fold, 1/2
  If a three-fold, 1/3
 
  When you sum over all the atoms in the unit cell, application 
 of the symmetry operations to atoms lying on the rotation axis 
 generates atoms with unchanged coordinates. Hence to generate a 
 fully occupied atom on a n-fold symmetry axis, the original 
 occupancy has to be 1/n.
 
  Sue
 
  On Dec 10, 2010, at 2:53 PM, Ian Tickle wrote:
 
  Good point Colin!  2-Zn insulin is of course a classic 
 example of
  this, where the two independent Zn2+ ions both sit on the
  crystallographic 3-fold in R3.  It doesn't matter whether you count
  the metal ion as part of the protein or not: if I understand 
 Gloria's original question correctly, all that matters is that 
 the atom/ion is
  present in the crystal structure.
 
  In fact here are some extracts from the PDB entry (4INS):
 
  REMARK 375 ZN    ZN B  31  LIES ON A SPECIAL POSITION.
  REMARK 375 ZN    ZN D  31  LIES ON A SPECIAL POSITION.
  REMARK 375      HOH B 251  LIES ON A SPECIAL POSITION.
  REMARK 375      HOH D  44  LIES ON A SPECIAL POSITION.
  REMARK 375      HOH D 134  LIES ON A SPECIAL POSITION.
  REMARK 375      HOH D 215  LIES ON A SPECIAL POSITION.
  REMARK 375      HOH D 269  LIES ON A SPECIAL POSITION.
 
  HETATM  835 ZN    ZN B  31      -0.002  -0.004   7.891  0.33 
 10.40          ZN
  HETATM  836 ZN    ZN D  31       0.000   0.000  -8.039  0.33 
 11.00          ZN
  HETATM  885  O   HOH B 251      -0.023  -0.033  11.206  0.33 
 21.05           O
  etc
 
  Hmmm - but shouldn't the occupancy of the Zn be 1.00 if it's 
 on the
  special position (assuming it's not disordered), though the 
 first Zn
  above and the water do appear to be disordered since they're not
  actually on the special position.  Fractional occupancy 
 always implies
  some kind of disorder: occupancy = 1/3 of an atom on a special

Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread George M. Sheldrick
SHELXL also expects that the occupancy of a fully occupied atom on a 
threefold axis should be set at 1/3, and will generate this automatically 
if necessary. It will also generate automatically the necessary 
constraints for the x, y and z parameters (and for the Uij if the atom is
anisotropic). It is essential that this is done correctly if a full-matrix
refinement is being performed (e.g. to get esd estimates), otherwise
the refinement can explode. The user may change or switch off the 
tolerance for detecting whether an atom is on a special position 
(with the SPEC instruction). Setting the occupancy to a fraction avoided
a complicated IF construction inside a loop and 35 years ago computers 
were so slow! I can't change it now because I have to preserve upwards
compatibility. Unfortunately the CIF committee decided to use the other
definition (i.e. the Zn on the threefold axis has an occupancy of 1.0)
and this has caused considerable confusion in the small molecule world 
ever since; atoms are frequently encountered on special positions in
inorganic and mineral structures.

George

Prof. George M. Sheldrick FRS
Dept. Structural Chemistry,
University of Goettingen,
Tammannstr. 4,
D37077 Goettingen, Germany
Tel. +49-551-39-3021 or -3068
Fax. +49-551-39-22582


On Fri, 10 Dec 2010, Ed Pozharski wrote:

 On Fri, 2010-12-10 at 21:53 +, Ian Tickle wrote:
  Hmmm - but shouldn't the occupancy of the Zn be 1.00 if it's on the
  special position 
 
 Shouldn't 1/3 be better for programming purposes?  If you set occupancy
 to 1.0, then you should specify that symmetry operators do not apply for
 these atoms, making Fc calculation a bit more cumbersome.
 
 If definition of the asu content is you get full content of the unit
 cell after applying symmetry operators, then occupancy *must* be 1/3,
 right?
 
 The first zinc and the water are on special position, but because they
 are not excluded from positional refinement (perhaps they should be),
 they will drift a bit.  CNS has distance cutoff for treating atoms as
 special positions, if it jumps over the limit during, say, simulated
 annealing, it  will cause problems.  Perhaps PROLSQ did something
 similar.  It is a good question if it's better to fix these in place or
 let them wobble a bit to account for some potential disorder.  While I
 see the formal argument that it should be nailed to three-fold axes, it
 is also true that this is a mathematical compromise to simplify modeling
 that does not reflect physical reality (i.e. you don't have three
 partially occupied zinc ions, it's just one).  In any event, given that
 this is a 1.5A structure, (-0.002 0.004) is statistically speaking the
 same as (0 0).
 
 Cheers,
 
 Ed. 
 
 -- 
 I'd jump in myself, if I weren't so good at whistling.
Julian, King of Lemurs
 
 


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Pavel Afonine
  No disorder is involved.
 
  The occupancy of an (fully occupied) atom on an n-fold rotation axis is
 1/n
  If a two-fold, 1/2
  If a three-fold, 1/3
 
  When you sum over all the atoms in the unit cell, application of the
 symmetry operations to atoms lying on the rotation axis generates atoms with
 unchanged coordinates. Hence to generate a fully occupied atom on a n-fold
 symmetry axis, the original occupancy has to be 1/n.
 



I guess I have a few small points to add:

1) If a site on special position is not fully occupied, then the occupancy
is q/n, where q is actual occupancy (for fully occupied site q=1). Since
ATOM record in PDB file does not (directly) tell you whether the atom is on
special position or not then it is not straightforward to know whether the
occupancy of atom in question is not 1 due to disorder or due to symmetry.

2) Let's consider a hypothetical case... The maximal multiplicity of a
spacial position in proteins is 24 (is that right? please correct if not),
and the precision of the occupancy field in PDB file is only two digits,
like: 1.00. Imagine you have a fair amount of very heavy atoms all seating
at special positions with the multiplicity 24, and your structure is
relatively small. Then what you will be forced to put in the PDB file is:
1/24 ~ 0.04 and not 0.042. I'm not sure if this rounding error is
significant or not (and what is significant), but just to keep in mind
another source of mismatch between reported and recomputed R-factors.

3) If I add an atom onto special position using Coot, will it set the
occupancy as 1/n automatically or do I have to to edit the PDB file myself
(and remember to do so)? (another room for an error)

So, having said this, I tend to think that using occupancy 1 (and not 1/n)
is a good thing.

Pavel.


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-09 Thread Herman . Schreuder
Hi Ralf  Gloria,

It is of course all a matter of definition, but it happens now and again
that an asymmetric ligand is lying on top of a twofold axis. This is
usually modeled by fitting the ligand in two orientations at half
occupancy. In one of the proteins I am working on there is the
carboxylic acid group of an Asp sitting on a 2-fold axis. I have modeled
the Asp with 2 alternative conformations: in conformation A, the Asp
side chain would clash with itself over the 2-fold axis, thus if one
protein molecule has the Asp in conformation A, the twofold related
protein molecule must have the Asp in conformation B (or some other
conformation). 

I do not know whether you would call this a Wyckoff position, but side
chains of proteins do sit on top of crystallographic symmetry axes.

Best,
Herman

-Original Message-
From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of
Ralf W. Grosse-Kunstleve
Sent: Thursday, December 09, 2010 3:47 AM
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

Hi Gloria,

My hobby is space group symmetry.
My interest phenix development.

 so I can't imagine a protein  crystallographer would ever need to 
 apply the modulation function to a  protein atom that happened to be 
 on one.

That's true. Protein residues don't have internal symmetry, therefore
they are not compatible with crystallographic special positions.
(Wyckoff positions are enumerations of classes of special positions.)

In the PDB molecules with internal symmetry are really rare, except for
H2O and SO4. But these contribute so little to the total scattering that
it isn't important to handle them in a special way. So Wyckoff positions
remain foreign in the macromolecular context.

Ralf
 



- Original Message 
 From: Gloria Borgstahl gborgst...@gmail.com
 To: CCP4BB@JISCMAIL.AC.UK
 Sent: Wed, December 8, 2010 12:16:54 PM
 Subject: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms
 
 I've gotten some interesting responses, that I will summarize for the 
 group  later, but I thought I should clarify why I asked.
 
 I was worrying about  this because I have been working out the steps 
 in how to determine the (3+1)D  superspace group for a protein
crystal.
 The last step listed in IT vol C  chapter 9.8, is to consider any 
 atoms that lie ON a Wyckoff position, and  what restrictions this 
 would apply to the modulation function that is refined  for each atom.
 
 My first reaction, was Wyckoff positions?  I  vaguely remember 
 those, my recollection from my experience was they were  really cool, 
 but were usually in the solvent, so I can't imagine a protein  
 crystallographer would ever need to apply the modulation function to a

 protein atom that happened to be on one.  But to a crystallographer  
 working on a modulated mineral, it would happen all the time, I'll
bet.
 So  maybe this was one more thing that just didn't really apply to 
 protein  structures and lucky us we don't worry about this last step 
 (just as I never  did model that solvent water that was on one, back 
 in the 90s).
 
 Then I  thought, maybe I'm missing something, or there are special 
 cases out there  (and so far I have heard of a disulfide bond on a 
 2-fold connecting two  homodimers).
 
 So I polled the collective knowledge of the great ccp4bb  group.
 
 On Wed, Dec 8, 2010 at 10:57 AM, Gloria Borgstahl 
 gborgst...@gmail.com
wrote:
   My fellow crystallographers,
  I wanted to take a poll.
 
   How many of you have ever had a protein atom on a Wyckoff position 
  (AKA  a special position).
  What kind of molecules have you found at special  positions (it 
  would have to contain the symmetry of the special  position, right?)

  I'm thinking it is impossible to have a protein atom  at a special 
  position or am I exposing my ignorance yet  again...
 
  my experience is that only once I found an atom in a  special 
  position, it was a strange solvent molecule, that blew  my mind for 
  a while until I learned about special positions in  crystallography.
 
  Looking forward to your responses,  Gloria
 
   
  
  
   Gloria Borgstahl
  Eppley Institute for Cancer Research and Allied  Diseases
  987696 Nebraska Medical Center
  10732A Lied Transplant  Center
  Omaha, NE 68198-7696
 
  http://sbl.unmc.edu
   Office (402) 559-8578
  FAX (402) 559-3739
 
   Professor
  Hobbies:  Protein Crystallography, Cancer,  Biochemistry, DNA 
  Metabolism, Modulated Crystals,  Crystal  Perfection
  Interests:  Manga, Led Zepplin, Cold Play, piano,  BRAN, RAGBRAI, 
  golf and lately superspace groups
   
  
  
 
 


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-09 Thread Ian Tickle
Of course it's always possible for an asymmetric molecule (or part of
a molecule, such as a side-chain) to lie on or near a symmetry axis,
provided it's rotationally disordered with occupancy 0.5 (assuming a
2-fold).  In other words half the molecules are randomly distributed
over half of the asymmetric units in one orientation and the other
half are in the other a.u.s in the symmetry-related orientation, so
the copies never clash.  The occupancy must be near 0.5 because if it
deviated much from that you would start to see breakdown of the
symmetry of the diffraction pattern (with higher Rmerge etc), and you
would likely conclude that the space group is actually a sub-group of
the original one without the symmetry axis.  Obviously it will depend
on the quality of the data, the resolution and the scattering power of
the disorder part whether you are able in practice to detect such a
breakdown of symmetry.

The question here though is whether an atom (say the CG of the ASP) of
the rotationally disordered molecule/part-molecule in such a situation
necessarily lies _on_ a special position.  It would be pure
coincidence if it did, for the simple reason that there's absolutely
no reason why it should do so.  In other words, because it has
occupancy 0.5 (obviously it must have the same occupancy as the atoms
that it's covalently bonded to, assuming there's no other disorder
present), it must be disordered and so doesn't have to obey the bulk
symmetry.  In fact, it would be equally 'happy' slightly displaced
from the special position.  There will be no significant minimum in
the internal energy of the system for a disordered atom on a special
position, because the reason it's disordered is that there are no
strong interactions with the surrounding atoms, which would favour one
possible orientation over the other.

This is quite different from the original question posed by Gloria
where (I assume) we have a molecule on a special position where there
is no rotational disorder (it may still have occupancy disorder, i.e.
it may only be present in a fraction of the a.u.s).  Here clearly the
occupancy may be  0.5 for a 2-fold and so no clashes with the
symmetry mate(s) are permitted (assuming of course that the space
group is correct!).  In this case the molecule itself must therefore
possess at least the symmetry of the special position, e.g. H2O or SO4
for a 2-fold, as Ralf says.  Being ordered (or at least not as
disordered) such a molecule must have strong interactions with its
neighbours, so any shift off the special position would likely result
in an increase in internal energy.

Cheers

-- Ian

On Thu, Dec 9, 2010 at 11:26 AM,  herman.schreu...@sanofi-aventis.com wrote:
 Hi Ralf  Gloria,

 It is of course all a matter of definition, but it happens now and again
 that an asymmetric ligand is lying on top of a twofold axis. This is
 usually modeled by fitting the ligand in two orientations at half
 occupancy. In one of the proteins I am working on there is the
 carboxylic acid group of an Asp sitting on a 2-fold axis. I have modeled
 the Asp with 2 alternative conformations: in conformation A, the Asp
 side chain would clash with itself over the 2-fold axis, thus if one
 protein molecule has the Asp in conformation A, the twofold related
 protein molecule must have the Asp in conformation B (or some other
 conformation).

 I do not know whether you would call this a Wyckoff position, but side
 chains of proteins do sit on top of crystallographic symmetry axes.

 Best,
 Herman

 -Original Message-
 From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of
 Ralf W. Grosse-Kunstleve
 Sent: Thursday, December 09, 2010 3:47 AM
 To: CCP4BB@JISCMAIL.AC.UK
 Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

 Hi Gloria,

 My hobby is space group symmetry.
 My interest phenix development.

 so I can't imagine a protein  crystallographer would ever need to
 apply the modulation function to a  protein atom that happened to be
 on one.

 That's true. Protein residues don't have internal symmetry, therefore
 they are not compatible with crystallographic special positions.
 (Wyckoff positions are enumerations of classes of special positions.)

 In the PDB molecules with internal symmetry are really rare, except for
 H2O and SO4. But these contribute so little to the total scattering that
 it isn't important to handle them in a special way. So Wyckoff positions
 remain foreign in the macromolecular context.

 Ralf




 - Original Message 
 From: Gloria Borgstahl gborgst...@gmail.com
 To: CCP4BB@JISCMAIL.AC.UK
 Sent: Wed, December 8, 2010 12:16:54 PM
 Subject: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

 I've gotten some interesting responses, that I will summarize for the
 group  later, but I thought I should clarify why I asked.

 I was worrying about  this because I have been working out the steps
 in how to determine the (3+1)D  superspace group for a protein

Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-09 Thread Ian Tickle
 cases out there (and so far I have heard of a disulfide bond on a
 2-fold connecting two homodimers).

I'm slightly puzzled by this example.  If the S-S bond is on the
special position, then the rest of the molecule can't have 2-fold
symmetry, so would have to be rotationally disordered with occupancy =
0.5 to avoid clashing with its symmetry mate:

   *
 X -- C  *
 \ *
  S
   |
  S
  * \
  *  C -- X
  *

where the *'s indicate the 2-fold axis (i.e. vertically in the plane
of the page).  In this case, for the reasons I gave in my previous
post there's no reason for the disordered S atoms to be exactly on the
2-fold; it would be pure coincidence if they were.  If you mean
instead that the 2-fold is _perpendicular_ to the S-S bond (i.e.
coming straight out of the page in the diagram), the molecule does
indeed have 2-fold symmetry and can be ordered with occupancy = 1, but
then the S atoms are not on special positions, so this would not be an
example of protein atoms _on_ a special position.

One could imagine an example, say where the same side-chain on each
monomer is cross-linked (e.g. LYS with glutaraldehyde), forming the
homodimer:

X -- C -- N = C -- C -- C -- C -- C = N -- C -- X

Here the central C atom could be on a 2-fold (i.e. axis perpendicular
to the page) special position without rotational disorder.  I've no
idea whether such a structure actually exists!

Cheers

-- Ian


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-09 Thread Herman . Schreuder
Even with the famous waters on true Wyckoff positions, I usually
observe an elongated or even partly split density, suggesting that the
water is disordered, being sometimes closer to one monomer, sometimes
closer to the symmetry-related monomer. Since the position of proteins
in a crystal is in general not determined by a single water-mediated
hydrogen bond, the water will in general not be able to make perfect
hydrogen bonds to both symmetry-related monomers at the same time. I
think therefore that even waters should generally be considered to be
disordered and only in exceptional cases will occupy true Wyckoff
positions.

Best,
Herman 

-Original Message-
From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of
Ian Tickle
Sent: Thursday, December 09, 2010 3:35 PM
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

 cases out there (and so far I have heard of a disulfide bond on a 
 2-fold connecting two homodimers).

I'm slightly puzzled by this example.  If the S-S bond is on the special
position, then the rest of the molecule can't have 2-fold symmetry, so
would have to be rotationally disordered with occupancy =
0.5 to avoid clashing with its symmetry mate:

   *
 X -- C  *
 \ *
  S
   |
  S
  * \
  *  C -- X
  *

where the *'s indicate the 2-fold axis (i.e. vertically in the plane of
the page).  In this case, for the reasons I gave in my previous post
there's no reason for the disordered S atoms to be exactly on the
2-fold; it would be pure coincidence if they were.  If you mean instead
that the 2-fold is _perpendicular_ to the S-S bond (i.e.
coming straight out of the page in the diagram), the molecule does
indeed have 2-fold symmetry and can be ordered with occupancy = 1, but
then the S atoms are not on special positions, so this would not be an
example of protein atoms _on_ a special position.

One could imagine an example, say where the same side-chain on each
monomer is cross-linked (e.g. LYS with glutaraldehyde), forming the
homodimer:

X -- C -- N = C -- C -- C -- C -- C = N -- C -- X

Here the central C atom could be on a 2-fold (i.e. axis perpendicular to
the page) special position without rotational disorder.  I've no idea
whether such a structure actually exists!

Cheers

-- Ian


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-09 Thread Ian Tickle
The forces acting on an atom at a special position must have the same
symmetry on average as the special position, so the atom will be in
equilibrium, and there will at first sight be no net force (i.e. zero
energy gradient) to push the atom away from the special position.
However there are 2 solutions to this: metastable equilibrium where
the energy is a maximum, and stable equilibrium where it's a minimum.
In the metastable case it will be like a pencil balanced on its point
where a small disturbance, such as from random thermal motion, will be
sufficient to move it off the special position to an asymmetric
position of lower energy thus breaking the symmetry, and of course the
initial displacement will be completely random leading to the disorder
you observe.  In the case of stable equilibrium the atom will simply
respond to the disturbance by executing random thermal motion centred
on the special position.  Which case you see in practice will
obviously depend on the precise arrangement of forces acting on the
atom.

Cheers

-- Ian

On Thu, Dec 9, 2010 at 4:12 PM,  herman.schreu...@sanofi-aventis.com wrote:
 Even with the famous waters on true Wyckoff positions, I usually
 observe an elongated or even partly split density, suggesting that the
 water is disordered, being sometimes closer to one monomer, sometimes
 closer to the symmetry-related monomer. Since the position of proteins
 in a crystal is in general not determined by a single water-mediated
 hydrogen bond, the water will in general not be able to make perfect
 hydrogen bonds to both symmetry-related monomers at the same time. I
 think therefore that even waters should generally be considered to be
 disordered and only in exceptional cases will occupy true Wyckoff
 positions.

 Best,
 Herman

 -Original Message-
 From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of
 Ian Tickle
 Sent: Thursday, December 09, 2010 3:35 PM
 To: CCP4BB@JISCMAIL.AC.UK
 Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

 cases out there (and so far I have heard of a disulfide bond on a
 2-fold connecting two homodimers).

 I'm slightly puzzled by this example.  If the S-S bond is on the special
 position, then the rest of the molecule can't have 2-fold symmetry, so
 would have to be rotationally disordered with occupancy =
 0.5 to avoid clashing with its symmetry mate:

               *
     X -- C  *
             \ *
              S
               |
              S
              * \
              *  C -- X
              *

 where the *'s indicate the 2-fold axis (i.e. vertically in the plane of
 the page).  In this case, for the reasons I gave in my previous post
 there's no reason for the disordered S atoms to be exactly on the
 2-fold; it would be pure coincidence if they were.  If you mean instead
 that the 2-fold is _perpendicular_ to the S-S bond (i.e.
 coming straight out of the page in the diagram), the molecule does
 indeed have 2-fold symmetry and can be ordered with occupancy = 1, but
 then the S atoms are not on special positions, so this would not be an
 example of protein atoms _on_ a special position.

 One could imagine an example, say where the same side-chain on each
 monomer is cross-linked (e.g. LYS with glutaraldehyde), forming the
 homodimer:

    X -- C -- N = C -- C -- C -- C -- C = N -- C -- X

 Here the central C atom could be on a 2-fold (i.e. axis perpendicular to
 the page) special position without rotational disorder.  I've no idea
 whether such a structure actually exists!

 Cheers

 -- Ian



[ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-08 Thread Gloria Borgstahl
I've gotten some interesting responses, that I will summarize for the
group later, but I thought I should clarify why I asked.

I was worrying about this because I have been working out the steps in
how to determine the (3+1)D superspace group for a protein crystal.
The last step listed in IT vol C chapter 9.8, is to consider any atoms
that lie ON a Wyckoff position, and what restrictions this would apply
to the modulation function that is refined for each atom.

My first reaction, was Wyckoff positions?  I vaguely remember those,
my recollection from my experience was they were really cool, but were
usually in the solvent, so I can't imagine a protein crystallographer
would ever need to apply the modulation function to a protein atom
that happened to be on one.  But to a crystallographer working on a
modulated mineral, it would happen all the time, I'll bet.
So maybe this was one more thing that just didn't really apply to
protein structures and lucky us
we don't worry about this last step (just as I never did model that
solvent water that was on one, back in the 90s).

Then I thought, maybe I'm missing something, or there are special
cases out there (and so far I have heard of a disulfide bond on a
2-fold connecting two homodimers).

So I polled the collective knowledge of the great ccp4bb group.

On Wed, Dec 8, 2010 at 10:57 AM, Gloria Borgstahl gborgst...@gmail.com wrote:
 My fellow crystallographers,
 I wanted to take a poll.

 How many of you have ever had a protein atom on a Wyckoff position
 (AKA a special position).
 What kind of molecules have you found at special positions (it would
 have to contain the symmetry of the special position, right?)
 I'm thinking it is impossible to have a protein atom at a special position
 or am I exposing my ignorance yet again...

 my experience is that only once I found an atom in a special position,
 it was a strange solvent molecule,
 that blew my mind for a while until I learned about special positions
 in crystallography.

 Looking forward to your responses, Gloria

 
 Gloria Borgstahl
 Eppley Institute for Cancer Research and Allied Diseases
 987696 Nebraska Medical Center
 10732A Lied Transplant Center
 Omaha, NE 68198-7696

 http://sbl.unmc.edu
 Office (402) 559-8578
 FAX (402) 559-3739

 Professor
 Hobbies:  Protein Crystallography, Cancer, Biochemistry,
 DNA Metabolism, Modulated Crystals,  Crystal Perfection
 Interests:  Manga, Led Zepplin, Cold Play, piano, BRAN,
 RAGBRAI, golf and lately superspace groups
 



Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-08 Thread Ralf W. Grosse-Kunstleve
Hi Gloria,

My hobby is space group symmetry.
My interest phenix development.

 so I can't imagine a protein  crystallographer
 would ever need to apply the modulation function to a  protein atom
 that happened to be on one.

That's true. Protein residues don't have internal symmetry, therefore
they are not compatible with crystallographic special positions. (Wyckoff
positions are enumerations of classes of special positions.)

In the PDB molecules with internal symmetry are really rare, except
for H2O and SO4. But these contribute so little to the total scattering
that it isn't important to handle them in a special way. So Wyckoff
positions remain foreign in the macromolecular context.

Ralf
 



- Original Message 
 From: Gloria Borgstahl gborgst...@gmail.com
 To: CCP4BB@JISCMAIL.AC.UK
 Sent: Wed, December 8, 2010 12:16:54 PM
 Subject: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms
 
 I've gotten some interesting responses, that I will summarize for the
 group  later, but I thought I should clarify why I asked.
 
 I was worrying about  this because I have been working out the steps in
 how to determine the (3+1)D  superspace group for a protein crystal.
 The last step listed in IT vol C  chapter 9.8, is to consider any atoms
 that lie ON a Wyckoff position, and  what restrictions this would apply
 to the modulation function that is refined  for each atom.
 
 My first reaction, was Wyckoff positions?  I  vaguely remember those,
 my recollection from my experience was they were  really cool, but were
 usually in the solvent, so I can't imagine a protein  crystallographer
 would ever need to apply the modulation function to a  protein atom
 that happened to be on one.  But to a crystallographer  working on a
 modulated mineral, it would happen all the time, I'll bet.
 So  maybe this was one more thing that just didn't really apply to
 protein  structures and lucky us
 we don't worry about this last step (just as I never  did model that
 solvent water that was on one, back in the 90s).
 
 Then I  thought, maybe I'm missing something, or there are special
 cases out there  (and so far I have heard of a disulfide bond on a
 2-fold connecting two  homodimers).
 
 So I polled the collective knowledge of the great ccp4bb  group.
 
 On Wed, Dec 8, 2010 at 10:57 AM, Gloria Borgstahl gborgst...@gmail.com 
wrote:
   My fellow crystallographers,
  I wanted to take a poll.
 
   How many of you have ever had a protein atom on a Wyckoff position
  (AKA  a special position).
  What kind of molecules have you found at special  positions (it would
  have to contain the symmetry of the special  position, right?)
  I'm thinking it is impossible to have a protein atom  at a special position
  or am I exposing my ignorance yet  again...
 
  my experience is that only once I found an atom in a  special position,
  it was a strange solvent molecule,
  that blew  my mind for a while until I learned about special positions
  in  crystallography.
 
  Looking forward to your responses,  Gloria
 
   
   Gloria Borgstahl
  Eppley Institute for Cancer Research and Allied  Diseases
  987696 Nebraska Medical Center
  10732A Lied Transplant  Center
  Omaha, NE 68198-7696
 
  http://sbl.unmc.edu
   Office (402) 559-8578
  FAX (402) 559-3739
 
   Professor
  Hobbies:  Protein Crystallography, Cancer,  Biochemistry,
  DNA Metabolism, Modulated Crystals,  Crystal  Perfection
  Interests:  Manga, Led Zepplin, Cold Play, piano,  BRAN,
  RAGBRAI, golf and lately superspace groups