Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Herman . Schreuder
Dear Ian,
what I see are two minima, symmetrically aranged around the special position. 
One minimum is closer the molecule A, the other closer to the symmetry-related 
molecule B and waters will randomly be either in minimum A, or in minimum B. 
Crystal packing prevents the water from moving the protein molecules such that 
it can make perfect hydrogen bonds with both protein at the same time.

Cheers,
Herman

-Original Message-
From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of Ian Tickle
Sent: Thursday, December 09, 2010 6:36 PM
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

The forces acting on an atom at a special position must have the same symmetry 
on average as the special position, so the atom will be in equilibrium, and 
there will at first sight be no net force (i.e. zero energy gradient) to push 
the atom away from the special position.
However there are 2 solutions to this: metastable equilibrium where the energy 
is a maximum, and stable equilibrium where it's a minimum.
In the metastable case it will be like a pencil balanced on its point where a 
small disturbance, such as from random thermal motion, will be sufficient to 
move it off the special position to an asymmetric position of lower energy thus 
breaking the symmetry, and of course the initial displacement will be 
completely random leading to the disorder you observe.  In the case of stable 
equilibrium the atom will simply respond to the disturbance by executing random 
thermal motion centred on the special position.  Which case you see in practice 
will obviously depend on the precise arrangement of forces acting on the atom.

Cheers

-- Ian

On Thu, Dec 9, 2010 at 4:12 PM,  herman.schreu...@sanofi-aventis.com wrote:
 Even with the famous waters on true Wyckoff positions, I usually 
 observe an elongated or even partly split density, suggesting that the 
 water is disordered, being sometimes closer to one monomer, sometimes 
 closer to the symmetry-related monomer. Since the position of proteins 
 in a crystal is in general not determined by a single water-mediated 
 hydrogen bond, the water will in general not be able to make perfect 
 hydrogen bonds to both symmetry-related monomers at the same time. I 
 think therefore that even waters should generally be considered to be 
 disordered and only in exceptional cases will occupy true Wyckoff 
 positions.

 Best,
 Herman

 -Original Message-
 From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of 
 Ian Tickle
 Sent: Thursday, December 09, 2010 3:35 PM
 To: CCP4BB@JISCMAIL.AC.UK
 Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein 
 atoms

 cases out there (and so far I have heard of a disulfide bond on a 
 2-fold connecting two homodimers).

 I'm slightly puzzled by this example.  If the S-S bond is on the 
 special position, then the rest of the molecule can't have 2-fold 
 symmetry, so would have to be rotationally disordered with occupancy =
 0.5 to avoid clashing with its symmetry mate:

               *
     X -- C  *
             \ *
              S
               |
              S
              * \
              *  C -- X
              *

 where the *'s indicate the 2-fold axis (i.e. vertically in the plane 
 of the page).  In this case, for the reasons I gave in my previous 
 post there's no reason for the disordered S atoms to be exactly on the 
 2-fold; it would be pure coincidence if they were.  If you mean 
 instead that the 2-fold is _perpendicular_ to the S-S bond (i.e.
 coming straight out of the page in the diagram), the molecule does 
 indeed have 2-fold symmetry and can be ordered with occupancy = 1, but 
 then the S atoms are not on special positions, so this would not be an 
 example of protein atoms _on_ a special position.

 One could imagine an example, say where the same side-chain on each 
 monomer is cross-linked (e.g. LYS with glutaraldehyde), forming the
 homodimer:

    X -- C -- N = C -- C -- C -- C -- C = N -- C -- X

 Here the central C atom could be on a 2-fold (i.e. axis perpendicular 
 to the page) special position without rotational disorder.  I've no 
 idea whether such a structure actually exists!

 Cheers

 -- Ian



[ccp4bb] Structure containing nickel ions from Ni-NTA column

2010-12-10 Thread Kristof Van Hecke

Dear,

I was wondering if anybody has experienced before the leakage of Ni- 
ions (from a Ni-NTA column) and additionally binding to specific  
sites in the protein structure..?


Many thanks

Regards

Kristof

--
Kristof Van Hecke, PhD
Biomoleculaire Architectuur
Celestijnenlaan 200 F
B-3001 Heverlee (Leuven)
Tel: +32(0)16327477
--





[ccp4bb] Space group vs. gap between Rwork and Rfree?

2010-12-10 Thread Petr Kolenko
Dear colleagues,

I appreciate any help, or any suggestion with my difficult data. Many
thanks at least for consideration.
I work with datasets at 3.6AA of resolution. Integrated with XDS,
scaled with SCALA. After integration and scaling in P1, POINTLESS
suggested space group I432:

Space group confidence: 0.95
Laue group confidence: 1.000
Total probability: 0.97


After any longer refinement (more than 20 cycles), there was always a
big gap between Rwork and Rfree (0.22 and 0.33), although the
structure looks quite good at this resolution. I tried also space
group I23, here are my statistics:

Data processing (I23 vs I432):
Rmerge - 0.124 vs 0.134
high resolution Rmerge - 0.649 vs 0.732
low resolution Rmerge - 0.048 vs 0.037

The gap between Rwork and Rfree was stabilized in I23 using tight NCS
restraints:

Rwork vs Rfree:
I23 withouth NCS: 0.228 vs 0.334
I23 with NCS: 0.249 vs 0.296
I43 : 0.228 vs 0.326

My question is, what would you recommend me to close the gap? Or is
this Rfree pointing out a lower real symmetry (I23) than suggested by
POINTLESS or PHENIX.XTRIAGE (I432)? I have tried different selection
of free reflections in I432 dataset already, but with almost the same
results.

Many thanks for any response.
Petr

-- 
Petr Kolenko
petr.kole...@biochemtech.uni-halle.de
http://kolda.webz.cz


Re: [ccp4bb] Structure containing nickel ions from Ni-NTA column

2010-12-10 Thread David Veesler

Hi Kristof,
depending on what you termed specific sites the answer is definitely  
yes.
I experienced such a case in the following reference related to the  
AcrB structure but I remember that this situation arose also in the  
case of the LTC4 structure.


-There is a baby in the bath water: AcrB contamination is a major  
problem in membrane-protein crystallization.

Veesler D, Blangy S, Cambillau C, Sciara G.

Acta Crystallogr Sect F Struct Biol Cryst Commun. 2008 Oct 1;64(Pt 10): 
880-5.


-Structural basis for synthesis of inflammatory mediators by human  
leukotriene C4 synthase.


Martinez Molina D, Wetterholm A, Kohl A, McCarthy AA, Niegowski D,  
Ohlson E, Hammarberg T, Eshaghi S, Haeggström JZ, Nordlund P.


Nature. 2007 Aug 2;448(7153):613-6.


All the best
David

Le 10 déc. 10 à 12:17, Kristof Van Hecke a écrit :


Dear,

I was wondering if anybody has experienced before the leakage of Ni- 
ions (from a Ni-NTA column) and additionally binding to specific  
sites in the protein structure..?


Many thanks

Regards

Kristof

--
Kristof Van Hecke, PhD
Biomoleculaire Architectuur
Celestijnenlaan 200 F
B-3001 Heverlee (Leuven)
Tel: +32(0)16327477
--




--
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.



--
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.



Re: [ccp4bb] Space group vs. gap between Rwork and Rfree?

2010-12-10 Thread Phil Evans
Can you exclude space groups with screw axes (I4132, I213)? It's worth trying 
molecular replacement in all possible space groups to check, or try the York 
Zanuda server (http://www.ysbl.york.ac.uk/YSBLPrograms/index.jsp)

Phil


On 10 Dec 2010, at 11:17, Petr Kolenko wrote:

 Dear colleagues,
 
 I appreciate any help, or any suggestion with my difficult data. Many
 thanks at least for consideration.
 I work with datasets at 3.6AA of resolution. Integrated with XDS,
 scaled with SCALA. After integration and scaling in P1, POINTLESS
 suggested space group I432:
 
 Space group confidence: 0.95
 Laue group confidence: 1.000
 Total probability: 0.97
 
 
 After any longer refinement (more than 20 cycles), there was always a
 big gap between Rwork and Rfree (0.22 and 0.33), although the
 structure looks quite good at this resolution. I tried also space
 group I23, here are my statistics:
 
 Data processing (I23 vs I432):
 Rmerge - 0.124 vs 0.134
 high resolution Rmerge - 0.649 vs 0.732
 low resolution Rmerge - 0.048 vs 0.037
 
 The gap between Rwork and Rfree was stabilized in I23 using tight NCS
 restraints:
 
 Rwork vs Rfree:
 I23 withouth NCS: 0.228 vs 0.334
 I23 with NCS: 0.249 vs 0.296
 I43 : 0.228 vs 0.326
 
 My question is, what would you recommend me to close the gap? Or is
 this Rfree pointing out a lower real symmetry (I23) than suggested by
 POINTLESS or PHENIX.XTRIAGE (I432)? I have tried different selection
 of free reflections in I432 dataset already, but with almost the same
 results.
 
 Many thanks for any response.
 Petr
 
 -- 
 Petr Kolenko
 petr.kole...@biochemtech.uni-halle.de
 http://kolda.webz.cz


Re: [ccp4bb] A script is needed to renumber image

2010-12-10 Thread Ian Tickle
Quickie 1-liners are also possible in Perl, and it's a lot more
flexible than awk to boot:

perl -e 'for(*) {rename $_,$1.($2-720).$3 if /(.+?)(\d+)(.img)$/}'

This (as well as I suspect some of the shell scripts posted) would
fail if you had asked to rename to the range 001..360 since the
leading zeroes will not appear.  This can be fixed by a bit of
defensive programming: replace '($2-720)' with
'sprintf(%03d,$2-720)'.

-- Ian

On Fri, Dec 10, 2010 at 1:51 AM, wu donghui wdh0...@gmail.com wrote:
 Dear all,
 I need a script to renumber my image. My initial image number ranges from
 *_1081.img to *_1440.img. There are 360 images in total. I want to renumber
 these images with the ranges from *_361.img to *_720.img, that means every
 initial image-720, but I don't know how to do it. Below is my script draft.


 #! /bin/csh -f
 @ i = 1081
 while ( $i = 1081 )
 while ($i = 1440 )
 echo mv CD267A_3_pk_1_$i.img CD267A_3_pk_1_$i-720.img
   @ i++
 end
 exit


 Can anyone help me for this script? Thank you very much.

 Best regards,
 Donghui


Re: [ccp4bb] Space group vs. gap between Rwork and Rfree?

2010-12-10 Thread Vellieux Frederic

Hi Petr,

Usually IDXREF suggests more than one Bravais lattice that is consistent 
with your diffraction images;
hence it is (sometimes) worthwhile trying to INTEGRATE in all possible 
Bravais lattices and this allows you to eliminate a number of 
possibilities (poor profiles during integration, instability of 
refinement, poor Rmerge values at the level of CORRECT). This leaves you 
with a smaller number of choices for the Bravais lattive. And I 
personally (at the end) integrate in the correct Bravais lattice instead 
of integrating in P1 and then doing the space group assignment at the 
level of CORRECT. I get improved Rmerge values then.


I've had a case where one of the students here was wrongly advising my 
student about an orthorhombic space group, and my advice to carry out 
integration in all Bravais lattices suggested by IDXREF, integration led 
in orthorhombic gave a molecular replacement solution that did not 
refine properly and a molecular replacement solution that refined in a 
hexagonal space group. The Rmerge values (at the level of CORRECT) vere 
very similar both for P222 and for P6 integration.


HTH,

Fred.

Petr Kolenko wrote:

Dear colleagues,

I appreciate any help, or any suggestion with my difficult data. Many
thanks at least for consideration.
I work with datasets at 3.6AA of resolution. Integrated with XDS,
scaled with SCALA. After integration and scaling in P1, POINTLESS
suggested space group I432:

Space group confidence: 0.95
Laue group confidence: 1.000
Total probability: 0.97


After any longer refinement (more than 20 cycles), there was always a
big gap between Rwork and Rfree (0.22 and 0.33), although the
structure looks quite good at this resolution. I tried also space
group I23, here are my statistics:

Data processing (I23 vs I432):
Rmerge - 0.124 vs 0.134
high resolution Rmerge - 0.649 vs 0.732
low resolution Rmerge - 0.048 vs 0.037

The gap between Rwork and Rfree was stabilized in I23 using tight NCS
restraints:

Rwork vs Rfree:
I23 withouth NCS: 0.228 vs 0.334
I23 with NCS: 0.249 vs 0.296
I43 : 0.228 vs 0.326

My question is, what would you recommend me to close the gap? Or is
this Rfree pointing out a lower real symmetry (I23) than suggested by
POINTLESS or PHENIX.XTRIAGE (I432)? I have tried different selection
of free reflections in I432 dataset already, but with almost the same
results.

Many thanks for any response.
Petr

  


Re: [ccp4bb] Space group vs. gap between Rwork and Rfree?

2010-12-10 Thread Petr Kolenko
Some refinement of my question:

I am working on mutant variant (point mutation, not on the interface)
of one protein already deposited in space group I432. Screw axes are
excluded, NCS operator is equivalent to missing symmetry operation.
To Fred, I always process data in P1 first. After that and all
analysis, I completely process the data again in suggested space
groups in different directories to avoid any mixture of the files.

I do not know, maybe the problem is only in refinement, but I can not
make the gap between Rfree and Rwork somehow reasonably stable.

Many thanks to all of you for your suggestions and interest.
Petr


2010/12/10 Vellieux Frederic frederic.velli...@ibs.fr:
 Hi Petr,

 Usually IDXREF suggests more than one Bravais lattice that is consistent
 with your diffraction images;
 hence it is (sometimes) worthwhile trying to INTEGRATE in all possible
 Bravais lattices and this allows you to eliminate a number of possibilities
 (poor profiles during integration, instability of refinement, poor Rmerge
 values at the level of CORRECT). This leaves you with a smaller number of
 choices for the Bravais lattive. And I personally (at the end) integrate in
 the correct Bravais lattice instead of integrating in P1 and then doing the
 space group assignment at the level of CORRECT. I get improved Rmerge values
 then.

 I've had a case where one of the students here was wrongly advising my
 student about an orthorhombic space group, and my advice to carry out
 integration in all Bravais lattices suggested by IDXREF, integration led in
 orthorhombic gave a molecular replacement solution that did not refine
 properly and a molecular replacement solution that refined in a hexagonal
 space group. The Rmerge values (at the level of CORRECT) vere very similar
 both for P222 and for P6 integration.

 HTH,

 Fred.

 Petr Kolenko wrote:

 Dear colleagues,

 I appreciate any help, or any suggestion with my difficult data. Many
 thanks at least for consideration.
 I work with datasets at 3.6AA of resolution. Integrated with XDS,
 scaled with SCALA. After integration and scaling in P1, POINTLESS
 suggested space group I432:

 Space group confidence: 0.95
 Laue group confidence: 1.000
 Total probability: 0.97


 After any longer refinement (more than 20 cycles), there was always a
 big gap between Rwork and Rfree (0.22 and 0.33), although the
 structure looks quite good at this resolution. I tried also space
 group I23, here are my statistics:

 Data processing (I23 vs I432):
 Rmerge - 0.124 vs 0.134
 high resolution Rmerge - 0.649 vs 0.732
 low resolution Rmerge - 0.048 vs 0.037

 The gap between Rwork and Rfree was stabilized in I23 using tight NCS
 restraints:

 Rwork vs Rfree:
 I23 withouth NCS: 0.228 vs 0.334
 I23 with NCS: 0.249 vs 0.296
 I43 : 0.228 vs 0.326

 My question is, what would you recommend me to close the gap? Or is
 this Rfree pointing out a lower real symmetry (I23) than suggested by
 POINTLESS or PHENIX.XTRIAGE (I432)? I have tried different selection
 of free reflections in I432 dataset already, but with almost the same
 results.

 Many thanks for any response.
 Petr






-- 
Petr Kolenko
petr.kole...@biochemtech.uni-halle.de
http://kolda.webz.cz


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Colin Nave
Does one regard the metal atom in a metalloprotein as being part of the protein?

If so, a shared metal could occupy a special position in a dimer for example. 

In Acta Cryst. (2008). D64, 257-263 Metals in proteins: correlation between 
the metal-ion type, coordination number and the amino-acid residues involved in 
the coordination I. Dokmanic, M. Sikic and S. Tomic ( 
http://scripts.iucr.org/cgi-bin/paper?S090744490706595X ) it says there are 25 
cases of metal atoms in special positions. 

Also Acta Cryst. (2002). D58, 29-38 The 2.6 Å resolution structure of 
Rhodobacter capsulatus bacterioferritin with metal-free dinuclear site and heme 
iron in a crystallographic `special position' D. Cobessi, L.-S. Huang, M. Ban, 
N. G. Pon, F. Daldal and E. A. Berry ( 
http://scripts.iucr.org/cgi-bin/paper?S0907444901017267 ) though the 'special 
position' is justifiably in quotation marks in this example as disorder is 
present.

Colin


Re: [ccp4bb] Space group vs. gap between Rwork and Rfree?

2010-12-10 Thread Herman . Schreuder
Dear Petr,

An Rfree of 33% does happen, especially at low resolution. However my 
experience is that with such high Rfrees there usually is some problem. The 
problem might be disorder in the protein or the crystal. In this case there is 
not much you can do. However, before giving up, I would at least check the 
following (if you have not already done it):

1) run phaser with sgalternative all. Although the celldimensions are the 
same, the spacegroup maybe different and even with the same space group and 
cell dimensions, the packing may be different. I am currently working on a 
project were different crystals, grown from the same protein and with the same 
cell dimensions and space group, have a different packing. In one case, the 
molecules are 90° rotated with the respect to the reference, in other cases the 
molecules are tilted by 20-30°. The latter structure (in the wrong orientation) 
could be refined to an Rfree of ~35%!
2) check for twinning. Your 4-fold symmetry might be broken to become 2-fold, 
and it might be in a different direction in different parts of the crystal.

Good luck!
Herman

  

-Original Message-
From: CCP4 bulletin board [mailto:ccp...@jiscmail.ac.uk] On Behalf Of Petr 
Kolenko
Sent: Friday, December 10, 2010 1:55 PM
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] Space group vs. gap between Rwork and Rfree?

Some refinement of my question:

I am working on mutant variant (point mutation, not on the interface) of one 
protein already deposited in space group I432. Screw axes are excluded, NCS 
operator is equivalent to missing symmetry operation.
To Fred, I always process data in P1 first. After that and all analysis, I 
completely process the data again in suggested space groups in different 
directories to avoid any mixture of the files.

I do not know, maybe the problem is only in refinement, but I can not make the 
gap between Rfree and Rwork somehow reasonably stable.

Many thanks to all of you for your suggestions and interest.
Petr


2010/12/10 Vellieux Frederic frederic.velli...@ibs.fr:
 Hi Petr,

 Usually IDXREF suggests more than one Bravais lattice that is 
 consistent with your diffraction images; hence it is (sometimes) 
 worthwhile trying to INTEGRATE in all possible Bravais lattices and 
 this allows you to eliminate a number of possibilities (poor profiles 
 during integration, instability of refinement, poor Rmerge values at 
 the level of CORRECT). This leaves you with a smaller number of 
 choices for the Bravais lattive. And I personally (at the end) 
 integrate in the correct Bravais lattice instead of integrating in P1 
 and then doing the space group assignment at the level of CORRECT. I 
 get improved Rmerge values then.

 I've had a case where one of the students here was wrongly advising my 
 student about an orthorhombic space group, and my advice to carry out 
 integration in all Bravais lattices suggested by IDXREF, integration 
 led in orthorhombic gave a molecular replacement solution that did not 
 refine properly and a molecular replacement solution that refined in a 
 hexagonal space group. The Rmerge values (at the level of CORRECT) 
 vere very similar both for P222 and for P6 integration.

 HTH,

 Fred.

 Petr Kolenko wrote:

 Dear colleagues,

 I appreciate any help, or any suggestion with my difficult data. Many 
 thanks at least for consideration.
 I work with datasets at 3.6AA of resolution. Integrated with XDS, 
 scaled with SCALA. After integration and scaling in P1, POINTLESS 
 suggested space group I432:

 Space group confidence: 0.95
 Laue group confidence: 1.000
 Total probability: 0.97


 After any longer refinement (more than 20 cycles), there was always a 
 big gap between Rwork and Rfree (0.22 and 0.33), although the 
 structure looks quite good at this resolution. I tried also space 
 group I23, here are my statistics:

 Data processing (I23 vs I432):
 Rmerge - 0.124 vs 0.134
 high resolution Rmerge - 0.649 vs 0.732 low resolution Rmerge - 0.048 
 vs 0.037

 The gap between Rwork and Rfree was stabilized in I23 using tight NCS
 restraints:

 Rwork vs Rfree:
 I23 withouth NCS: 0.228 vs 0.334
 I23 with NCS: 0.249 vs 0.296
 I43 : 0.228 vs 0.326

 My question is, what would you recommend me to close the gap? Or is 
 this Rfree pointing out a lower real symmetry (I23) than suggested by 
 POINTLESS or PHENIX.XTRIAGE (I432)? I have tried different selection 
 of free reflections in I432 dataset already, but with almost the same 
 results.

 Many thanks for any response.
 Petr






--
Petr Kolenko
petr.kole...@biochemtech.uni-halle.de
http://kolda.webz.cz


Re: [ccp4bb] brute force MR

2010-12-10 Thread Ian Stokes-Rees
Arnon,

We have developed an MR search mechanism which may be helpful in this scenario. 
 It is web accessible and available to any public or academic researchers:

https://portal.nebiogrid.org/secure/apps/wsmr/

It can use up to the full set of SCOP domains (100k) to attempt a Phaser MR 
placement of each domain and then ranks the results, allowing you to identify a 
single well placed domain.  The web-based system does not allow you to fix that 
domain to continue the search for subsequent domains but we can do this from 
the command line interface.  If you have a particular set of domains you'd like 
to search against (RCSB or SCOP PDB codes), then we can limit the search to 
that set.

If you decide to use this, please contact us once your first domain search has 
been completed (these take 2-3 years of serial computing time and will finish 
in 1-3 days, depending on how many other computations are ahead of it in our 
queue, the complexity/resolution of your data set, space group, and unit cell 
size).

Regards,

Ian Stokes-Rees


Re: [ccp4bb] brute force MR

2010-12-10 Thread Ian Stokes-Rees
For anyone who is interested, I meant to include a reference to the PNAS paper 
that has just come out (web-only early release) describing the wide search MR 
strategy we've developed:

Stokes-Rees, Sliz
Protein structure determination by exhaustive search of Protein Data Bank 
derived databases
Proc. Nat'l Academy of Sciences
doi:10.1073/pnas.1012095107
http://www.pnas.org/content/early/2010/11/17/1012095107

Ian


Re: [ccp4bb] how to reduce Rfactor R free

2010-12-10 Thread Ed Pozharski
Jack (?),

On Thu, 2010-12-09 at 21:35 -0800, Jack Russel wrote:
 So is it now the time to stop further refining the solution . 

R-values are not the only criteria for this.  You should be looking for
a) lack of unexplained density
b) good geometry
c) acceptable R-values

This
http://xray.bmc.uu.se/gerard/supmat/rfree2000/plotter.html
can help you to get some idea of the range of R-values for the
structures deposited in the PDB.

 if not then  how can i lower the  R factor and Rfree?

First, you should probably be looking to lower only Rfree.  Otherwise,
the answer to your question depends on what you are dealing with.  You
may have done some of this, but consider

- TLS
- more conservative geometry restraints
- NCS restraints if you got more than one copy in the asu
- there is this very cute Coot feature Refine/improve Ramachandran
plot
- etc, etc, etc

Keep in mind that, to paraphrase Daniel Gewirth, there are many ways in
which R-values can be manipulated, both deliberately and not.  Yet
another reason not to go nuts about R-values and to take them, like
poison, with a grain of salt.

  The second question is it possible to have such a large difference
 between R factor and R free?

Well, apparently it is, you have just proven it :).  Seriously though,
the 10% that you have is on the high side.  Some things mentioned above
for the reduction of Rfree may help to close the gap, but once again,
stuff happens.

I'd make sure to have 10% set aside for the test set, at this resolution
5% may be not enough. 

Could you privately send me the mtz-file from refmac output?  I am
curious about R-value statistics in this case.  Naturally, it will be
treated confidentially.

Cheers,

Ed.

-- 
I'd jump in myself, if I weren't so good at whistling.
   Julian, King of Lemurs


Re: [ccp4bb] A script is needed to renumber image

2010-12-10 Thread Ed Pozharski
Assuming that your files are named something_.img what follows must
be one line):

echo | awk '{for(i=361;i721;i++) printf mv something_%d.img something_
%d.img\n,i+720,i;}' | bash -sf

I'd try first without piping it to bash just to make sure that it works
right.

On Fri, 2010-12-10 at 09:51 +0800, wu donghui wrote:
 Dear all,
 
 
 I need a script to renumber my image. My initial image number ranges
 from *_1081.img to *_1440.img. There are 360 images in total. I want
 to renumber these images with the ranges from *_361.img to *_720.img,
 that means every initial image-720, but I don't know how to do it.
 Below is my script draft.
 
 
 
 
 
 
 #! /bin/csh -f
 
 
 @ i = 1081
 
 
 while ( $i = 1081 ) 
 while ($i = 1440 )
 echo mv CD267A_3_pk_1_$i.img CD267A_3_pk_1_$i-720.img
   @ i++
 end
 
 
 exit
 
 
 
 
 
 
 Can anyone help me for this script? Thank you very much.
 
 
 
 
 Best regards,
 
 
 Donghui

-- 
I'd jump in myself, if I weren't so good at whistling.
   Julian, King of Lemurs


Re: [ccp4bb] A script is needed to renumber image

2010-12-10 Thread Ed Pozharski
Ben,

does this assume that the current folder contains only the files to be
renamed?  Also, how does one add padding zeros?

While Donghui didn't need zero padding, my one liner can be easily
corrected to do this in the same way Ian's is, by replacing 
%d with %0nd, where n is the total number of digits one desires.

Ed.

On Thu, 2010-12-09 at 21:42 -0500, Ben Eisenbraun wrote:
  #! /bin/csh -f
 
 That's your problem right there!  You don't need a script at all.
 
 In bash/ksh/zsh right on the command line:
 
 c=361  for f in * ; do mv $f CD267A_3_pk_1_$c.img ; c=$(($c+1)) ; done
 
 Bill Scott probably has some zsh slickness to do the same thing without 
 the loop, but too much magic gives me the willies.
 
 -b
 
 --
 | Ben Eisenbraun
 | SBGrid Consortium   | http://sbgrid.org  |
 | Harvard Medical School  | http://hms.harvard.edu |

-- 
I'd jump in myself, if I weren't so good at whistling.
   Julian, King of Lemurs


Re: [ccp4bb] how to reduce Rfactor R free

2010-12-10 Thread Bosch, Juergen
The other question is, you say it's 3.3Å, by which criteria ?
Xtriage might report a 4Å dataset if you take I/sigI 3 and 85% completeness
It might also make a difference which data reduction program you used.
[/Advertisement on]XDS/XSCALE or d*Trek[/Advertisement off] might squeeze a bit 
more our of your dataset.

Jürgen

-
Jürgen Bosch
Johns Hopkins Bloomberg School of Public Health
Department of Biochemistry  Molecular Biology
Johns Hopkins Malaria Research Institute
615 North Wolfe Street, W8708
Baltimore, MD 21205
Phone: +1-410-614-4742
Lab:  +1-410-614-4894
Fax:  +1-410-955-3655
http://web.mac.com/bosch_lab/http://web.me.com/bosch_lab/

On Dec 10, 2010, at 12:35 AM, Jack Russel wrote:








Hi all,

I am new in the field of crystallography.

I have just  collected  a data at 3.3 Å and the solved the structure using MR . 
the space group comes out to be P2 3. There are 3 molecules in Asymetric unit 
with the LLG of 300 and tfz 18. But after repeated rounds of refinement with 
REFMAC5 and model building with coot  the R factor had been  struck at 30% and 
R free at 40%.

So is it now the time to stop further refining the solution . if not then  how 
can i lower the  R factor and Rfree?

The second question is it possible to have such a large difference between R 
factor and R free?

Thanks in advance








Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Edward A. Berry

Colin Nave wrote:
..



Also Acta Cryst. (2002). D58, 29-38 The 2.6 Å resolution structure of Rhodobacter 
capsulatus bacterioferritin with metal-free dinuclear site and heme iron in a 
crystallographic `special position' D. Cobessi, L.-S. Huang, M. Ban, N. G. Pon, F. 
Daldal and E. A. Berry ( http://scripts.iucr.org/cgi-bin/paper?S0907444901017267 ) though 
the 'special position' is justifiably in quotation marks in this example as disorder is 
present.

Colin



Yes, this was an example of Ian's rotationally disordered with occupancy 0.5
possibility.

Heme has a pseudo-two-fold axis passing through the plane of the ring,
which is violated only by the position of the methyl and vinyl substituents
on either side: rotating through the pseudo-2-fold superimposes methyl on
vinyl and vice versa on either side. This axis was sitting on a
crystallographic 2-fold.

We did trial refinements in a lower symmetry space group so the
axis became NCS, and we refined with the heme in one orientation or
the other. In either case, difference maps showed positive density
for another atom beyond the methyl, and negative density on CB of
the vinyl. So we concluded the heme was oriented both ways, and
in the average satisfied the symmetry of the higher space group.
Putting it with half occupancy in the asymmetric unit allowed
crystallographic symmetry to generate the rotated half.

Looking at it from the heme's point of view, this makes sense.
Its environment is perfectly symmetrical, so there is no way
for it to choose one orientation over the other and both are
adopted equally.  (The two-fold is intrinsic to the biological unit,
not generated by crystal packing).

I think the heme iron was right on the crystallographic axis.
When refining in the lower space-group with two .5 occupancy
non-interacting hemes, the propionates (which actually satisfy the
covalent symmetry) in the two hemes took on different conformations,
not sure if this was significant or just taking liberties where
freedom is given.

eab


Re: [ccp4bb] Space group vs. gap between Rwork and Rfree?

2010-12-10 Thread Lijun Liu

Hi,

At this resolution, such a difference in Rs could be an indicator that  
the structure is over--fitted.

What weight did you choose?

Lijun

On Dec 10, 2010, at 3:17 AM, Petr Kolenko wrote:


Dear colleagues,

I appreciate any help, or any suggestion with my difficult data. Many
thanks at least for consideration.
I work with datasets at 3.6AA of resolution. Integrated with XDS,
scaled with SCALA. After integration and scaling in P1, POINTLESS
suggested space group I432:

Space group confidence: 0.95
Laue group confidence: 1.000
Total probability: 0.97


After any longer refinement (more than 20 cycles), there was always a
big gap between Rwork and Rfree (0.22 and 0.33), although the
structure looks quite good at this resolution. I tried also space
group I23, here are my statistics:

Data processing (I23 vs I432):
Rmerge - 0.124 vs 0.134
high resolution Rmerge - 0.649 vs 0.732
low resolution Rmerge - 0.048 vs 0.037

The gap between Rwork and Rfree was stabilized in I23 using tight NCS
restraints:

Rwork vs Rfree:
I23 withouth NCS: 0.228 vs 0.334
I23 with NCS: 0.249 vs 0.296
I43 : 0.228 vs 0.326

My question is, what would you recommend me to close the gap? Or is
this Rfree pointing out a lower real symmetry (I23) than suggested by
POINTLESS or PHENIX.XTRIAGE (I432)? I have tried different selection
of free reflections in I432 dataset already, but with almost the same
results.

Many thanks for any response.
Petr

--
Petr Kolenko
petr.kole...@biochemtech.uni-halle.de
http://kolda.webz.cz


Lijun Liu
555 Mission Bay Blvd South
CVRB Room 482, MBox 3122
University of California, San Francisco
San Francisco, CA 94158
Phone: (415)514-2836






Re: [ccp4bb] A script is needed to renumber image

2010-12-10 Thread Ben Eisenbraun
Hi Ed,

 does this assume that the current folder contains only the files to be
 renamed?

Indeed it does.  I was aiming to show that for simple tasks like this, 
a modern shell makes things easy enough that scripts become overkill.

 Also, how does one add padding zeros?

You can just add in a printf to go, e.g., from a padded 001 to 360:

c=1  for f in * ; do echo mv $f CD267A_3_pk_1_`printf %03d $c`.img ; 
c=$(($c+1)) ; done

Or a more specific set of images:

c=1  for f in CD267A_3_pk_1_*.img ; do echo mv $f CD267A_3_pk_1_`printf 
%03d $c`.img ; c=$(($c+1)) ; done

Or an even more specific set of images:

c=1  for f in CD267A_3_pk_1_{1081..1440}.img ; do echo mv $f 
CD267A_3_pk_1_`printf %03d $c`.img ; c=$(($c+1)) ; done 

I guess once it gets complex enough then you might as well go down James'
path and create a generalized script that can be used in many cases, but
it's definitely more work to write the generalized script than to toss off
one liners.

-ben

--
| Ben Eisenbraun  | Software Sysadmin  |
| SBGrid Consortium   | http://sbgrid.org  |
| Harvard Medical School  | http://hms.harvard.edu |


Re: [ccp4bb] Space group vs. gap between Rwork and Rfree?

2010-12-10 Thread Petr Kolenko
Hi,

Some of you were right, I did not try to modify weights. Previous
weight was 0.03 (assigned automatically), actual is 0.01 and Rwork vs
Rfree is about 0.25 and 0.33, so 8% of difference - hopefully
acceptable. But now I have very low RMSDs. And then you read in this
forum quite often, that we should refine against the data and not
against the restraints. :)

Thanks anyway,
Petr



2010/12/10 Lijun Liu lijun@ucsf.edu:
 Hi,
 At this resolution, such a difference in Rs could be an indicator that the
 structure is over--fitted.
 What weight did you choose?
 Lijun
 On Dec 10, 2010, at 3:17 AM, Petr Kolenko wrote:

 Dear colleagues,

 I appreciate any help, or any suggestion with my difficult data. Many
 thanks at least for consideration.
 I work with datasets at 3.6AA of resolution. Integrated with XDS,
 scaled with SCALA. After integration and scaling in P1, POINTLESS
 suggested space group I432:

 Space group confidence: 0.95
 Laue group confidence: 1.000
 Total probability: 0.97


 After any longer refinement (more than 20 cycles), there was always a
 big gap between Rwork and Rfree (0.22 and 0.33), although the
 structure looks quite good at this resolution. I tried also space
 group I23, here are my statistics:

 Data processing (I23 vs I432):
 Rmerge - 0.124 vs 0.134
 high resolution Rmerge - 0.649 vs 0.732
 low resolution Rmerge - 0.048 vs 0.037

 The gap between Rwork and Rfree was stabilized in I23 using tight NCS
 restraints:

 Rwork vs Rfree:
 I23 withouth NCS: 0.228 vs 0.334
 I23 with NCS: 0.249 vs 0.296
 I43 : 0.228 vs 0.326

 My question is, what would you recommend me to close the gap? Or is
 this Rfree pointing out a lower real symmetry (I23) than suggested by
 POINTLESS or PHENIX.XTRIAGE (I432)? I have tried different selection
 of free reflections in I432 dataset already, but with almost the same
 results.

 Many thanks for any response.
 Petr

 --
 Petr Kolenko
 petr.kole...@biochemtech.uni-halle.de
 http://kolda.webz.cz

 Lijun Liu
 555 Mission Bay Blvd South
 CVRB Room 482, MBox 3122
 University of California, San Francisco
 San Francisco, CA 94158
 Phone: (415)514-2836







-- 
Petr Kolenko
kole...@imc.cas.cz
http://kolda.webz.cz


Re: [ccp4bb] Space group vs. gap between Rwork and Rfree?

2010-12-10 Thread Ed Pozharski
On Fri, 2010-12-10 at 19:32 +0100, Petr Kolenko wrote:
 But now I have very low RMSDs. And then you read in this
 forum quite often, that we should refine against the data and not
 against the restraints. :) 

Well, at 3.6A you must have low rmsds.  And at 3.6A you don't really
have any data to refine against beyond backbone conformation and maybe
some sidechain orientation.  You have perhaps 4-5 reflections per
residue, right?  So you barely have enough to identify backbone torsions
with any degree of confidence.  We should, of course, refine against the
data, but not against the noise in it.

Cheers,

Ed.

-- 
I'd jump in myself, if I weren't so good at whistling.
   Julian, King of Lemurs


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Ian Tickle
Good point Colin!  2-Zn insulin is of course a classic example of
this, where the two independent Zn2+ ions both sit on the
crystallographic 3-fold in R3.  It doesn't matter whether you count
the metal ion as part of the protein or not: if I understand Gloria's
original question correctly, all that matters is that the atom/ion is
present in the crystal structure.

In fact here are some extracts from the PDB entry (4INS):

REMARK 375 ZNZN B  31  LIES ON A SPECIAL POSITION.
REMARK 375 ZNZN D  31  LIES ON A SPECIAL POSITION.
REMARK 375  HOH B 251  LIES ON A SPECIAL POSITION.
REMARK 375  HOH D  44  LIES ON A SPECIAL POSITION.
REMARK 375  HOH D 134  LIES ON A SPECIAL POSITION.
REMARK 375  HOH D 215  LIES ON A SPECIAL POSITION.
REMARK 375  HOH D 269  LIES ON A SPECIAL POSITION.

HETATM  835 ZNZN B  31  -0.002  -0.004   7.891  0.33 10.40  ZN
HETATM  836 ZNZN D  31   0.000   0.000  -8.039  0.33 11.00  ZN
HETATM  885  O   HOH B 251  -0.023  -0.033  11.206  0.33 21.05   O
etc

Hmmm - but shouldn't the occupancy of the Zn be 1.00 if it's on the
special position (assuming it's not disordered), though the first Zn
above and the water do appear to be disordered since they're not
actually on the special position.  Fractional occupancy always implies
some kind of disorder: occupancy = 1/3 of an atom on a special
position would imply occupancy disorder, i.e. it's randomly present in
only 1/3 of the unit cells.

-- Ian


On Fri, Dec 10, 2010 at 1:11 PM, Colin Nave colin.n...@diamond.ac.uk wrote:
 Does one regard the metal atom in a metalloprotein as being part of the 
 protein?

 If so, a shared metal could occupy a special position in a dimer for example.

 In Acta Cryst. (2008). D64, 257-263 Metals in proteins: correlation between 
 the metal-ion type, coordination number and the amino-acid residues involved 
 in the coordination I. Dokmanic, M. Sikic and S. Tomic ( 
 http://scripts.iucr.org/cgi-bin/paper?S090744490706595X ) it says there are 
 25 cases of metal atoms in special positions.

 Also Acta Cryst. (2002). D58, 29-38 The 2.6 Å resolution structure of 
 Rhodobacter capsulatus bacterioferritin with metal-free dinuclear site and 
 heme iron in a crystallographic `special position' D. Cobessi, L.-S. Huang, 
 M. Ban, N. G. Pon, F. Daldal and E. A. Berry ( 
 http://scripts.iucr.org/cgi-bin/paper?S0907444901017267 ) though the 'special 
 position' is justifiably in quotation marks in this example as disorder is 
 present.

 Colin



Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Dale Tronrud
   The proper occupancy for an atom on a special position depends on
how one defines the meaning of the number in that column.  In the
past, refinement programs, at least I know mine did, simply expanded
all atoms in the coordinate file by the symmetry operators to determine
the contents of the unit cell.  With that operation the occupancy of
the atoms on special positions had to be reduced.  It is certainly
true that there are 1/3 the number of atoms in the unit cell represented
by ZN D  31 than, for example, the CA of residue 50.

   Most modern refinement programs try to handle this automatically,
since users proved unreliable at detecting this condition and modifying
their coordinate files.  They use the interpretation that the site
is fully occupied but there are only 1/3 the number of these sites
than sites at general positions.

   Personally I find it disturbing to have the occupancy of B  31
set to 0.33 and that of D  31 set to 1.00 simply because of an
insignificant shift in the position of the atom.

Dale Tronrud

On 12/10/10 13:53, Ian Tickle wrote:
 Good point Colin!  2-Zn insulin is of course a classic example of
 this, where the two independent Zn2+ ions both sit on the
 crystallographic 3-fold in R3.  It doesn't matter whether you count
 the metal ion as part of the protein or not: if I understand Gloria's
 original question correctly, all that matters is that the atom/ion is
 present in the crystal structure.
 
 In fact here are some extracts from the PDB entry (4INS):
 
 REMARK 375 ZNZN B  31  LIES ON A SPECIAL POSITION.
 REMARK 375 ZNZN D  31  LIES ON A SPECIAL POSITION.
 REMARK 375  HOH B 251  LIES ON A SPECIAL POSITION.
 REMARK 375  HOH D  44  LIES ON A SPECIAL POSITION.
 REMARK 375  HOH D 134  LIES ON A SPECIAL POSITION.
 REMARK 375  HOH D 215  LIES ON A SPECIAL POSITION.
 REMARK 375  HOH D 269  LIES ON A SPECIAL POSITION.
 
 HETATM  835 ZNZN B  31  -0.002  -0.004   7.891  0.33 10.40  ZN
 HETATM  836 ZNZN D  31   0.000   0.000  -8.039  0.33 11.00  ZN
 HETATM  885  O   HOH B 251  -0.023  -0.033  11.206  0.33 21.05   O
 etc
 
 Hmmm - but shouldn't the occupancy of the Zn be 1.00 if it's on the
 special position (assuming it's not disordered), though the first Zn
 above and the water do appear to be disordered since they're not
 actually on the special position.  Fractional occupancy always implies
 some kind of disorder: occupancy = 1/3 of an atom on a special
 position would imply occupancy disorder, i.e. it's randomly present in
 only 1/3 of the unit cells.
 
 -- Ian
 
 
 On Fri, Dec 10, 2010 at 1:11 PM, Colin Nave colin.n...@diamond.ac.uk wrote:
 Does one regard the metal atom in a metalloprotein as being part of the 
 protein?

 If so, a shared metal could occupy a special position in a dimer for example.

 In Acta Cryst. (2008). D64, 257-263 Metals in proteins: correlation between 
 the metal-ion type, coordination number and the amino-acid residues involved 
 in the coordination I. Dokmanic, M. Sikic and S. Tomic ( 
 http://scripts.iucr.org/cgi-bin/paper?S090744490706595X ) it says there are 
 25 cases of metal atoms in special positions.

 Also Acta Cryst. (2002). D58, 29-38 The 2.6 Å resolution structure of 
 Rhodobacter capsulatus bacterioferritin with metal-free dinuclear site and 
 heme iron in a crystallographic `special position' D. Cobessi, L.-S. Huang, 
 M. Ban, N. G. Pon, F. Daldal and E. A. Berry ( 
 http://scripts.iucr.org/cgi-bin/paper?S0907444901017267 ) though the 
 'special position' is justifiably in quotation marks in this example as 
 disorder is present.

 Colin



Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Ian Tickle
No that surely can't be right.  Application of a symmetry operator to
a point on a special position which is unchanged by the operator
doesn't generate a symmetry copy of the point, because there is no
symmetry copy of such a point!  For general Wyckoff positions it does,
for sure.  If you look at the entry for R3 (hexagonal or rhombohedral
setting, it doesn't matter which) on the Wyckoff position website I
indicated earlier, you'll see a column labelled 'multiplicity' which
is 3 for the general position and 1 for the special position.  This
means that the symmetry operator generates 3 symmetry copies of a
general position (including the original), but only 1 copy (i.e. only
the original) of the special positions on the 3-fold.

That's precisely why such points are called special positions - i.e.
you have to treat them specially!  If what you describe is what the
refinement program or whatever is doing then it would imply there's a
programming error, i.e. the program is not treating special positions
as special, it's treating them instead as general positions.  As I
said whenever you see fractional occupancy reported it should imply
some kind of disorder, i.e. the atom exists on that site in only the
indicated fraction of the unit cells in the lattice.

-- Ian

On Fri, Dec 10, 2010 at 10:02 PM, Sue Roberts s...@email.arizona.edu wrote:
 Hi Ian

 No disorder is involved.

 The occupancy of an (fully occupied) atom on an n-fold rotation axis is 1/n
 If a two-fold, 1/2
 If a three-fold, 1/3

 When you sum over all the atoms in the unit cell, application of the symmetry 
 operations to atoms lying on the rotation axis generates atoms with unchanged 
 coordinates. Hence to generate a fully occupied atom on a n-fold symmetry 
 axis, the original occupancy has to be 1/n.

 Sue

 On Dec 10, 2010, at 2:53 PM, Ian Tickle wrote:

 Good point Colin!  2-Zn insulin is of course a classic example of
 this, where the two independent Zn2+ ions both sit on the
 crystallographic 3-fold in R3.  It doesn't matter whether you count
 the metal ion as part of the protein or not: if I understand Gloria's
 original question correctly, all that matters is that the atom/ion is
 present in the crystal structure.

 In fact here are some extracts from the PDB entry (4INS):

 REMARK 375 ZN    ZN B  31  LIES ON A SPECIAL POSITION.
 REMARK 375 ZN    ZN D  31  LIES ON A SPECIAL POSITION.
 REMARK 375      HOH B 251  LIES ON A SPECIAL POSITION.
 REMARK 375      HOH D  44  LIES ON A SPECIAL POSITION.
 REMARK 375      HOH D 134  LIES ON A SPECIAL POSITION.
 REMARK 375      HOH D 215  LIES ON A SPECIAL POSITION.
 REMARK 375      HOH D 269  LIES ON A SPECIAL POSITION.

 HETATM  835 ZN    ZN B  31      -0.002  -0.004   7.891  0.33 10.40          
 ZN
 HETATM  836 ZN    ZN D  31       0.000   0.000  -8.039  0.33 11.00          
 ZN
 HETATM  885  O   HOH B 251      -0.023  -0.033  11.206  0.33 21.05           
 O
 etc

 Hmmm - but shouldn't the occupancy of the Zn be 1.00 if it's on the
 special position (assuming it's not disordered), though the first Zn
 above and the water do appear to be disordered since they're not
 actually on the special position.  Fractional occupancy always implies
 some kind of disorder: occupancy = 1/3 of an atom on a special
 position would imply occupancy disorder, i.e. it's randomly present in
 only 1/3 of the unit cells.

 -- Ian


 On Fri, Dec 10, 2010 at 1:11 PM, Colin Nave colin.n...@diamond.ac.uk wrote:
 Does one regard the metal atom in a metalloprotein as being part of the 
 protein?

 If so, a shared metal could occupy a special position in a dimer for 
 example.

 In Acta Cryst. (2008). D64, 257-263 Metals in proteins: correlation 
 between the metal-ion type, coordination number and the amino-acid residues 
 involved in the coordination I. Dokmanic, M. Sikic and S. Tomic ( 
 http://scripts.iucr.org/cgi-bin/paper?S090744490706595X ) it says there are 
 25 cases of metal atoms in special positions.

 Also Acta Cryst. (2002). D58, 29-38 The 2.6 Å resolution structure of 
 Rhodobacter capsulatus bacterioferritin with metal-free dinuclear site and 
 heme iron in a crystallographic `special position' D. Cobessi, L.-S. 
 Huang, M. Ban, N. G. Pon, F. Daldal and E. A. Berry ( 
 http://scripts.iucr.org/cgi-bin/paper?S0907444901017267 ) though the 
 'special position' is justifiably in quotation marks in this example as 
 disorder is present.

 Colin



 Dr. Sue A. Roberts
 Dept. of Chemistry and Biochemistry
 University of Arizona
 1041 E. Lowell St.,  Tucson, AZ 85721
 Phone: 520 621 8171
 s...@email.arizona.edu
 http://www.biochem.arizona.edu/xray












Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Ed Pozharski
On Fri, 2010-12-10 at 22:40 +, Ian Tickle wrote:
 Application of a symmetry operator to
 a point on a special position which is unchanged by the operator
 doesn't generate a symmetry copy of the point, because there is no
 symmetry copy of such a point! 

Why not?  Symmetry-related copy may be considered just that - copy
produced by a symmetry operator.  What advantage do you gain by
restricting it to only those that result in physically different point?

There is one excellent point that you made in some past exchanges - that
deposited structures are mathematical models and they do not always have
to make strict physical sense.  This seems to be just such case - one
does not have three zincs in that spot with each being gone two thirds
of the time, but making the model physically meaningful would require
perhaps dropping to P1 and implementing NCS to enforce crystal symmetry.
So compromising on the physical meaning of the atom record seems like a
small price to pay.  This feels odd - I seem to be arguing your
side :)

Cheers,

Ed.

 

-- 
I'd jump in myself, if I weren't so good at whistling.
   Julian, King of Lemurs


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Boaz Shaanan
An excerpt from the Shelx manual (which I think is a good reference for proper 
handling of atoms on special positions in refinement of small or large 
molecules): 

4.3 Special position constraints
Constraints for the coordinates and 
anisotropic displacement
parameters for atoms on special positions are generated automatically by
the program for ALL special positions in ALL space groups, in 
conventional
settings or otherwise. For upwards compatibility with SHELX-76, free 
variables
may still be used for this, but it is better to leave it to the program.
If the occupancy is not input, the program will fix it at the 
appropriate
value for a special position. If the user applies (correct or incorrect)
special position constraints using free variables etc., the program 
assumes
this has been done with intent and reports but does not apply the 
correct
constraints; accidental application of wrong special position 
constraints
is one of the easiest ways to cause a refinement to 'blow up' !


Also, please check the book of Shmueli and Weiss, Introduction to 
crystallographic statistics p. 5, last paragraph  and the next page where a 
rigorous treatment of of contribution to the structure factor of atoms in 
general and special positions in the unit cell is given (eq. 1.2.10).

Thus, occupancy of an atom on a 3-fold would be 1/3, 1/2 on a 2-fold, 
1/4 on a 4-fold and so forth.  

 Cheers,

   Boaz


- Original Message -
From: Ian Tickle ianj...@gmail.com
Date: Saturday, December 11, 2010 0:41
Subject: Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms
To: CCP4BB@JISCMAIL.AC.UK

 No that surely can't be right.  Application of a symmetry 
 operator to
 a point on a special position which is unchanged by the operator
 doesn't generate a symmetry copy of the point, because there is no
 symmetry copy of such a point!  For general Wyckoff 
 positions it does,
 for sure.  If you look at the entry for R3 (hexagonal or 
 rhombohedralsetting, it doesn't matter which) on the Wyckoff 
 position website I
 indicated earlier, you'll see a column labelled 'multiplicity' which
 is 3 for the general position and 1 for the special 
 position.  This
 means that the symmetry operator generates 3 symmetry copies of a
 general position (including the original), but only 1 copy (i.e. only
 the original) of the special positions on the 3-fold.
 
 That's precisely why such points are called special positions -
 i.e.
 you have to treat them specially!  If what you describe is 
 what the
 refinement program or whatever is doing then it would imply 
 there's a
 programming error, i.e. the program is not treating special positions
 as special, it's treating them instead as general 
 positions.  As I
 said whenever you see fractional occupancy reported it should imply
 some kind of disorder, i.e. the atom exists on that site in only the
 indicated fraction of the unit cells in the lattice.
 
 -- Ian
 
 On Fri, Dec 10, 2010 at 10:02 PM, Sue Roberts 
 s...@email.arizona.edu wrote:
  Hi Ian
 
  No disorder is involved.
 
  The occupancy of an (fully occupied) atom on an n-fold 
 rotation axis is 1/n
  If a two-fold, 1/2
  If a three-fold, 1/3
 
  When you sum over all the atoms in the unit cell, application 
 of the symmetry operations to atoms lying on the rotation axis 
 generates atoms with unchanged coordinates. Hence to generate a 
 fully occupied atom on a n-fold symmetry axis, the original 
 occupancy has to be 1/n.
 
  Sue
 
  On Dec 10, 2010, at 2:53 PM, Ian Tickle wrote:
 
  Good point Colin!  2-Zn insulin is of course a classic 
 example of
  this, where the two independent Zn2+ ions both sit on the
  crystallographic 3-fold in R3.  It doesn't matter whether you count
  the metal ion as part of the protein or not: if I understand 
 Gloria's original question correctly, all that matters is that 
 the atom/ion is
  present in the crystal structure.
 
  In fact here are some extracts from the PDB entry (4INS):
 
  REMARK 375 ZN    ZN B  31  LIES ON A SPECIAL POSITION.
  REMARK 375 ZN    ZN D  31  LIES ON A SPECIAL POSITION.
  REMARK 375      HOH B 251  LIES ON A SPECIAL POSITION.
  REMARK 375      HOH D  44  LIES ON A SPECIAL POSITION.
  REMARK 375      HOH D 134  LIES ON A SPECIAL POSITION.
  REMARK 375      HOH D 215  LIES ON A SPECIAL POSITION.
  REMARK 375      HOH D 269  LIES ON A SPECIAL POSITION.
 
  HETATM  835 ZN    ZN B  31      -0.002  -0.004   7.891  0.33 
 10.40          ZN
  HETATM  836 ZN    ZN D  31       0.000   0.000  -8.039  0.33 
 11.00          ZN
  HETATM  885  O   HOH B 251      -0.023  -0.033  11.206  0.33 
 21.05           O
  etc
 
  Hmmm - but shouldn't the occupancy of the Zn be 1.00 if it's 
 on the
  special position (assuming it's not disordered), though the 
 first Zn
  above and the water do appear to be disordered since they're not
  actually on the special position.  Fractional occupancy 
 always implies
  some kind of disorder: occupancy = 1/3 of an atom on a special
  

Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread George M. Sheldrick
SHELXL also expects that the occupancy of a fully occupied atom on a 
threefold axis should be set at 1/3, and will generate this automatically 
if necessary. It will also generate automatically the necessary 
constraints for the x, y and z parameters (and for the Uij if the atom is
anisotropic). It is essential that this is done correctly if a full-matrix
refinement is being performed (e.g. to get esd estimates), otherwise
the refinement can explode. The user may change or switch off the 
tolerance for detecting whether an atom is on a special position 
(with the SPEC instruction). Setting the occupancy to a fraction avoided
a complicated IF construction inside a loop and 35 years ago computers 
were so slow! I can't change it now because I have to preserve upwards
compatibility. Unfortunately the CIF committee decided to use the other
definition (i.e. the Zn on the threefold axis has an occupancy of 1.0)
and this has caused considerable confusion in the small molecule world 
ever since; atoms are frequently encountered on special positions in
inorganic and mineral structures.

George

Prof. George M. Sheldrick FRS
Dept. Structural Chemistry,
University of Goettingen,
Tammannstr. 4,
D37077 Goettingen, Germany
Tel. +49-551-39-3021 or -3068
Fax. +49-551-39-22582


On Fri, 10 Dec 2010, Ed Pozharski wrote:

 On Fri, 2010-12-10 at 21:53 +, Ian Tickle wrote:
  Hmmm - but shouldn't the occupancy of the Zn be 1.00 if it's on the
  special position 
 
 Shouldn't 1/3 be better for programming purposes?  If you set occupancy
 to 1.0, then you should specify that symmetry operators do not apply for
 these atoms, making Fc calculation a bit more cumbersome.
 
 If definition of the asu content is you get full content of the unit
 cell after applying symmetry operators, then occupancy *must* be 1/3,
 right?
 
 The first zinc and the water are on special position, but because they
 are not excluded from positional refinement (perhaps they should be),
 they will drift a bit.  CNS has distance cutoff for treating atoms as
 special positions, if it jumps over the limit during, say, simulated
 annealing, it  will cause problems.  Perhaps PROLSQ did something
 similar.  It is a good question if it's better to fix these in place or
 let them wobble a bit to account for some potential disorder.  While I
 see the formal argument that it should be nailed to three-fold axes, it
 is also true that this is a mathematical compromise to simplify modeling
 that does not reflect physical reality (i.e. you don't have three
 partially occupied zinc ions, it's just one).  In any event, given that
 this is a 1.5A structure, (-0.002 0.004) is statistically speaking the
 same as (0 0).
 
 Cheers,
 
 Ed. 
 
 -- 
 I'd jump in myself, if I weren't so good at whistling.
Julian, King of Lemurs
 
 


Re: [ccp4bb] Fwd: [ccp4bb] Wyckoff positions and protein atoms

2010-12-10 Thread Pavel Afonine
  No disorder is involved.
 
  The occupancy of an (fully occupied) atom on an n-fold rotation axis is
 1/n
  If a two-fold, 1/2
  If a three-fold, 1/3
 
  When you sum over all the atoms in the unit cell, application of the
 symmetry operations to atoms lying on the rotation axis generates atoms with
 unchanged coordinates. Hence to generate a fully occupied atom on a n-fold
 symmetry axis, the original occupancy has to be 1/n.
 



I guess I have a few small points to add:

1) If a site on special position is not fully occupied, then the occupancy
is q/n, where q is actual occupancy (for fully occupied site q=1). Since
ATOM record in PDB file does not (directly) tell you whether the atom is on
special position or not then it is not straightforward to know whether the
occupancy of atom in question is not 1 due to disorder or due to symmetry.

2) Let's consider a hypothetical case... The maximal multiplicity of a
spacial position in proteins is 24 (is that right? please correct if not),
and the precision of the occupancy field in PDB file is only two digits,
like: 1.00. Imagine you have a fair amount of very heavy atoms all seating
at special positions with the multiplicity 24, and your structure is
relatively small. Then what you will be forced to put in the PDB file is:
1/24 ~ 0.04 and not 0.042. I'm not sure if this rounding error is
significant or not (and what is significant), but just to keep in mind
another source of mismatch between reported and recomputed R-factors.

3) If I add an atom onto special position using Coot, will it set the
occupancy as 1/n automatically or do I have to to edit the PDB file myself
(and remember to do so)? (another room for an error)

So, having said this, I tend to think that using occupancy 1 (and not 1/n)
is a good thing.

Pavel.