Re: [ccp4bb] arp/warp in p22121: what to do in Pointless

2007-09-24 Thread George M. Sheldrick
Phil,

For the record, the program XPREP - widely used by small molecule 
crystallographers but also useful for macromoleculess - always 
makes the conventional cell (in this example P 21 21 2) the default 
(i.e. what you get if you always hit Enter), and writes out new 
.hkl and .ins files in this setting unless the user forces it to do 
otherwise (the .ins file is used for solving the structure with 
direct methods and contains the cell corresponding to the new .hkl 
file). There are three cases where it is intentionally made less 
awkward for the user to choose an alternative setting if she/he so
wishes:

1. Primitive rhombohedral rather than the hexagonal cell with 
three times the volume. Unlike the situation for macromolecules, 
where some older programs do not work with the primitive 
rhombohedral cell, the primitive cell is quite often chosen for 
small molecules.

2. The space group I2 rather than C2 if it results in a beta 
angle much closer to 90 degrees (or when it is desired to compare
the structure with a structure - possibly another crystalline 
phase of the same compound - in say I222). The same applies to
other C-centred monoclinic cells, e.g. Cc - Ia etc. (but Fd is 
discouraged).

3. The setting P21/n rather than P21/c if it results in a beta 
angles much closer to 90 etc.

These three 'unconventional' settings are also generally accepted 
by the checking software, databases and journals. To cover all 
reasonable non-standard settings, XPREP actually understands 413 
different space groups!  

Best wishes, George  

Prof. George M. Sheldrick FRS
Dept. Structural Chemistry, 
University of Goettingen,
Tammannstr. 4,
D37077 Goettingen, Germany
Tel. +49-551-39-3021 or -3068
Fax. +49-551-39-2582


On Mon, 24 Sep 2007, Phil Evans wrote:

 I think this is only a problem in the primitive orthorhombic system (at least
 I assume people don't want hexagonal axes along a, A  B centred lattices etc,
 although there is no reason in principle why not).
 
 Following some earlier discussions with Ian, Pointless now honours (and
 preserves) a reference file (HKLREF) in eg P 2 21 21, and also explicit
 reindex operations, but an initial indexing will still enforce the standard
 setting eg P 21 21 2, because I accept the reference setting from the cctbx
 library
 
 ie suppose you have a crystal which when indexed with a = b = c and
 Pointless decides unambiguously for the sake of argument)  that the axis along
 a is a 2-fold and the other two are 2(1) screws, ie space group P 2 21 21.
 
 At present this will be reindexed to the standard setting P 21 21 2, but is
 that what you want, or should it be left as abc? Which criterion takes
 precedence?
 
 Phil
 
 
 On 19 Sep 2007, at 17:54, Ian Tickle wrote:
 
  Hi Sue
  
  It's certainly true that the convention in the 1935 and 1952 editions of
  IT Volume 1 *appeared* to be the 'standard setting' convention that you
  describe because only the 'standard' settings were listed, and this was
  the way that many crystallographers interpreted it (actually only
  macromolecular crystallographers, the small molecule people stick to the
  IUCr convention), so this is probably where you're coming from.  However
  the 1983 edition of Volume A clarified the situation and made it clear
  that this was never the intention, so all the conventional settings are
  now shown on the SG diagram pages.  P22121  P21221 certainly are
  defined in IT Vol. A - look on the diagram page for SG no. 18  you'll
  see them.
  
  The 'standard symbol' for a space group is merely the heading on the
  page used only for indexing purposes, so space groups P22121, P21221 and
  P21212 all have the same standard symbol P21212; hence the standard
  symbol is not unique and can't be used to unambiguously define the space
  group.  The 'standard setting' is merely the space group setting that
  has the same name as the standard symbol.  Even if that weren't true do
  we really want to be still sticking to a convention that was abandoned
  25 years ago and doesn't a later convention override an earlier one
  anyway?
  
  Actually the convention in use is not the issue anyway, I don't care
  which convention is used as long as all programs use the same
  convention! - then I'll never need to permute axes (just as
  fundamentally I don't care which co-ordinate format is used as long as
  all programs use the same one, then I'll never need to reformat).  So
  Mosflm uses the IUCr convention (i.e. a=b=c for primitive
  orthorhombic), and therefore any program which doesn't support that
  convention for any space group forces you to permute the axes completely
  unnecessarily.
  
  -- Ian
  
   -Original Message-
   From: Sue Roberts [mailto:[EMAIL PROTECTED]
   Sent: 19 September 2007 16:38
   To: Ian Tickle
   Subject: Re: [ccp4bb] arp/warp in p22121
   
   Hi Ian
   
   But there's an older convention, which is to use the space groups
   settings defined in the International Tables - and  P22121 

Re: [ccp4bb] arp/warp in p22121: what to do in Pointless

2007-09-24 Thread Ian Tickle
Phil,

It also affects centred monoclinic: to avoid some cases of beta  120
you have to use the I121 setting instead of C121 (and it is a
conventional setting, see IT vol. A pp.126-7).  For example the
conventional (but non-standard) I121 setting: a=50 b=60 c=51 beta=90.2
would I think be re-indexed by pointless to the unconventional (but
standard) C121 setting: a=71.3 b=60 c=50 beta=134.3.

Trigonal, tetragonal  hexagonal settings with a or b unique are
definitely not only non-standard but also non-conventional: they violate
Table 9.3.4.1 in IT Vol. A and they don't appear in the diagrams, so
let's not go there!  The point about the IUCr convention is that the
cell is determined by the rules in Table 9.3.4.1 so you would never get
into that situation.  For the oP lattice the cell is determined once and
for all by the rules in exactly the same way: the systematic absences
are not taken into account because they are not always reliable, e.g. it
may be only after you've solved the TF that you can assign the SG
correctly, or at least beyond reasonable doubt, and re-indexing at that
point makes no sense.

To summarise my POV: I don't really care which convention you use, but I
think gratuitous re-indexing (i.e. without the user specifically asking
for it) is just very confusing, and this is what can happen if you stick
to the standard settings.  I'm all for educating users, and re-indexing
1 or 2 datasets can be very educational, but if you have lots of
datasets to deal with it's no longer amusing!  Sometimes re-indexing
*is* needed, e.g. because you already have isomorphous protein-ligand
structures and you want to be able to compare them easily, but this
should *only* be done to conform to a reference dataset.  I don't see
any point in re-indexing just to conform to the 'standard setting'
'non-convention' which is derived from a mis-reading of the 1952 ed of
IT (my guess is they didn't list all the conventional settings because
the publisher wanted to save money! - but at least they saw sense for
the 1983 ed.).

In any case this will only affect users of Arp/Warp, all other CCP4
programs (and most non-CCP4 programs, e.g. Shel-X, CNS etc) can handle
non-standard settings, and if you use Mosflm and always specify SG P222
for all oP cases (which I always do anyway), you would have to re-index
anyway.

-- Ian

 -Original Message-
 From: [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] On Behalf Of Phil Evans
 Sent: 24 September 2007 16:58
 To: CCP4BB@JISCMAIL.AC.UK
 Subject: Re: [ccp4bb] arp/warp in p22121: what to do in Pointless
 
 I think this is only a problem in the primitive orthorhombic system  
 (at least I assume people don't want hexagonal axes along a, A  B  
 centred lattices etc, although there is no reason in 
 principle why not).
 
 Following some earlier discussions with Ian, Pointless now honours  
 (and preserves) a reference file (HKLREF) in eg P 2 21 21, and also  
 explicit reindex operations, but an initial indexing will still  
 enforce the standard setting eg P 21 21 2, because I accept the  
 reference setting from the cctbx library
 
 ie suppose you have a crystal which when indexed with a = b 
 = c and  
 Pointless decides unambiguously for the sake of argument)  that the  
 axis along a is a 2-fold and the other two are 2(1) screws, ie space  
 group P 2 21 21.
 
 At present this will be reindexed to the standard setting P 21 21  
 2, but is that what you want, or should it be left as abc? Which  
 criterion takes precedence?
 
 Phil
 
 
 On 19 Sep 2007, at 17:54, Ian Tickle wrote:
 
  Hi Sue
 
  It's certainly true that the convention in the 1935 and 1952  
  editions of
  IT Volume 1 *appeared* to be the 'standard setting' 
 convention that  
  you
  describe because only the 'standard' settings were listed, 
 and this  
  was
  the way that many crystallographers interpreted it (actually only
  macromolecular crystallographers, the small molecule people stick  
  to the
  IUCr convention), so this is probably where you're coming from.   
  However
  the 1983 edition of Volume A clarified the situation and 
 made it clear
  that this was never the intention, so all the conventional 
 settings  
  are
  now shown on the SG diagram pages.  P22121  P21221 certainly are
  defined in IT Vol. A - look on the diagram page for SG no. 
 18  you'll
  see them.
 
  The 'standard symbol' for a space group is merely the heading on the
  page used only for indexing purposes, so space groups P22121,  
  P21221 and
  P21212 all have the same standard symbol P21212; hence the standard
  symbol is not unique and can't be used to unambiguously define the  
  space
  group.  The 'standard setting' is merely the space group 
 setting that
  has the same name as the standard symbol.  Even if that weren't  
  true do
  we really want to be still sticking to a convention that 
 was abandoned
  25 years ago and doesn't a later convention override an earlier one
  anyway?
 
  Actually the convention in 

Re: [ccp4bb] arp/warp in p22121: what to do in Pointless

2007-09-24 Thread Winter, G (Graeme)
Hi Phil,
 
Just in case anyone was used to the old default behaviour (and like to use the 
current version of Arp/wArp ;oD) could you ensure that there is an option which 
will allow the automatic reindexing to the old default setting? 
 
Thanks,
 
Graeme



From: CCP4 bulletin board on behalf of Phil Evans
Sent: Mon 24/09/2007 6:18 PM
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] arp/warp in p22121: what to do in Pointless



OK I'll try to change the default behaviour, after I've fixed the 
current round of bugs!

Sometimes reindexing is required from the Mosflm indexing even 
without a reference set, if the Laue group assignment is wrong, or 
where there is pseudo lattice symmetry (eg we have a case which is 
really R3 but pseudo I-centred cubic, so Mosflm may choose the wrong 
3-fold in its indexing)

My POV was that on the first dataset, you don't care about the 
indexing, just about the space group, but this is undermined by the 
frequent uncertainty about systematic absences.

I think the best thing to do is to choose the reindexing for the Laue 
group (or point group), and not to switch it for different possible 
space groups within the Laue group. I think this should be able to 
pick up the I121 setting of C2 (though I might have to make that a 
special case: similarly George's P21/n v. P21/c)

Phil

On 24 Sep 2007, at 17:59, Ian Tickle wrote:


 Phil,

 It also affects centred monoclinic: to avoid some cases of beta  120
 you have to use the I121 setting instead of C121 (and it is a
 conventional setting, see IT vol. A pp.126-7).  For example the
 conventional (but non-standard) I121 setting: a=50 b=60 c=51 beta=90.2
 would I think be re-indexed by pointless to the unconventional (but
 standard) C121 setting: a=71.3 b=60 c=50 beta=134.3.

 Trigonal, tetragonal  hexagonal settings with a or b unique are
 definitely not only non-standard but also non-conventional: they 
 violate
 Table 9.3.4.1 in IT Vol. A and they don't appear in the diagrams, so
 let's not go there!  The point about the IUCr convention is that the
 cell is determined by the rules in Table 9.3.4.1 so you would never 
 get
 into that situation.  For the oP lattice the cell is determined 
 once and
 for all by the rules in exactly the same way: the systematic absences
 are not taken into account because they are not always reliable, 
 e.g. it
 may be only after you've solved the TF that you can assign the SG
 correctly, or at least beyond reasonable doubt, and re-indexing at 
 that
 point makes no sense.

 To summarise my POV: I don't really care which convention you use, 
 but I
 think gratuitous re-indexing (i.e. without the user specifically 
 asking
 for it) is just very confusing, and this is what can happen if you 
 stick
 to the standard settings.  I'm all for educating users, and re-
 indexing
 1 or 2 datasets can be very educational, but if you have lots of
 datasets to deal with it's no longer amusing!  Sometimes re-indexing
 *is* needed, e.g. because you already have isomorphous protein-ligand
 structures and you want to be able to compare them easily, but this
 should *only* be done to conform to a reference dataset.  I don't see
 any point in re-indexing just to conform to the 'standard setting'
 'non-convention' which is derived from a mis-reading of the 1952 ed of
 IT (my guess is they didn't list all the conventional settings because
 the publisher wanted to save money! - but at least they saw sense for
 the 1983 ed.).

 In any case this will only affect users of Arp/Warp, all other CCP4
 programs (and most non-CCP4 programs, e.g. Shel-X, CNS etc) can handle
 non-standard settings, and if you use Mosflm and always specify SG 
 P222
 for all oP cases (which I always do anyway), you would have to re-
 index
 anyway.

 -- Ian

 -Original Message-
 From: [EMAIL PROTECTED]
 [mailto:[EMAIL PROTECTED] On Behalf Of Phil Evans
 Sent: 24 September 2007 16:58
 To: CCP4BB@JISCMAIL.AC.UK
 Subject: Re: [ccp4bb] arp/warp in p22121: what to do in Pointless

 I think this is only a problem in the primitive orthorhombic system
 (at least I assume people don't want hexagonal axes along a, A  B
 centred lattices etc, although there is no reason in
 principle why not).

 Following some earlier discussions with Ian, Pointless now honours
 (and preserves) a reference file (HKLREF) in eg P 2 21 21, and also
 explicit reindex operations, but an initial indexing will still
 enforce the standard setting eg P 21 21 2, because I accept the
 reference setting from the cctbx library

 ie suppose you have a crystal which when indexed with a = b
 = c and
 Pointless decides unambiguously for the sake of argument)  that the
 axis along a is a 2-fold and the other two are 2(1) screws, ie space
 group P 2 21 21.

 At present this will be reindexed to the standard setting P 21 21
 2, but is that what you want, or should it be left as abc? Which
 criterion takes precedence?

 Phil


 On 19 Sep 2007, at 17:54, Ian