Phil Evans wrote:
> ***  For details on how to be removed from this list visit the  ***
> ***          CCP4 home page http://www.ccp4.ac.uk         ***
> 
> 
> Since this seems to be causing endless confusion, here is the
> definition used in Scala for <I/sigmaI> which is what I report to the
> PDB. This is the table column labelled Mn(I)/sd and printed in the
> summary at the end (in the latest version only)
> 
> 
> After scaling, for each unique reflection h we have several
> observations of the intensity Ih and an estimated "corrected" error
> estimate sd(I)
> 
>>From these we calculate a weighted mean <Ih> and an error estimate of
> the mean sd(<Ih>), and a ratio for that unique reflection
> <Ih>/sd(<Ih>)
> 
> What is printed as Mn(I)/sd is the mean value of that ratio for all
> reflections (possibly in a resolution bin) ie
> 
> <<Ih>/sd(<Ih>)>
> 
> This is an estimate of the average signal to noise. Its value does as
> has been frequently pointed out depend on the sd estimates being
> correct, which is always a doubtful proposition, but that's another
> story . . .
> 
> 
> 
> Note the the source code of Scala is in the CCP4 distribution and
> anyone may look in it (look in subroutines ad5sts and  prdres)
> 
> Best wishes
> Phil Evans
> 
> 
> 
> Edward A. Berry writes:
>   >
>   >
>   > Edwin Pozharski wrote:
>   > >
>   > > I just want to point out that what is requested upon uploading data to 
> the
>   > > Protein DataBank is
>   > >
>   > > NET I OVER AVERAGE SIGMA I
>   > >
>   > > To me it sounds pretty much like <I>/<sigmaI>.
>   > >
>   > Yes, ADIT asks for:
>   >     Net I over average sigma(I)    19
>   > And for the last resolution shell there is a different definition:
>   >     Mean I over sigma(I) (observed) 2.71
>   >
>   > But when the PDB file is produced, both values are presented as:
>   > REMARK 200  <I/SIGMA(I)> FOR THE DATA SET  : 19.0000
>   >
>   > ...|.
>   > REMARK 200  <I/SIGMA(I)> FOR SHELL         : 2.710
>   >
>   >
>   > As to whether I/Sigma refers to unique reflections or measurements,
>   > there is also the question, before or after adding partials?
>   > And the cutoff criterion by which we decide which measurements are
>   > classified as observations, which I understand should be -3 for
>   > scalepack users: is that a cutoff on the raw (partial) measurement
>   > or on the full measurement generated by summing partials? I had the
>   > impression that the default -3 sigma cutoff in scalepack was for the
>   > raw measurements, although reading the current manual I can't find
>   > anything to justify that.
>   >
>   > As users of proprietary (closed-source) software, we depend on the
>   > authors for definition of the output values.
>   >
>   > As a biologist I thought I could just collect data, run the programs,
>   > and deposit my structure. Now I'm getting all confused!
>   >
>   > Ed
>   >
>   > > Ed.
>   > >
>   > >
>   > >>Anthony Duff wrote:
>   > >>
>   > >>
>   > >>>
>   > >>>When I asked about "I on sig I" as to whether one should report: (1)
>   > >>><I/SIGI>; (2) <I>/<SIGI>; or (3) I/SIGI, the responses were that it is
>   > >>><I/SIGI> that should be reported, although it seems that "I/SIGI"
>   > >>>cannot be reasonably interpreted as anything other than <I/SIGI>.
>   > >>>
>   > >>>Bart Hazes put it most clearly (noting that Jim Pflugrath is uncertain
>   > >>>that reporting I/SIGI has much merit all):
>   > >>>
>   > >>>
>   > >>>>- Well because of the first commandment. "Thou shalt report <I/SigI>"
>   > >>>>
>   > >>>>- Without it a Table 1 wouldn't be a Table 1 would it?
>   > >>>
>   > >>>
>   > >>>As Fred. Vellieux said
>   > >>>
>   > >>>
>   > >>>>You can compute a column containing I/SIGI using SFTOOLS,
>   > >>>>then compute its average value.
>   > >>>
>   > >>>in detail...
>   > >>>
>   > >>>    sftools read mymtzfile.mtz complete        # calculate
>   > >>>    completeness in 20 bins calc col IoSI = col IMEAN col SIGIMEAN /
>   > >>>           # polish mathematics.         # creates IoSI =
>   > >>>    IMEAN/SIGIMEAN plot col IoSI versus resol      # gives average
>   > >>>    (IoSI) in 20 bins checkhkl        # read average (IoSI)
>   > >>>
>   > >>>
>   > >>>
>   > >>>Having said all that, I note that colleagues using scalepack are
>   > >>>liable to report:
>   > >>><I>/<SIGI>, calculating it themselves using "average I" and and
>   > >>>"average error" from the last table of the scalepack log file,
>   > >>>or
>   > >>><I/SIGI>, but not for unique reflections, but for all reflections,
>   > >>>taken from the last line of the table "Summary of reflection
>   > >>>intensities and R-factors by batch number".  A quick investigation has
>   > >>>revealed that I/SIGI for all reflections can be very much greater than
>   > >>>I/SIGI for unique reflections.
>   > >>>
>   > >>>It seems to me that it is impossible to obtain the correct <I/SIGI>
>   > >>>from a scalepack log file.
>   > >>>Is this correct?
>   > >>>
>   > >>>I'm wondering if there is any consistency in the value to be found in
>   > >>>the headers of pdb files following "REMARK 200  <I/SIGMA(I)> FOR THE
>   > >>>DATA SET"
>   > >>>
>   > >>>Anthony
>   > >>>
>   >
> 

Reply via email to