Dear all, I had this experience: going pedantically to the individual points the RSRZ and other validation statistics in the form were reporting - in a vast majority of the cases nothing was wrong at all. So it seems to be somewhat overdoing its job - not that this is bad on its own - but we are losing contrast quite a bit between the really serious issues on ... all the other.
Jan Dohnalek On Wed, Nov 30, 2016 at 2:33 AM, dusan turk <dusan.t...@ijs.si> wrote: > Guys, > > I have a two issues to add here: > > 1. RSZS validation does not tolerate chain IDs longer than 1 character, so > it kills one of the very essential reasons why mmCIF format was introduced > (to enable deposition of large structures in a single file). > > 2. I have noticed in validation report of my own structure (4PIA) that the > RSZS does not ALWAYS work right. For example, the "PHE 63" is well resolved > with a hole in the ring, yet the validation declares it as a density > outlier. Besides there are several other residues in this structure that > "sit" well in the density, but are considered outliers, whereas several, > for which side chains the density is missing, are not listed. > > Has anyone else had a similar experience? > > Taken all remarks together they suggest that something needs to be done > with RSZS software or density validation procedure to resolve these issues. > > ???? > > best, > dusan > > > On 30/11/16 01:00, CCP4BB automatic digest system wrote: > >> Date: Mon, 28 Nov 2016 20:35:44 -0800 >> From: Pavel Afonine<pafon...@gmail.com> >> Subject: Re: Calculation of RSRZ Score in PDB Validation Reports >> >> I find Lothar's comments regarding H and RSRZ excellent! I would think of >> it as a pretty much bug report. I hope developers at that end listen. This >> goes very well in line with Phoebe's comment earlier today. >> >> Pavel >> >> On Mon, Nov 28, 2016 at 2:51 PM, Dale Tronrud<de...@daletronrud.com> >> wrote: >> >> On 11/28/2016 12:52 PM,esse...@helix.nih.gov wrote: >>> >>>> I found that one can get RSRZ to go way down by loosening the geometry >>>>> restraints. The result is a crappy structure and I don't recommend >>>>> >>>> doing >>> >>>> that, but it does get all the atoms crammed into some sort of density. >>>>> >>>> Your observation is quite interesting. I can add this: when we were >>>> >>> working >>> >>>> with low to medium resolution structures, deleting the hydrogen atoms >>>> >>> from >>> >>>> the model after refinement moved the very bad RSRZ statistic to about >>>> the >>>> average in the given resolution range! Note, no re-refinement was done >>>> >>> just >>> >>>> a simple deletion of the riding H-atoms. I find this to be odd given the >>>> fact that, say the phenix developers favor the inclusion of H-atoms on >>>> riding positions even in cases of low resolution structures. (I assume >>>> >>> the >>> >>>> refmac5 and BUSTER-TNT developers have also a favorable opinion about >>>> including H-atoms in the final model - and during refinement). >>>> >>>> In my mind, it may be tempting to delete H-atoms to improve this >>>> >>> statistic but >>> >>>> when you use them in refinement they should be included regardless of >>>> the >>>> outcome of the RSRZ analysis. >>>> >>> Of course, if you trick a validation statistic like this you haven't >>> accomplished anything. All you are saying is that one should rank RSRZ >>> scores with and without hydrogen atoms separately. Perhaps you should >>> suggest that to the PDB validation people. >>> >>> Dale Tronrud >>> >>>> RSRZ, in my most humble of opinions, seems like one of those statistics >>>>> >>>> that >>> >>>> is far more useful in theory than reality. Particularly for >>>>> medium-resolution structures, the fit of each entire side chain to the >>>>> >>>> density >>> >>>> is likely to be imperfect because the density is imperfect, especially >>>>> >>>> toward >>> >>>> the tips of those side chains. >>>>> >>>>> Then again, it can be a good flag for bits of the structure worth a >>>>> >>>> second >>> >>>> look in rebuilding. >>>>> >>>> The latter is certainly true. It may mean that the developers of RSRZ >>>> analysis need to tune it a bit to make it fully useful. >>>> >>>> L. >>>> >>>> ________________________________ >>>>> From: CCP4 bulletin board [CCP4BB@JISCMAIL.AC.UK] on behalf of Matthew >>>>> Bratkowski [mab...@cornell.edu] >>>>> Sent: Tuesday, November 22, 2016 10:12 AM >>>>> To:CCP4BB@JISCMAIL.AC.UK >>>>> Subject: [ccp4bb] Calculation of RSRZ Score in PDB Validation Reports >>>>> >>>>> Hello all, >>>>> >>>>> I was wondering if anyone knew how the RSRZ score was calculated in the >>>>> protein data bank validation reports and how useful of a metric this >>>>> >>>> actually >>> >>>> is for structure validation? I am trying to improve this score on a >>>>> >>>> structure >>> >>>> that I am working on, but I'm not really sure where to begin. From my >>>>> understanding, the score is based on the number of RSRZ outliers with a >>>>> >>>> score >>> >>>> 2. In my case, I have several residues with scores between 2 and 4, >>>>>> >>>>> but at >>> >>>> least by eye, fit to the electron density does not look that bad. >>>>> >>>> Hence, I >>> >>>> can't justify deleting them to try to improve the score. If the score >>>>> >>>> is just >>> >>>> based on percent of outlier residues, then for instance wouldn't a >>>>> >>>> structure >>> >>>> with say 20 residues modeled with no corresponding electron density >>>>> >>>> have the >>> >>>> same score as a structure with 20 residues with RSRZ values of say 2.5? >>>>> >>>>> >>>>> I was also wondering how the resolution of the structure relates to the >>>>> >>>> score? >>> >>>> Glancing through several pdb validation reports, I noticed some >>>>> >>>> structure >>> >>>> with low resolution (3.5 A or lower) with relatively high scores, while >>>>> >>>> others >>> >>>> with high resolution (2 A or higher) getting low scores. It is >>>>> >>>> reasonable to >>> >>>> assume that a structure of lower than 3.5 A would be missing several >>>>> >>>> side >>> >>>> chains and may also have some ambiguous main chain electron density, >>>>> >>>> which >>> >>>> should in theory increase the RSRZ score. While of course every >>>>> >>>> structure is >>> >>>> different and the quality of it due to the rigor of the person building >>>>> >>>> the >>> >>>> model, I was wondering if there were any general trends related to >>>>> >>>> resolution >>> >>>> and RSRZ score. >>>>> >>>>> Thanks, >>>>> Matt >>>>> >>>>> ------------------------------ >> >> Date: Mon, 28 Nov 2016 21:46:24 -0800 >> From: Ethan Merritt<merr...@u.washington.edu> >> Subject: Re: Calculation of RSRZ Score in PDB Validation Reports >> >> On Monday, 28 November 2016 08:35:44 PM Pavel Afonine wrote: >> >>> I find Lothar's comments regarding H and RSRZ excellent! I would think of >>> it as a pretty much bug report. I hope developers at that end listen. >>> This >>> goes very well in line with Phoebe's comment earlier today. >>> >> I guess I'm a bit surprised that adding or subtracting hydrogens from the >> model >> without re-refining or at least re-calculating Fc would affect RSRZ at >> all. >> I had thought that RSRZ was obtained by comparing density in an Fc map >> (or probably mFo-DFc) with the corresponding density in an Fo map. >> I thought that the coordinates were used only to determine the per-residue >> region of the map to be compared. >> >> Going back to the 2004 Kleywegt paper that the PDB cites for calculation >> of >> RSRZ I see that it's a bit ambiguous exactly what maps are being compared. >> So maybe I'm wrong and the current coordinates are used directly to get >> local "Fc density" by expanding 3D Gaussians without reference to a >> previously >> calculated map from refined phases. >> >> Can anyone clarify exactly what maps are being compared during wwPDB >> validation? >> >> Ethan >> >> > -- > Dr. Dusan Turk, Prof. > Head of Structural Biology Group http://bio.ijs.si/sbl/ > Head of Centre for Protein and Structure Production > Centre of excellence for Integrated Approaches in Chemistry and Biology of > Proteins, Scientific Director > http://www.cipkebip.org/ > Professor of Structural Biology at IPS "Jozef Stefan" > e-mail: dusan.t...@ijs.si > phone: +386 1 477 3857 Dept. of Biochem.& Mol.& Struct. Biol. > fax: +386 1 477 3984 Jozef Stefan Institute > Jamova 39, 1 000 Ljubljana,Slovenia > Skype: dusan.turk (voice over internet: www.skype.com > -- Jan Dohnalek, Ph.D Institute of Biotechnology Academy of Sciences of the Czech Republic Biocev Prumyslova 595 252 50 Vestec near Prague Czech Republic Tel. +420 325 873 758