Jenny,

I of course would suggest that you follow Olve's advice, and use theseus to do a maximum likelihood, simultaneous superposition of all your structures ( http://www.theseus3d.org ). The variable bits, like your loop, will be naturally down-weighted in a rigorous statistical manner. Then you can look at the average structure file that is output (_ave.pdb at the end of the filename), and the B- factor column has the overall RMSD for each atom in there. You can look at the full superposition (the _sup.pdb file) in rasmol or in pymol with the 'set all_states, on' command.

However, if you really need to do the very analysis that you asked about, the following bash script will do exactly that with theseus (you need both awk and theseus in your executable path). It prints out the average RMSD for the atoms you specify in the loop, after pairwise least-squares superpositioning on all atoms other than the loop, for all possible pairwise combinations of your pdb files. (Note that in this script all backslashes '\' must have a carriage return immediately after them.) You will need to change the lower and upper values at the top of the script (inclusive for the loop you want excluded). You invoke the script something like "karen.sh pdb1.pdb pdb2.pdb pdb3.pdb" or "karen.sh *.pdb" to do all the .pdbs in one directory. If you have any problems or have other specific superpositioning issues I'm glad to help out.

Cheers,

Douglas


karen.sh
#################################

#!/bin/bash

# everything including and between lower and upper
# is excluded from the superposition
lower=40;
upper=60;

pdbs=($*);

for (( i = 0; i < [EMAIL PROTECTED]; ++i ))
do
  for (( j = 0; j < i; ++j ))
  do
    name="${pdbs[i]%.*}_${pdbs[j]%.*}";
    theseus -l -r ${name} -S ${lower}-${upper} ${pdbs[i]} ${pdbs[j]}\
            > ${name}.log;
    rmsd=$(cut -c 7-11,61-67 ${name}_ave.pdb |\
awk '{if ($1 > lo && $1 < up) {sum += $2; n++}}; END {print sum/ n}'\
    lo=${lower} up=${upper});
    echo "${name} rmsd = ${rmsd}";
  done
done


####################################


^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`
Douglas L. Theobald
Department of Biochemistry
Brandeis University
Waltham, MA  02454-9110

[EMAIL PROTECTED]

             ^\
   /`  /^.  / /\
  / / /`/  / . /`
 / /  '   '
'


On May 11, 2007, at 1:58 PM, Olve Peersen wrote:

I would highly recommend Doug Theobald's program Theseus for this - the pictures at www.theseus3d.org say it all. Theseus does maximum likely hood superimpositions of multiple structures (i.e. NOT pairwise against a "master" copy), and the real beauty of it is that you don't have to pick which residues you want to superimpose. Places where the whole set of structures show divergence are effectively down-weighted and don't contribute much to the final solution vs. least squares where every atom position has equal weight and the "bad" parts screw up the alignment of the "good" parts. For this, I would do a Theseus superposition of all the structures and then analyze the set of superimposed structures by whatever method you want (e.g. rmsd of variances in important sections of the structures).

- Olve

-------------------------------------------------------
Olve Peersen
Associate Professor
Dept. of Biochemistry & Molecular Biology
1870 Campus Delivery
Colorado State University
Ft. Collins, CO  80523-1870
-------------------------------------------------------
970.491-0433    Office  (MRB 279)
970.491-0271    Lab     (MRB 149)
970.491-0494    Fax
[EMAIL PROTECTED]
-------------------------------------------------------

On May 11, 2007, at 11:15 AM, Donnie Berkholz wrote:

Eleanor Dodson wrote:
It is a bit clunky - you can use siperpose molecules - fit residues to fit a selected range (1-40; 60-100 say) and write out a complete fitted
pdb file. Then you could use a VERY old program
compar  xyzin1 original.pdb xyzin2 fitted.pdb  (xyzin3 another.pdb)
and it will match all pairs with the same RESIDUE ID and give the RMSD
distance

There is documentation for it.

There's a nice (non-CCP4) program called ProFit that does a pretty nice
job of superimposing with a lot of flexibility.

Thanks,
Donnie

On May 10, 2007, at 6:45 PM, Jenny wrote:
Hi, All,

I have a question about rmsd calculation.

I have some pdbs (100 residues ) and these pdbs differ pretty much only the loop region 40-60. Is there any easy way that I can superimpose the fixed region ( 1-40,60-100) and then calculate the rmsd for the loop?I need to calculate for each pair, so if there is any script or program available to do this quickly, that would be great.

Thanks.

Jenny

Reply via email to