Jenny,
I of course would suggest that you follow Olve's advice, and use
theseus to do a maximum likelihood, simultaneous superposition of all
your structures ( http://www.theseus3d.org ). The variable bits,
like your loop, will be naturally down-weighted in a rigorous
statistical manner. Then you can look at the average structure file
that is output (_ave.pdb at the end of the filename), and the B-
factor column has the overall RMSD for each atom in there. You can
look at the full superposition (the _sup.pdb file) in rasmol or in
pymol with the 'set all_states, on' command.
However, if you really need to do the very analysis that you asked
about, the following bash script will do exactly that with theseus
(you need both awk and theseus in your executable path). It prints
out the average RMSD for the atoms you specify in the loop, after
pairwise least-squares superpositioning on all atoms other than the
loop, for all possible pairwise combinations of your pdb files. (Note
that in this script all backslashes '\' must have a carriage return
immediately after them.) You will need to change the lower and upper
values at the top of the script (inclusive for the loop you want
excluded). You invoke the script something like "karen.sh pdb1.pdb
pdb2.pdb pdb3.pdb" or "karen.sh *.pdb" to do all the .pdbs in one
directory. If you have any problems or have other specific
superpositioning issues I'm glad to help out.
Cheers,
Douglas
karen.sh
#################################
#!/bin/bash
# everything including and between lower and upper
# is excluded from the superposition
lower=40;
upper=60;
pdbs=($*);
for (( i = 0; i < [EMAIL PROTECTED]; ++i ))
do
for (( j = 0; j < i; ++j ))
do
name="${pdbs[i]%.*}_${pdbs[j]%.*}";
theseus -l -r ${name} -S ${lower}-${upper} ${pdbs[i]} ${pdbs[j]}\
> ${name}.log;
rmsd=$(cut -c 7-11,61-67 ${name}_ave.pdb |\
awk '{if ($1 > lo && $1 < up) {sum += $2; n++}}; END {print sum/
n}'\
lo=${lower} up=${upper});
echo "${name} rmsd = ${rmsd}";
done
done
####################################
^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`^`
Douglas L. Theobald
Department of Biochemistry
Brandeis University
Waltham, MA 02454-9110
[EMAIL PROTECTED]
^\
/` /^. / /\
/ / /`/ / . /`
/ / ' '
'
On May 11, 2007, at 1:58 PM, Olve Peersen wrote:
I would highly recommend Doug Theobald's program Theseus for this -
the pictures at www.theseus3d.org say it all. Theseus does maximum
likely hood superimpositions of multiple structures (i.e. NOT
pairwise against a "master" copy), and the real beauty of it is
that you don't have to pick which residues you want to
superimpose. Places where the whole set of structures show
divergence are effectively down-weighted and don't contribute much
to the final solution vs. least squares where every atom position
has equal weight and the "bad" parts screw up the alignment of the
"good" parts. For this, I would do a Theseus superposition of all
the structures and then analyze the set of superimposed structures
by whatever method you want (e.g. rmsd of variances in important
sections of the structures).
- Olve
-------------------------------------------------------
Olve Peersen
Associate Professor
Dept. of Biochemistry & Molecular Biology
1870 Campus Delivery
Colorado State University
Ft. Collins, CO 80523-1870
-------------------------------------------------------
970.491-0433 Office (MRB 279)
970.491-0271 Lab (MRB 149)
970.491-0494 Fax
[EMAIL PROTECTED]
-------------------------------------------------------
On May 11, 2007, at 11:15 AM, Donnie Berkholz wrote:
Eleanor Dodson wrote:
It is a bit clunky - you can use siperpose molecules - fit
residues to
fit a selected range (1-40; 60-100 say) and write out a complete
fitted
pdb file. Then you could use a VERY old program
compar xyzin1 original.pdb xyzin2 fitted.pdb (xyzin3 another.pdb)
and it will match all pairs with the same RESIDUE ID and give the
RMSD
distance
There is documentation for it.
There's a nice (non-CCP4) program called ProFit that does a pretty
nice
job of superimposing with a lot of flexibility.
Thanks,
Donnie
On May 10, 2007, at 6:45 PM, Jenny wrote:
Hi, All,
I have a question about rmsd calculation.
I have some pdbs (100 residues ) and these pdbs differ pretty
much only the loop region 40-60. Is there any easy way that I can
superimpose the fixed region ( 1-40,60-100) and then calculate
the rmsd for the loop?I need to calculate for each pair, so if
there is any script or program available to do this quickly, that
would be great.
Thanks.
Jenny