Jacob Keller

There is a service that will hopefully do the calculation you want, search for a template of residues (eg. catalytic-triads). This is just a through space superposition of atoms.

It is part of a tool set of fragments found by knowledge based analysis (pdbetemplate) ....

http://www.ebi.ac.uk/pdbe-as/pdbetemplate/

and there is a tool called DB search accessible from this page , or directly from here...

http://www.ebi.ac.uk/pdbe-as/pdbesearch/PDBeSearch.jsp


You can upload a small template of residues (up to 6) in PDB format using the
[upload and analyse] button.  The search is weighted based
on occupancy - where weights for each atom a defined by the occupancy value.

You can specify to just the CA atoms, the side chain (based on occupancy), and
use fuzzy matching (slow) where ASP looks like GLU  etc.

Here is an example PDB file fragment you can use to try out the search - it
is a catalytic quartet of residues - a superset of the catalytic triad.

Regards
Tom Oldfield, PDBe.


HEADER HYDROLASE (SERINE PROTEINASE) 22-JAN-85 5CHA 5CHA 3 COMPND ALPHA CHYMOTRYPSIN A (E.C.3.4.21.1) 5CHA 4 SOURCE COW (BOS $TAURUS) 5CHA 5
AUTHOR R.A.BLEVINS,A.TULINSKY 5CHA   6
SITE 1 CTA 3 HIS A 57 ASP A 102 SER A 195 5CHA 160 ATOM 403 N HIS A 57 28.981 28.453 20.681 1.00 10.49 5CHA 550 ATOM 404 CA HIS A 57 28.691 28.510 22.151 1.00 10.51 5CHA 551 ATOM 405 C HIS A 57 29.897 28.096 22.990 1.00 11.71 5CHA 552 ATOM 406 O HIS A 57 29.926 28.441 24.252 1.00 11.27 5CHA 553 ATOM 407 CB HIS A 57 27.441 27.724 22.518 1.00 10.23 5CHA 554 ATOM 408 CG HIS A 57 27.711 26.268 22.692 1.00 11.48 5CHA 555 ATOM 409 ND1 HIS A 57 27.449 25.299 21.720 1.00 11.56 5CHA 556 ATOM 410 CD2 HIS A 57 28.206 25.605 23.753 1.00 10.62 5CHA 557 ATOM 411 CE1 HIS A 57 27.787 24.143 22.229 1.00 12.29 5CHA 558 ATOM 412 NE2 HIS A 57 28.219 24.290 23.459 1.00 12.00 5CHA 559 ATOM 729 N ASN A 100 21.139 29.092 17.682 1.00 15.17 5CHA 876 ATOM 730 CA ASN A 100 21.091 28.846 16.199 1.00 14.91 5CHA 877 ATOM 731 C ASN A 100 22.361 29.384 15.589 1.00 14.15 5CHA 878 ATOM 732 O ASN A 100 22.993 30.366 16.007 1.00 14.62 5CHA 879 ATOM 733 CB ASN A 100 19.865 29.473 15.542 1.00 14.99 5CHA 880 ATOM 734 CG ASN A 100 19.459 28.894 14.195 1.00 15.90 5CHA 881 ATOM 735 OD1 ASN A 100 19.768 27.721 13.910 1.00 15.64 5CHA 882 ATOM 736 ND2 ASN A 100 18.776 29.679 13.303 1.00 15.91 5CHA 883 ATOM 745 N ASP A 102 25.217 27.828 15.245 1.00 8.90 5CHA 892 ATOM 746 CA ASP A 102 26.263 27.454 16.230 1.00 8.03 5CHA 893 ATOM 747 C ASP A 102 27.602 26.986 15.681 1.00 7.56 5CHA 894 ATOM 748 O ASP A 102 28.030 25.875 15.937 1.00 6.97 5CHA 895 ATOM 749 CB ASP A 102 25.612 26.547 17.282 1.00 7.61 5CHA 896 ATOM 750 CG ASP A 102 26.373 26.259 18.576 1.00 7.61 5CHA 897 ATOM 751 OD1 ASP A 102 26.215 25.289 19.348 1.00 6.70 5CHA 898 ATOM 752 OD2 ASP A 102 27.250 27.077 18.886 1.00 7.66 5CHA 899 ATOM 1404 N SER A 195 30.503 18.936 23.287 1.00 10.84 5CHA1551 ATOM 1405 CA SER A 195 30.268 20.144 22.534 1.00 11.39 5CHA1552 ATOM 1406 C SER A 195 31.398 20.500 21.546 1.00 11.49 5CHA1553 ATOM 1407 O SER A 195 32.602 20.510 21.836 1.00 10.92 5CHA1554 ATOM 1408 CB SER A 195 30.081 21.398 23.463 1.00 11.44 5CHA1555 ATOM 1409 OG SER A 195 28.708 21.332 23.826 1.00 13.00 5CHA1556


On 27/06/14 21:49, Keller, Jacob wrote:
I have wanted for some time to search for catalytic-triad-like configurations 
by defining three Ca-Cb bonds from known catalytic triads, then searching the 
pdb for matches, but have not thought of a quick and/or easy way to do 
this--can your software do this sort of thing, or is there some other software 
which could be used for this?

JPK

-----Original Message-----
From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of Dale 
Tronrud
Sent: Friday, June 27, 2014 4:27 PM
To: CCP4BB@JISCMAIL.AC.UK
Subject: [ccp4bb] New Version of the Protein Geometry Database Now Available

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

             Protein Geometry Database Server V 1.0
              http://pgd.science.oregonstate.edu/
Developed by Andy Karplus' laboratory at Oregon State University

    We are pleased to announce the availability of an enhanced version of the 
Protein Geometry Database (PGD) web service, originally announced in Berkholz 
et al (2010) Nucleic Acids Research 38, D320-5.
This server allows you to explore the many backbone and side chain conformations that 
exist in the PDB as well as the protein geometry (lengths and angles) that occur in those 
conformations. This service is ideal for finding instances of particular conformations or 
peculiar bond lengths or angles.  It is also quite adept at identifying sets of fragments 
that can then be examined for systematic variation in "ideal" geometry. The 
expanded PGD now includes all conformational and covalent geometry information not just 
for the backbone but also for the sidechains.

    There are three basic operations available: selecting a set of fragments 
via a delimited search, analyzing the geometry of those fragments, and dumping 
the results to your computer for more specialized analysis.

    To control bias in statistical analyses due to the variable number of 
entries with the same or similar sequence, the database contains only the 
highest quality model in each sequence cluster as identified by the Pisces 
server from Roland Dunbrack's lab.  Two settings, 90% and 25% sequence 
identity, are available.  Currently, at the 90% sequence identity level there 
are 16,000 chains and at the 25% level this drops to about 11,000 chains.

    You can filter a search based on the quality of the model as indicated by resolution 
and R values.  A search can also be filtered based on DSSP secondary structure, amino 
acid type, the phi/psi/omega angles and bond lengths, angles, and chi angles.  For 
example, you can find all cysteine residues in the center of three-residue peptide 
fragments (i.e. not at a peptide terminus), in beta sheet, with both peptide bonds trans, 
and CB-SG length greater than 1.85 A from models with resolution better than 1.5 A.  By 
the way, in the "no more than 25% sequence identity" category there are 25 of 
them.

    Once you have a set of results, you can create 2D plots showing the 
relationships of up to three features (i.e. bond lengths, bond angles, or 
conformational angles).  For instance, you can look at how a given feature 
varies with phi and psi using a phi(i)/psi(i) plot.  Or, you can just as easily 
look at the variation with psi(i)/phi(i+1), or even the relationships between 
any selected bond angles.  As one example, it is instructive to perform a 
default search and plot NCaCb vs NCaC colored based on CbCaC.  As this search 
is restricted to just the highest resolution models, you can see the 
justification for chiral volume restraints.

    Finally, all of your results can be downloaded for your own analysis.

    Development of the PGD continues.  If you have worked with the site and 
have any ideas and suggestions for how to improvement it, please drop us a line.

    The publication describing the PGD is:

Berkholz, D.S., Krenesky, P.B., Davidson, J.R.,  & Karplus, P.A.
(2010) Protein Geometry Database: A flexible engine to explore backbone 
conformations and their relationships with covalent geometry.
Nucleic Acids Res. 38, D320-5.

    Also, some examples of published analyses enabled by earlier versions of 
the PGD are listed here:.

Berkholz, D.S., Shapovalov, M.V., Dunbrack, R.L.J. & Karplus, P.A.
(2009). Conformation dependence of backbone geometry in proteins.
Structure 17, 1316-1325.

Hollingsworth, S.A., Berkholz, D.S. & Karplus, P.A. (2009). On the occurrence 
of linear groups in proteins. Protein Science 18, 1321-1325

Hollingsworth, S.A. & Karplus, P. A. (2010). Review: A fresh look at the 
Ramachandran plot and the occurrence of standard structures in proteins. 
BioMolecular Concepts 1, 271-283.

Berkholz, D.S., Driggers, C.M., Shapovalov, M.V., Dunbrack, R.L., Jr.
& Karplus P.A. (2012) Nonplanar peptide bonds in proteins are common and 
conserved but not biased toward active sites. Proc Natl Acad Sci U S A.  109, 
449-53.

Dale Tronrud & P. Andrew Karplus
Department of Biochemistry and Biophysics Oregon State University -----BEGIN 
PGP SIGNATURE-----
Version: GnuPG v2.0.22 (MingW32)
Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/

iEYEARECAAYFAlOt044ACgkQU5C0gGfAG12aZACdG68Vyjw7JJimw0ofMZrJQLu8
B1IAn0Q5xx8ptRosgMXswdYmdjNKyFkA
=D63d
-----END PGP SIGNATURE-----

--
Tom Oldfield , PhD
Team Leader
Head of PDBe Databases and Services
Protein Databank in Europe
European Bioinformatics Institute (EMBL-EBI)
European Molecular Biology Laboratory
Wellcome Trust Gemome Campus
Hinxton
Cambridge  CB10 1SD
United Kingdom
Tel : ++44 (0) 1223 492526

Reply via email to