Re: [ccp4bb] New Version of the Protein Geometry Database Now Available

Tom Oldfield Mon, 30 Jun 2014 07:36:22 -0700

Jacob Keller

There is a service that will hopefully do the calculation you want,search for a template ofresidues (eg. catalytic-triads). This is just a through spacesuperposition of atoms.

It is part of a tool set of fragments found by knowledge based analysis(pdbetemplate) ....


http://www.ebi.ac.uk/pdbe-as/pdbetemplate/

and there is a tool called DB search accessible from this page , ordirectly from here...


http://www.ebi.ac.uk/pdbe-as/pdbesearch/PDBeSearch.jsp

You can upload a small template of residues (up to 6) in PDB formatusing the

[upload and analyse] button.  The search is weighted based
on occupancy - where weights for each atom a defined by the occupancy value.

You can specify to just the CA atoms, the side chain (based onoccupancy), and

use fuzzy matching (slow) where ASP looks like GLU  etc.

Here is an example PDB file fragment you can use to try out the search - it
is a catalytic quartet of residues - a superset of the catalytic triad.

Regards
Tom Oldfield, PDBe.

HEADER HYDROLASE (SERINE PROTEINASE) 22-JAN-85 5CHA5CHA 3COMPND ALPHA CHYMOTRYPSIN A (E.C.3.4.21.1)5CHA 4SOURCE COW (BOS $TAURUS)5CHA 5

AUTHOR R.A.BLEVINS,A.TULINSKY 5CHA   6

SITE 1 CTA 3 HIS A 57 ASP A 102 SER A 1955CHA 160ATOM 403 N HIS A 57 28.981 28.453 20.681 1.00 10.495CHA 550ATOM 404 CA HIS A 57 28.691 28.510 22.151 1.00 10.515CHA 551ATOM 405 C HIS A 57 29.897 28.096 22.990 1.00 11.715CHA 552ATOM 406 O HIS A 57 29.926 28.441 24.252 1.00 11.275CHA 553ATOM 407 CB HIS A 57 27.441 27.724 22.518 1.00 10.235CHA 554ATOM 408 CG HIS A 57 27.711 26.268 22.692 1.00 11.485CHA 555ATOM 409 ND1 HIS A 57 27.449 25.299 21.720 1.00 11.565CHA 556ATOM 410 CD2 HIS A 57 28.206 25.605 23.753 1.00 10.625CHA 557ATOM 411 CE1 HIS A 57 27.787 24.143 22.229 1.00 12.295CHA 558ATOM 412 NE2 HIS A 57 28.219 24.290 23.459 1.00 12.005CHA 559ATOM 729 N ASN A 100 21.139 29.092 17.682 1.00 15.175CHA 876ATOM 730 CA ASN A 100 21.091 28.846 16.199 1.00 14.915CHA 877ATOM 731 C ASN A 100 22.361 29.384 15.589 1.00 14.155CHA 878ATOM 732 O ASN A 100 22.993 30.366 16.007 1.00 14.625CHA 879ATOM 733 CB ASN A 100 19.865 29.473 15.542 1.00 14.995CHA 880ATOM 734 CG ASN A 100 19.459 28.894 14.195 1.00 15.905CHA 881ATOM 735 OD1 ASN A 100 19.768 27.721 13.910 1.00 15.645CHA 882ATOM 736 ND2 ASN A 100 18.776 29.679 13.303 1.00 15.915CHA 883ATOM 745 N ASP A 102 25.217 27.828 15.245 1.00 8.905CHA 892ATOM 746 CA ASP A 102 26.263 27.454 16.230 1.00 8.035CHA 893ATOM 747 C ASP A 102 27.602 26.986 15.681 1.00 7.565CHA 894ATOM 748 O ASP A 102 28.030 25.875 15.937 1.00 6.975CHA 895ATOM 749 CB ASP A 102 25.612 26.547 17.282 1.00 7.615CHA 896ATOM 750 CG ASP A 102 26.373 26.259 18.576 1.00 7.615CHA 897ATOM 751 OD1 ASP A 102 26.215 25.289 19.348 1.00 6.705CHA 898ATOM 752 OD2 ASP A 102 27.250 27.077 18.886 1.00 7.665CHA 899ATOM 1404 N SER A 195 30.503 18.936 23.287 1.00 10.845CHA1551ATOM 1405 CA SER A 195 30.268 20.144 22.534 1.00 11.395CHA1552ATOM 1406 C SER A 195 31.398 20.500 21.546 1.00 11.495CHA1553ATOM 1407 O SER A 195 32.602 20.510 21.836 1.00 10.925CHA1554ATOM 1408 CB SER A 195 30.081 21.398 23.463 1.00 11.445CHA1555ATOM 1409 OG SER A 195 28.708 21.332 23.826 1.00 13.005CHA1556



On 27/06/14 21:49, Keller, Jacob wrote:

I have wanted for some time to search for catalytic-triad-like configurations 
by defining three Ca-Cb bonds from known catalytic triads, then searching the 
pdb for matches, but have not thought of a quick and/or easy way to do 
this--can your software do this sort of thing, or is there some other software 
which could be used for this?

JPK

-----Original Message-----
From: CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] On Behalf Of Dale 
Tronrud
Sent: Friday, June 27, 2014 4:27 PM
To: CCP4BB@JISCMAIL.AC.UK
Subject: [ccp4bb] New Version of the Protein Geometry Database Now Available

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

             Protein Geometry Database Server V 1.0
              http://pgd.science.oregonstate.edu/
Developed by Andy Karplus' laboratory at Oregon State University

    We are pleased to announce the availability of an enhanced version of the 
Protein Geometry Database (PGD) web service, originally announced in Berkholz 
et al (2010) Nucleic Acids Research 38, D320-5.
This server allows you to explore the many backbone and side chain conformations that 
exist in the PDB as well as the protein geometry (lengths and angles) that occur in those 
conformations. This service is ideal for finding instances of particular conformations or 
peculiar bond lengths or angles.  It is also quite adept at identifying sets of fragments 
that can then be examined for systematic variation in "ideal" geometry. The 
expanded PGD now includes all conformational and covalent geometry information not just 
for the backbone but also for the sidechains.

    There are three basic operations available: selecting a set of fragments 
via a delimited search, analyzing the geometry of those fragments, and dumping 
the results to your computer for more specialized analysis.

    To control bias in statistical analyses due to the variable number of 
entries with the same or similar sequence, the database contains only the 
highest quality model in each sequence cluster as identified by the Pisces 
server from Roland Dunbrack's lab.  Two settings, 90% and 25% sequence 
identity, are available.  Currently, at the 90% sequence identity level there 
are 16,000 chains and at the 25% level this drops to about 11,000 chains.

    You can filter a search based on the quality of the model as indicated by resolution 
and R values.  A search can also be filtered based on DSSP secondary structure, amino 
acid type, the phi/psi/omega angles and bond lengths, angles, and chi angles.  For 
example, you can find all cysteine residues in the center of three-residue peptide 
fragments (i.e. not at a peptide terminus), in beta sheet, with both peptide bonds trans, 
and CB-SG length greater than 1.85 A from models with resolution better than 1.5 A.  By 
the way, in the "no more than 25% sequence identity" category there are 25 of 
them.

    Once you have a set of results, you can create 2D plots showing the 
relationships of up to three features (i.e. bond lengths, bond angles, or 
conformational angles).  For instance, you can look at how a given feature 
varies with phi and psi using a phi(i)/psi(i) plot.  Or, you can just as easily 
look at the variation with psi(i)/phi(i+1), or even the relationships between 
any selected bond angles.  As one example, it is instructive to perform a 
default search and plot NCaCb vs NCaC colored based on CbCaC.  As this search 
is restricted to just the highest resolution models, you can see the 
justification for chiral volume restraints.

    Finally, all of your results can be downloaded for your own analysis.

    Development of the PGD continues.  If you have worked with the site and 
have any ideas and suggestions for how to improvement it, please drop us a line.

    The publication describing the PGD is:

Berkholz, D.S., Krenesky, P.B., Davidson, J.R.,  & Karplus, P.A.
(2010) Protein Geometry Database: A flexible engine to explore backbone 
conformations and their relationships with covalent geometry.
Nucleic Acids Res. 38, D320-5.

    Also, some examples of published analyses enabled by earlier versions of 
the PGD are listed here:.

Berkholz, D.S., Shapovalov, M.V., Dunbrack, R.L.J. & Karplus, P.A.
(2009). Conformation dependence of backbone geometry in proteins.
Structure 17, 1316-1325.

Hollingsworth, S.A., Berkholz, D.S. & Karplus, P.A. (2009). On the occurrence 
of linear groups in proteins. Protein Science 18, 1321-1325

Hollingsworth, S.A. & Karplus, P. A. (2010). Review: A fresh look at the 
Ramachandran plot and the occurrence of standard structures in proteins. 
BioMolecular Concepts 1, 271-283.

Berkholz, D.S., Driggers, C.M., Shapovalov, M.V., Dunbrack, R.L., Jr.
& Karplus P.A. (2012) Nonplanar peptide bonds in proteins are common and 
conserved but not biased toward active sites. Proc Natl Acad Sci U S A.  109, 
449-53.

Dale Tronrud & P. Andrew Karplus
Department of Biochemistry and Biophysics Oregon State University -----BEGIN 
PGP SIGNATURE-----
Version: GnuPG v2.0.22 (MingW32)
Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/

iEYEARECAAYFAlOt044ACgkQU5C0gGfAG12aZACdG68Vyjw7JJimw0ofMZrJQLu8
B1IAn0Q5xx8ptRosgMXswdYmdjNKyFkA
=D63d
-----END PGP SIGNATURE-----


--
Tom Oldfield , PhD
Team Leader
Head of PDBe Databases and Services
Protein Databank in Europe
European Bioinformatics Institute (EMBL-EBI)
European Molecular Biology Laboratory
Wellcome Trust Gemome Campus
Hinxton
Cambridge  CB10 1SD
United Kingdom
Tel : ++44 (0) 1223 492526

Re: [ccp4bb] New Version of the Protein Geometry Database Now Available

Reply via email to