Re: [gmx-users] Trjconv PDB files define solvent as "ATOM"?

Mark Abraham Tue, 20 Mar 2012 00:00:44 -0700

On 20/03/2012 5:10 AM, John Ladasky wrote:

I am trying to import PDB file snapshots from a GROMACS4.5.4-generated trajectory into other software tools -- specifically,Biopython. I generate the snapshots using trjconv in GROMACS.
I am interested in the water molecules from my solvent box, so I donot discard them. When trjconv prompts me to "Select group foroutput", I select "Group 0 (System)". However, in downstreamapplications, I do want to differentiate the solvent atoms from myprotein polymer, and ensure that each group of atoms (protein atoms,solvent atoms) is placed in a distinct category.
Biopython's PDB file parser is not cooperating with me. It isattempting to append the water molecules as additional RESIDUES of mypolymer. Obviously, this is incorrect. So, where's the problem,Biopython or GROMACS? Looking through the PDB file specification,version 3.2, I found the following passage:
"The ATOM records present the atomic coordinates for standard aminoacids and nucleotides. They also present the occupancy and temperaturefactor for each atom. Non-polymer chemical coordinates use the HETATMrecord type."
If I am reading this correctly, my solvent atoms should be tagged as"HETATM" rather than as "ATOM". But the files that trjconv produceslabel every atom as "ATOM", whether it's an atom from the protein oran atom from a water molecule.
Is there any way to make trjconv use "HETATM" for solvent atoms? I donot see anything in the trjconv documentation. I also do notunderstand why trjconv might produce PDB files which do not adhere tothe standard. There may be a good reason, I don't know.

Strict adherence by software to the PDB format is something of anexception rather than the rule. Often you will see TER records and/orchain IDs used to differentiate different parts of the same system. Forthis kind of reason, most software that claims to read PDB should havesome way of making subset selections that are not dependent on thecontents of the PDB file. You should consult the Biopython documentationto see how it likes to interpret things, and how you can customize that.

trjconv cannot attempt to guess how all possible pieces of softwaremight like to interpret its results, and so it produces somethinggeneric and plausible. Depending how flexible Biopython is, you may needto use a shell script to post-process the trjconv output to do somethinglike Tsjerk suggested, or insert TER records, or change chain IDs. Doread how Biopython works, first.


Mark
--
gmx-users mailing list    gmx-users@gromacs.org
http://lists.gromacs.org/mailman/listinfo/gmx-users
Please search the archive at 
http://www.gromacs.org/Support/Mailing_Lists/Search before posting!

Please don't post (un)subscribe requests to the list. Use thewww interface or send it to gmx-users-requ...@gromacs.org.

Can't post? Read http://www.gromacs.org/Support/Mailing_Lists

Re: [gmx-users] Trjconv PDB files define solvent as "ATOM"?

Reply via email to