Re: [Rdkit-discuss] Getting to grips with Open3DAlign

Paolo Tosco Tue, 21 Jun 2016 13:58:23 -0700

Dear Tim,

the Align() method returns an RMSD value, which however is computed onlyon a limited number of atom pairs, namely those that the algorithm wasable to match between the two molecules, so a low value is notparticularly informative of the overall goodness of the alignment, as itonly indicates that the matched atoms were matched nicely, but theremight only be a few of those in the core, while side chains arescattered all over.The Score() method instead returns the O3AScore for the alignment, whichis a better way to assess the quality of the superimposition.

The other problem in your script is that the i index is incrementedbefore recording it in the lowest/highest variables, so the confIds areshifted by 1, as the conformation index in the RDKit is 0-based.

I also noticed that without minimizing the conformations the aromaticrings look quite distorted, so I added a MMFF minimization, and Iincreased the number of generated conformations and thepruneRmsThreshold. Setting to False the experimental torsion anglepreferences and basic knowledge about rings seems to yield a largervariety of geometries which helps reproducing this quite peculiar x-raygeometry which is probably not so commonly found. Please find themodified script below.


Hope this helps, kind regards
Paolo


#!/usr/bin/env python


from rdkit import Chem, RDConfig
from rdkit.Chem import AllChem, rdMolAlign

ref =Chem.MolFromSmiles('NC(=[NH2+])c1ccc(C[C@@H](NC(=O)CNS(=O)(=O)c2ccc3ccccc3c2)C(=O)N2CCCCC2)cc1')mol1 =Chem.MolFromPDBFile(RDConfig.RDBaseDir+'/rdkit/Chem/test_data/1DWD_ligand.pdb')

mol1 = AllChem.AssignBondOrdersFromTemplate(ref, mol1)

mol2 =Chem.MolFromPDBFile(RDConfig.RDBaseDir+'/rdkit/Chem/test_data/1PPC_ligand.pdb')

mol2 = AllChem.AssignBondOrdersFromTemplate(ref, mol2)

pyO3A = rdMolAlign.GetO3A(mol1, mol2)
rmsd = pyO3A.Align()
score = pyO3A.Score()
print "Orig",score
Chem.MolToMolFile(mol1, "orig.mol")

cids = AllChem.EmbedMultipleConfs(mol1, numConfs=250, maxAttempts=100,
    pruneRmsThresh=0.5, useExpTorsionAnglePrefs=False,
    useBasicKnowledge=False)
AllChem.MMFFOptimizeMoleculeConfs(mol1, mmffVariant='MMFF94s')
pyO3As = rdMolAlign.GetO3AForProbeConfs(mol1, mol2, numThreads=0)
i = 0
lowest = 999999999.9
highest = 0.0
for pyO3A in pyO3As:
    rmsd = pyO3A.Align()
    score = pyO3A.Score()
    if score < lowest:
        lowest = score
        lowestConfId = i
    if score > highest:
        highest = score
        highestConfId = i
    i +=1

print "Lowest:", lowest, lowestConfId
print "Highest:", highest, highestConfId

Chem.MolToMolFile(mol1, "lowest.mol", confId=lowestConfId)
Chem.MolToMolFile(mol1, "highest.mol", confId=highestConfId)


On 06/21/16 15:41, Tim Dudgeon wrote:

Hi All,
I'm trying to get to grips with using Open3D Align in RDKit, buthitting problems.
My approach is to generate random conformers of the probe molecule andalign it to the reference molecule. My example is cobbled togetherfrom the examples in the cookbook.
from rdkit import Chem, RDConfig
from rdkit.Chem import AllChem, rdMolAlign
ref =Chem.MolFromSmiles('NC(=[NH2+])c1ccc(C[C@@H](NC(=O)CNS(=O)(=O)c2ccc3ccccc3c2)C(=O)N2CCCCC2)cc1')mol1 =Chem.MolFromPDBFile(RDConfig.RDBaseDir+'/rdkit/Chem/test_data/1DWD_ligand.pdb')
mol1 = AllChem.AssignBondOrdersFromTemplate(ref, mol1)
mol2 =Chem.MolFromPDBFile(RDConfig.RDBaseDir+'/rdkit/Chem/test_data/1PPC_ligand.pdb')
mol2 = AllChem.AssignBondOrdersFromTemplate(ref, mol2)

pyO3A = rdMolAlign.GetO3A(mol1, mol2)
score = pyO3A.Align()
print "Orig",score
Chem.MolToMolFile(mol1, "orig.mol")
cids = AllChem.EmbedMultipleConfs(mol1, numConfs=100, maxAttempts=100,pruneRmsThresh=0.1, useExpTorsionAnglePrefs=True, useBasicKnowledge=True)
pyO3As = rdMolAlign.GetO3AForProbeConfs(mol1, mol2, numThreads=0)
i = 0
lowest = 999999999.9
highest = 0.0
for pyO3A in pyO3As:
    i +=1
    score = pyO3A.Align()
    if score < lowest:
        lowest = score
        lowestConfId = i
    if score > highest:
        highest = score
        highestConfId = i

print "Lowest:", lowest, lowestConfId
print "Highest:", highest, highestConfId

Chem.MolToMolFile(mol1, "lowest.mol", confId=lowestConfId)
Chem.MolToMolFile(mol1, "highest.mol", confId=highestConfId)
What I'm finding is that the alignments with the lowest and highestO3A scores are much worse alignments (visually) than the originalstructure (the structure from 1DWD). Typical scores are:
Original 1DWD structure: 0.38
Lowest scoring conformer: 0.186
Highest scoring conformer: 0.78
Now I'm assuming that lower O3A align scores are better (though that'snot specifically stated), so as my lowest score is lower than theoriginal alignment I would have expected it to be a better alignment,but its clearly much worse. And if I'm wrong and higher scores arebetter then the same applies.
Clearly I've not understood something correctly!

Tim



------------------------------------------------------------------------------
Attend Shape: An AT&T Tech Expo July 15-16. Meet us at AT&T Park in San
Francisco, CA to explore cutting-edge tech and listen to tech luminaries
present their vision of the future. This family event has something for
everyone, including kids. Get more information and register today.
http://sdm.link/attshape


_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

------------------------------------------------------------------------------
Attend Shape: An AT&T Tech Expo July 15-16. Meet us at AT&T Park in San
Francisco, CA to explore cutting-edge tech and listen to tech luminaries
present their vision of the future. This family event has something for
everyone, including kids. Get more information and register today.
http://sdm.link/attshape

_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Re: [Rdkit-discuss] Getting to grips with Open3DAlign

Reply via email to