Dear All, Apologies for posting on a rather well-trodden (and tedious?) topic... I have just spent some time counting H-bond donors in a variety of ways for a 4000 compound data set - to see how our calculations could best match the results coming from a collaborator (it's as fun as it sounds). As it turns-out, Descriptors.NumHDonors is pretty much on the money! (ie counting the number or N and O atoms that have at least one H attached) However, during this process I ended-up going back to a (the?) Lipinski paper -
Christopher A Lipinski, Franco Lombardo, Beryl W Dominy, Paul J Feeney, Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings, Advanced Drug Delivery Reviews, Volume 46, Issues 1-3, 1 March 2001, Pages 3-26, ISSN 0169-409X, 10.1016/S0169-409X(00)00129-0. - to see what the Lipinski definition of Hydrogen Bond Donors was. I read the following: "We found that simply adding the number of NH bonds and OH bonds does remarkably well as an index of H bond donor character. Importantly, this parameter has direct structural relevance to the chemist." As far as I can tell, this would require explicit addition of Hs to the molecule, followed by counting the number of matches for an NH or OH BOND; something like the following: >>> from rdkit import Chem >>> from rdkit.Chem import Descriptors >>> smarts = Chem.MolFromSmarts("[#7,#8]-[#1]") >>> mol = Chem.MolFromSmiles("CC(=O)N") >>> mol = Chem.AddHs(mol) >>> matches = mol.GetSubstructMatches(smarts) >>> print len(matches) 2 The MOE descriptor lip_don seems to exactly reproduce these 'bond count' numbers for my set of compounds. So I guess my question is - shouldn't we be counting the NH and OH bonds for Lipinski-like counting? (and I guess this is what MOE's lip_don is for) Kind regards James _________________________________ Dr James Davidson Senior Team Leader, Medicinal & Computational Chemistry Vernalis (R&D) Ltd Granta Park Great Abington Cambridgeshire CB21 6GB t: +441223 895555 f: +441223 895556 d: +441223 895428 m: +447595 258005 ______________________________________________________________________ PLEASE READ: This email is confidential and may be privileged. It is intended for the named addressee(s) only and access to it by anyone else is unauthorised. If you are not an addressee, any disclosure or copying of the contents of this email or any action taken (or not taken) in reliance on it is unauthorised and may be unlawful. If you have received this email in error, please notify the sender or postmas...@vernalis.com. Email is not a secure method of communication and the Company cannot accept responsibility for the accuracy or completeness of this message or any attachment(s). Please check this email for virus infection for which the Company accepts no responsibility. If verification of this email is sought then please request a hard copy. Unless otherwise stated, any views or opinions presented are solely those of the author and do not represent those of the Company. The Vernalis Group of Companies Oakdene Court 613 Reading Road Winnersh, Berkshire RG41 5UA. Tel: +44 118 977 3133 To access trading company registration and address details, please go to the Vernalis website at www.vernalis.com and click on the "Company address and registration details" link at the bottom of the page.. ______________________________________________________________________ ------------------------------------------------------------------------------ All the data continuously generated in your IT infrastructure contains a definitive record of customers, application performance, security threats, fraudulent activity and more. Splunk takes this data and makes sense of it. Business sense. IT sense. Common sense. http://p.sf.net/sfu/splunk-d2dcopy1 _______________________________________________ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss