Dear All,
 
Today I have spent some time processing a freely-available SDF that
contains many compounds and melting-points / ranges (
http://www.mdpi.org/molmall/mdpi1-51sd.zip).  The reason for doing this
is that I wanted to implement a melting-point predictor following the
work of Andreas Bender (J. Chem. Inf. Model. 2005, 45, 581-590) and more
recently Reifeng Liu at AZ (J. Chem. Inf. Model. 2008, 48, 981-987).
 
I have attached the python-script that I have at the moment (a) in case
it is of some use to anybody else, (b) in the hope that I can improve my
python and rdkit abilities through any suggested alterations (I'm sure
there are many!), and (c) to form the basis of a couple of questions.
At the moment, the script is just running through each compound;
checking if the molecule is valid; and if so, noting how many
components, and whether any of the atoms are outside of the desired
list.  These two results are then written out to a new SDF.  I am then
using this to make sure my data-set contains only compounds that I would
say are 'reasonable' to build a melting-point model with.  Now for the
questions:
 
1.  In RDKit, has the 'cleaning / washing / salt-stripping' of molecules
already been formalised based on a set of rules, etc?
2.  When identifying compounds that contain a non-allowed atom-type, why
do I find the SMARTS def [!H;!C;!N;!O;!F;!S;!Cl;!Br;!I] gives unexpected
results, but [!#1;!#6;!#7;!#8;!#9;!#16;!#17;!#35;!#53] works as I would
expect?
 
Kind regards
 
James

______________________________________________________________________
PLEASE READ: This email is confidential and may be privileged. It is intended 
for the named addressee(s) only and access to it by anyone else is 
unauthorised. If you are not an addressee, any disclosure or copying of the 
contents of this email or any action taken (or not taken) in reliance on it is 
unauthorised and may be unlawful. If you have received this email in error, 
please notify the sender or postmas...@vernalis.com. Email is not a secure 
method of communication and the Company cannot accept responsibility for the 
accuracy or completeness of this message or any attachment(s). Please check 
this email for virus infection for which the Company accepts no responsibility. 
If verification of this email is sought then please request a hard copy. Unless 
otherwise stated, any views or opinions presented are solely those of the 
author and do not represent those of the Company.

The Vernalis Group of Companies
Oakdene Court
613 Reading Road
Winnersh, Berkshire
RG41 5UA.
Tel: +44 118 977 3133

To access trading company registration and address details, please go to the 
Vernalis website at www.vernalis.com and click on the "Company address and 
registration details" link at the bottom of the page..
______________________________________________________________________

Attachment: inorg_or_mix.py
Description: inorg_or_mix.py

------------------------------------------------------------------------------
Start uncovering the many advantages of virtual appliances
and start using them to simplify application deployment and
accelerate your shift to cloud computing.
http://p.sf.net/sfu/novell-sfdev2dev
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to