Hi all,

I'm trying to work with Chembl23, calculating ChemFP fingerprints.  Some 
compounds contain non-standard heavy atoms, e.g. this one containing Te:

CC(C)(/C(=C\\Cl)/[Te](c1ccc(cc1)OC)(Cl)Cl)O

My workflow is to convert SDF format to SMILES as I find it easier to correct 
any minor errors.  I'm trying to calculate the fingerprints using ChemFP and 
RDKit as follows:

rdkit2fps \
    --id-tag "Molecule Name" \
    chembl_23.fixed.smi \
    -o rdkit_chembl_23.fixed.fps

but I get the following error:

[11:37:55] Explicit valence for atom # 6 Te, 4, is greater than permitted
ERROR: Cannot parse the SMILES 'CC(C)(/C(=C\\Cl)/[Te-2](c1ccc(cc1)OC)(Cl)Cl)O' 
at line 155850 of chembl_23.fixed.smi. Exiting.[11:39:23] Explicit valence for 
atom # 6 Te, 6, is greater than permitted

How should atoms non-standard heavy atoms like Te be treated?

Thanks
Markus
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to