Hi all,
I'm trying to work with Chembl23, calculating ChemFP fingerprints. Some
compounds contain non-standard heavy atoms, e.g. this one containing Te:
CC(C)(/C(=C\\Cl)/[Te](c1ccc(cc1)OC)(Cl)Cl)O
My workflow is to convert SDF format to SMILES as I find it easier to correct
any minor errors. I'm trying to calculate the fingerprints using ChemFP and
RDKit as follows:
rdkit2fps \
--id-tag "Molecule Name" \
chembl_23.fixed.smi \
-o rdkit_chembl_23.fixed.fps
but I get the following error:
[11:37:55] Explicit valence for atom # 6 Te, 4, is greater than permitted
ERROR: Cannot parse the SMILES 'CC(C)(/C(=C\\Cl)/[Te-2](c1ccc(cc1)OC)(Cl)Cl)O'
at line 155850 of chembl_23.fixed.smi. Exiting.[11:39:23] Explicit valence for
atom # 6 Te, 6, is greater than permitted
How should atoms non-standard heavy atoms like Te be treated?
Thanks
Markus
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Rdkit-discuss mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss