Hi Jack, 

you are indeed getting the expected outcome. The MF generator just generate 
gross formulae (just atoms, no bonds) and you need a constitutional isomer 
generator to generator all isomers of that structure (add the bonds). 

If you only need one random isomer, I wrote a class many years ago called 
SingleStructureRandomGenerator.java in the package 
org.openscience.cdk.structgen. 
(http://cdk.github.io/cdk/latest/docs/api/org/openscience/cdk/structgen/SingleStructureRandomGenerator.html)

It is now almost exactly 20 years old and has probably not been tested since 
then, so I cannot say (and have not time at the moment to test) if it is still 
working after so many refactoring of the CDK.
Would be great if you could give it a try and report here :)

The alternative (at the moment) is to use OMG 
(https://sourceforge.net/projects/openmg/) to generate just one isomer but that 
might take longer than with above solution.

Kind regards, 

Chris

PS: just to add: We are working on a proper isomer generator for the CDK but I 
cannot say when it is ready. 

— 
Prof. Dr. Christoph Steinbeck
Analytical Chemistry - Cheminformatics and Chemometrics
Friedrich-Schiller-University Jena, Germany
Phone Secretariat: +49-3641-948171
http://cheminf.uni-jena.de
http://orcid.org/0000-0001-6966-0814

What is man but that lofty spirit - that sense of enterprise.
... Kirk, "I, Mudd," stardate 4513.3..

> On 14. Jan 2021, at 00:19, Jack <jackcampanell...@gmail.com> wrote:
> 
> Greetings CDK devs/support team,
> 
> I'm currently using MolecularFormulaGenerator to generate a list of formulas 
> and then piping those formulas to SmilesGenerator in order to retrieve a list 
> of smile strings.
> 
> However, these resulting smile strings don't have any notion of bonds, rings, 
> etc. For example, if MolecularFormulaGenerator returns say 'C6H6' then the 
> corresponding smile string looks like: 
> '[H].[H].[H].[H].[H].[H].[C].[C].[C].[C].[C].[C]'.
> 
> I'm wondering if there is some sort of intermediate step I might have to take 
> in order to generate structural isomers for each molecular formula before 
> generating smiles. I've scoured the CDK's api documentation and the wiki but 
> I can't find any classes or set of classes that would accomplish this. 
> Admittedly, this could definitely be because of my 'barely passing Chem 101' 
> knowledge of fundamental Chemistry.  
> 
> If someone could offer some insight into where I should look, which classes I 
> should use, or if I'm just simply missing something I would be forever 
> grateful.
> 
> Thanks a million!
> 
> - Jack Campanella
> _______________________________________________
> Cdk-user mailing list
> Cdk-user@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/cdk-user



_______________________________________________
Cdk-user mailing list
Cdk-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/cdk-user

Reply via email to