Hi Lee, 

I cannot help you with the R issue but wanted to add that I recently ran a few 
tests to check if RandomGenerator coverers all of constitutional space, which 
it did in my few test cases. The overhead for a small set of atoms (like 
C10H16) was 200-fold, i. e. RG needs to visit 200 times the actual size of 
constitutional space to visit all constitutional isomers of that space. 
There is the added complexity that it seems to have problems (I don’t know why) 
with explicit hydrogens. I only use it in cases where the hydrogen distribution 
is known. 
If that is the case, my gut feeling is that the RandomGenerator is a good 
choice for your problem, rather than a deterministic generator like OMG, which 
might have a sampling bias if you only take the first 100k. Furthermore, OMG 
will take very long if you move to slightly larger molecules. 

Will be interesting to learn about your progress here :)

Kind regards, 

Chris


— 
Prof. Dr. Christoph Steinbeck
Analytical Chemistry - Cheminformatics and Chemometrics
Friedrich-Schiller-University Jena, Germany
Phone Secretariat: +49-3641-948171
http://cheminf.uni-jena.de
http://orcid.org/0000-0001-6966-0814

What is man but that lofty spirit - that sense of enterprise.
... Kirk, "I, Mudd," stardate 4513.3..

> On 29 Mar 02018, at 05:03, Lee Ferguson <[email protected]> wrote:
> 
> Hello all,
> 
> I’m trying to work out a good way to perform structure generation from 
> molecular formula for integration into an analytical pipeline in our lab.  
> Specifically, what I want to do is generate (up to) ~100,000 structures for 
> any given molecular formula and subsequently down-select from that set using 
> some structural similarity filtering.  It looks like I could potentially bend 
> the RandomGenerator function within the structgen package to my structure 
> generation needs, but I was hoping to do this within R, and from what Zach 
> tells me, there is no wrapper for the structgen package in rCDK as of yet.   
> Zack suggested I could use rJava, which is a great idea, but he also pointed 
> out that I might consider posting here in case someone else had tried to do 
> something similar already and thus I might avoid reinventing the wheel.
> 
> I’d appreciate any ideas or guidance you all might have.
> 
> Best regards,
> Lee
> 
> --
> P. Lee Ferguson, Ph.D.
> Associate Professor
> Department of Civil & Environmental Engineering
> Pratt School of Engineering &
> Nicholas School of the Environment
> Duke University
> 121 Hudson Hall, Box 90287
> Durham, NC 27708-0287
> http://ferguson.cee.duke.edu
> 
> Phone: 919-660-5460
> Fax: 919-660-5454
> 
> 
> 
> 
> ------------------------------------------------------------------------------
> Check out the vibrant tech community on one of the world's most
> engaging tech sites, Slashdot.org! 
> http://sdm.link/slashdot_______________________________________________
> Cdk-user mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/cdk-user


------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Cdk-user mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/cdk-user

Reply via email to