On Mon, Nov 14, 2011 at 7:31 AM, Jochen Schreiber <[email protected]> wrote:
> The databases that i could download from the pubchem ftp for the substances > havnt any 3D informations. Pubchem has a 3D version, though they only store a 1 or 10 conformers. See ftp://ftp.ncbi.nlm.nih.gov/pubchem/Compound_3D > So the following code (which i test for iterating > through all informations) will fail: Note that if you want to read through an SD file that contains multiple conformers (numbered contiguously) then you should be using the IteratingMDLCoformerReader and ConformerContainer classes > 1. Converting structure files into sdf (with babel) Do you mean generate 3D structures? > 2. CDK for creating the query.xml for CDKPSearch You could also construct the query by hand (probably easier) > 3. CDKPSearch for searching structures in the database (pubchem) > 4. CDK for analyse the results (output the results etc.) > Is this the right way? This is certainly one way, though you might want to consider whether 10 conformers per structure is appropriate for a pharmacophore search -- Rajarshi Guha NIH Chemical Genomics Center ------------------------------------------------------------------------------ All the data continuously generated in your IT infrastructure contains a definitive record of customers, application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-novd2d _______________________________________________ Cdk-user mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/cdk-user

