Dear Woon Yee, you can use the getFingerprint() method instead.
Egon On Mon, 25 Jul 2022 at 10:59, #NG WOON YEE# via Cdk-user < cdk-user@lists.sourceforge.net> wrote: > Dear Helpdesk, > > > > I was using CDK (version 2.7) to generate FCFP4 and 6 for the compound > butyramide ( > https://www.ebi.ac.uk/chembl/api/data/molecule/CHEMBL1231396.sdf) and > ethanol (https://www.ebi.ac.uk/chembl/api/data/molecule/CHEMBL545.sdf) > from their MolFiles which I got from CHEMBL. I was using the following > commands in CDK: > > > --------------------------------------------------------------------------------------------------------------- > > package ecfp; > > import java.io.*; > > import com.opencsv.CSVReader; > > import com.opencsv.CSVReaderBuilder; > > import com.opencsv.CSVWriter; > > import com.opencsv.exceptions.CsvException; > > import java.util.Arrays; > > import java.util.List; > > import java.io.FileInputStream; > > import java.io.IOException; > > > > import org.openscience.cdk.exception.CDKException; > > import org.openscience.cdk.fingerprint.CircularFingerprinter; > > import org.openscience.cdk.fingerprint.ExtendedFingerprinter; > > import org.openscience.cdk.fingerprint.ICountFingerprint; > > import org.openscience.cdk.interfaces.IAtomContainer; > > import org.openscience.cdk.interfaces.IChemObjectBuilder; > > import org.openscience.cdk.io.MDLV2000Reader; > > import org.openscience.cdk.silent.SilentChemObjectBuilder; > > > > public class main{ > > public static void main(String[] args) throws CDKException, > IOException { > > String filename = > "C:\\Users\\NGWO0001\\Downloads\\CHEMBL545.sdf.txt"; > > FileInputStream in = new FileInputStream(filename); > > MDLV2000Reader reader = new MDLV2000Reader(in); > > IChemObjectBuilder bldr = > SilentChemObjectBuilder.getInstance(); > > IAtomContainer mol = reader.read(bldr.newAtomContainer()); > > > > CircularFingerprinter fingerprinter0 = new > CircularFingerprinter( > > CircularFingerprinter.CLASS_FCFP4 > > ); > > > > > > System.out.println("FCFP4 Ethanol:"); > > ICountFingerprint result0 = > fingerprinter0.getCountFingerprint(mol); > > for (int k=0, n = result0.numOfPopulatedbins(); k < n; ++k) { > > String ans4 = ""; > > ans4 += result0.getHash(k); > > ans4 += " " + result0.getCount(k); > > System.out.printf("%s\n",ans4); > > } > > > > reader.close(); > > } > > } > > > --------------------------------------------------------------------------------------------------------------- > > > > The results I got were: > > > > FCFP4 Butyramide: > > -1393198889 1 > > -1212393386 1 > > -1131767167 2 > > 0 4 > > 2 1 > > 3 1 > > 425233353 1 > > 785469695 1 > > 824716024 1 > > 994111779 1 > > 1429107614 1 > > > > FCFP6 Butyramide: > > -1393198889 1 > > -1212393386 1 > > -1131767167 2 > > 0 4 > > 2 1 > > 3 1 > > 425233353 1 > > 785469695 1 > > 824716024 1 > > 994111779 1 > > 1429107614 1 > > > > FCFP4 Ethanol: > > -1212393386 1 > > 0 2 > > 3 1 > > 629394235 1 > > 824716024 1 > > > > FCFP6 Ethanol: > > -1212393386 1 > > 0 2 > > 3 1 > > 629394235 1 > > 824716024 1 > > > I think these results may not be right since I thought that fingerprints > are supposed to be a series of hash and so they ought to be a series of > fixed-length integers. However, as you see in the results I got, for > example, for the FCFP6 for ethanol, one is 10-digits long while others are > single digits and 9-digits long. > > > > Can you please tell me what I am doing wrong? > > > > Thanking you in advance for your assistance and time. > > > > Best regards, > > Woon Yee > _______________________________________________ > Cdk-user mailing list > Cdk-user@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/cdk-user > -- ---- Super happy with this new eLife paper describing an Open Science project where we discuss 260 thousand natural products and where they came from, all 700 thousand pairs linked to their primary literature: "The LOTUS initiative for open knowledge management in natural products research", https://doi.org/10.7554/elife.70780 ----- E.L. Willighagen Department of Bioinformatics - BiGCaT Maastricht University (http://www.bigcat.unimaas.nl/) Twitter/Mastodon: @egonwillighagen <https://twitter.com/egonwillighagen> / @egonw <https://scholar.social/@egonw> Homepage: http://egonw.github.io/ Blog: http://chem-bla-ics.blogspot.com/ PubList: https://www.zotero.org/egonw ORCID: 0000-0001-7542-0286 <http://orcid.org/0000-0001-7542-0286> ImpactStory: https://impactstory.org/u/egonwillighagen
_______________________________________________ Cdk-user mailing list Cdk-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/cdk-user