Hi all, I've recently released chemfp 4.0, with support for several diversity selection algorithms, and an improved API for interactive use in a notebook environment.
Chemfp is an analytics package for cheminformatics fingerprints. It contains command-line tools and an extensive Python library for fingerprint generation, high-performance similarity search, diversity selection, and exploratory research. The new diversity selection algorithms are MaxMin, sphere exclusion (both random and directed), and HeapSweep. Of special note, The MaxMin algorithm has improved support for selecting diverse fingerprints from a set of candidates (eg, a vendor catalog) which must also be diverse from a set of references (eg, a corporate collection), which is over an order of magnitude faster than the standard MaxMin algorithm. People who live in the Jupyter notebook will likely enjoy the new chemfp user experience. Most long-term actions support progress bars, chemfp's Python objects have more informative repr()s, search results added Pandas integration, and there are new high-level APIs that let you express a lot of functionality compactly. See https://chemfp.readthedocs.io/en/latest/whatsnew.html for more details. To install the pre-compiled package for Linux at no cost, use: python -m pip install chemfp -i https://chemfp.com/packages/ The Base License covers most in-house use of chemfp, though a few features are either limited or disabled and require a license key to unlock. For alternative licenses, including source code and no-cost academic licensing, see https://chemfp.com/license/ -- or try one of the re-formatted ChEMBL datasets at https://chemfp.com/datasets/ which include an embedded authorization key. Chemfp is not a cheminformatics toolkit. Instead, it knows how to use RDKit (or Open Babel, OpenEye's OEChem/OEGraphSim, or the CDK) for molecule I/O and fingerprint generation. For more information see the chemfp home page at https://chemfp.com/ or contact me directly. Best regards, Andrew da...@dalkescientific.com _______________________________________________ Rdkit-discuss mailing list Rdkit-discuss@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/rdkit-discuss