Thanks, Takayuki, For both Repeated Bisection clustering and K-means clustering, they all need the number of clusters as input, right?
Best, Jing On Sun, Aug 23, 2015 at 1:17 AM, Taka Seri <[email protected]> wrote: > Dear Jing, > > How about your trying using bayon ? > https://code.google.com/p/bayon/ > It's not function of RDKit, but I think the library can cluster molecules > using ECFP4. > > Unfortunately, input file format of bayon is not distance matrix but easy > to prepare the format. > > Best regards. > > Takayuki > > > 2015年8月23日(日) 12:03 Jing Lu <[email protected]>: > >> Currently, I prefer fingerprint based clustering, because it's hard to >> set the cutoff for scaffold based clustering. Does RDKit have scaffold >> based clustering? >> >> On Sat, Aug 22, 2015 at 10:56 PM, <[email protected]> wrote: >> >>> Hi, how about scaffold based clustering . You extract the scaffolds and >>> then cluster it and then put the respective scaffold compounds inside the >>> cluster . >>> >>> Sent from my iPhone >>> >>> > On Aug 22, 2015, at 8:43 PM, Jing Lu <[email protected]> wrote: >>> > >>> > Dear RDKit users, >>> > >>> > If I want to cluster more than 1M molecules by ECFP4. How could I do >>> it? If I calculate the distance between every pair of molecules, the size >>> of distance matrix will be too big. Does RDKit support any heuristic >>> clustering algorithm without calculating the distance matrix of the whole >>> library? >>> > >>> > >>> > >>> > Thanks, >>> > Jing >>> > >>> ------------------------------------------------------------------------------ >>> > _______________________________________________ >>> > Rdkit-discuss mailing list >>> > [email protected] >>> > https://lists.sourceforge.net/lists/listinfo/rdkit-discuss >>> >> >> >> ------------------------------------------------------------------------------ >> _______________________________________________ >> Rdkit-discuss mailing list >> [email protected] >> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss >> >
------------------------------------------------------------------------------
_______________________________________________ Rdkit-discuss mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

