Re: [scikit-learn] Agglomerative clustering problem

2017-07-15 Thread Jacob Schreiber
Typically when I think of limiting the number of points in a cluster I think of KD trees. I suppose that wouldn't work? On Tue, Jul 11, 2017 at 11:22 AM, Ariani A wrote: > ِDear Uri, > Thanks. I just have a pairwise distance matrix and I want to implement it > so that each cluster has at least 4

Re: [scikit-learn] Agglomerative clustering problem

2017-07-11 Thread Ariani A
ِDear Uri, Thanks. I just have a pairwise distance matrix and I want to implement it so that each cluster has at least 40 data points. (in Agglomerative). Does it work? Thanks, -Ariani On Tue, Jul 11, 2017 at 1:54 PM, Uri Goren wrote: > Take a look at scipy's fcluster function. > If M is a matri

Re: [scikit-learn] Agglomerative clustering problem

2017-07-11 Thread Uri Goren
Take a look at scipy's fcluster function. If M is a matrix of all of your feature vectors, this code snippet should work. You need to figure out what metric and algorithm work for you from sklearn.metrics import pairwise_distance from scipy.cluster import hierarchy X = pairwise_dista

[scikit-learn] Agglomerative clustering problem

2017-07-11 Thread Ariani A
Hi all, I want to perform agglomerative clustering, but I have no idea of number of clusters before hand. But I want that every cluster has at least 40 data points in it. How can I apply this to sklearn.agglomerative clustering? Should I use dendrogram and cut it somehow? I have no idea how to rela