2009/8/5 Grant Ingersoll <[email protected]>

> What parameters did you use in the command line?

I'm running the syntheticcontrol kmeans clustering job. It needs three
parameters: two thresholds and one convergence criterion for the iterations.
Which values do you recommend for each one?

> There are a couple of threads in the archives that are likely of interest
> along these lines:
> http://www.lucidimagination.com/search/p:mahout?q=clustering#/p:mahout/s:email/l:user
>
> Are you trying to cluster text? Or something else?

Yes, I'm trying to cluster text. I've built a tf-idf matrix composed of
sparse vectors. Does the syntheticcontrol kmeans clustering work well with
sparse vectors?

Thanks again.

> On Aug 5, 2009, at 10:47 AM, Allan Roberto Avendano Sudario wrote:
>
>> Regards,
>>
>> I'm trying to fit the kmeans syntheticcontrol job with my own dataset,
>> everything works well.
>> But, only one cluster is generated. I suppose that it's about the default
>> parameters of clustering
>> process.
>>
>> What do you recommend about how to change clustering parameters?
>> *(2 threshold and 1 convergenceDelta)*
>>
>> Which would be the clustering algorithm into information retrieval
>> process?
>>
>> Thanks for your help.
>>
>> --
>> Allan Avendaño S.

--
Allan Avendaño S.
Home: 04 2 800 692
Cell: 09 700 42 48
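
To make the question about the two thresholds concrete, here is a minimal sketch of how two distance thresholds can decide how many initial clusters exist. It assumes the thresholds are the T1 and T2 distances of a canopy pass that seeds k-means, which the thread does not confirm. The function name `canopies`, the Euclidean distance, and the toy points are hypothetical illustrations in plain Python, not Mahout's API.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def canopies(points, t1, t2, distance=euclidean):
    """Group points into canopies using a loose (t1) and a tight (t2) threshold.

    Hypothetical helper: each canopy is a (center, members) pair.
    """
    if t1 <= t2:
        raise ValueError("t1 must be greater than t2")
    remaining = list(points)
    result = []
    while remaining:
        # Take the next unassigned point as a new canopy center.
        center = remaining.pop(0)
        # Points closer than t1 are members of this canopy.
        members = [center] + [p for p in remaining if distance(center, p) < t1]
        result.append((center, members))
        # Points closer than t2 can no longer start a canopy of their own.
        remaining = [p for p in remaining if distance(center, p) >= t2]
    return result


# Toy data: three well-separated groups.
points = [(0.0, 0.0), (0.5, 0.0), (10.0, 10.0), (10.5, 10.0), (20.0, 0.0)]

# Thresholds larger than every pairwise distance versus thresholds
# smaller than the gaps between groups.
large = canopies(points, t1=100.0, t2=50.0)
small = canopies(points, t1=3.0, t2=1.5)
print(len(large), len(small))
```

Under that assumption, thresholds larger than every distance in the data collapse everything into a single canopy, which would match the single cluster described above; thresholds on the scale of the real gaps between groups give one canopy per group. The convergence criterion is not covered by this sketch.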
