2009/8/5 Grant Ingersoll <[email protected]>

> What parameters did you use in the command line?


I'm running syntheticcontrol kmeans clustering. Three parameters are needed:
2 threshold & 1 convergence criteria for iterations.

Which values are recommended to assign to each one?


>
> There are a couple of threads in the archives that are likely of interest
> along these lines:
> http://www.lucidimagination.com/search/p:mahout?q=clustering#/
> p:mahout/s:email/l:user
>
> Are you trying to cluster text?  Or something else?
>

Yes, I'm trying to clustering text. I've build a tf-idf matrix compose by
sparse vectors. Syntheticcontrol kmeans clustering works well with sparse
vectors?

Thanks again.


> On Aug 5, 2009, at 10:47 AM, Allan Roberto Avendano Sudario wrote:
>
>  Regards,
>>
>> I´m trying to fit the kmeans syntheticcontrol job with my own dataset,
>> everything works well.
>> But, only one cluster is generated. I suppose that it´s about the default
>> parameters of clustering
>> process.
>>
>> What do you recommend about how to change clustering parameters?
>> *(2 threshold and 1 convergenceDelta)*
>>
>> Which would be the clustering algorithm into information retrieval
>> process?
>>
>> Thanks for your help.
>>
>> --
>> Allan Avendaño S.
>>
>
>
>


-- 
Allan Avendaño S.
Home: 04 2 800 692
Cell: 09 700 42 48

Reply via email to