Thnks for the Replay sir, actually i am doing clustering for gathering similar king of document in same cluster as much as possible. i can see from output file by cluster dump by observing top term. i also figure out that by varying Distance Measure Technique. it differs. but i want some mathematical prof that it is better then other technique. so for that i need to calculate Entropy and pureness of cluster. but i am not able to find any command in mahout which can give me entropy as a result. i found Entropy.java under mahout common math statistic package. but i don't what should i give it as input so that i can find entropy or other parameter. so i can find how much cluster is good or bed.
On Tue, Apr 22, 2014 at 7:01 PM, Ted Dunning <ted.dunn...@gmail.com> wrote: > On Tue, Apr 22, 2014 at 12:11 AM, Darshan Sonagara < > darshan.sonag...@gmail.com> wrote: > > > But the problem is that i want check that whether my clustering is good > or > > bad. so for that i need to calculate Entropy Value. I am not having any > > idea how to calculate entropy in mahout or by other technique. > > by finding entropy i can have good conclusion. > > so please can anyone help me with these. > > > > Actually, the way to tell whether your clustering is good is to see if it > works for its intended use. > > What do you want to use clustering for? > -- *Regards From:* *Darshan Sonagara* *Collaborative Platform lead,** SSN Team | Gujarat Section.* *Vice-Chairperson | **GCET IEEE SB.* (: +*91* 9408002452 : Darshan Sonagara<http://www.linkedin.com/pub/darshan-sonagara/64/11a/b54> : Darshan Sonagara <http://www.facebook.com/darshansonagara>