Thanks Pat, but how exactly can I run clusterdump with the --evaluate (-e) parameter? When I try to run it, for example:

./mahout clusterdump -i /user/Data-output/clusters-1-final -o analyze.txt --evaluate

I get a Java NullPointerException:

14/05/19 15:02:03 INFO common.AbstractJob: Command line arguments: {--dictionaryType=[text], --distanceMeasure=[org.apache.mahout.common.distance.SquaredEuclideanDistanceMeasure], --endPhase=[2147483647], --evaluate=null, --input=[/user/Data-output/clusters-1-final], --output=[analyze.txt], --outputFormat=[TEXT], --startPhase=[0], --tempDir=[temp]}
Exception in thread "main" java.lang.NullPointerException

Do I have to pass a value to --evaluate? As input for clusterdump I am using the clusters output produced by running Mahout K-Means.

> Subject: Re: Mahout K-Means - Quality of the clusters
> From: pat.fer...@gmail.com
> Date: Sat, 17 May 2014 09:43:59 -0700
> To: user@mahout.apache.org
>
> mahout clusterdump --evaluate …
>
> provides some stats
>
> On May 15, 2014, at 10:23 PM, hiroshi leon <hiroshi_8...@hotmail.com> wrote:
>
> Hello everybody,
>
> Do you know how I can get the MSE of the clusters in Mahout K-Means?
> I would like to check the quality of the clusters. Thanks!