Thanks Pat,

But how exactly can I run clusterdump using the --evaluate (-e) parameter?
When I try to run it, for example:

./mahout clusterdump -i /user/Data-output/clusters-1-final -o analyze.txt --evaluate

I get a Java NullPointerException:

14/05/19 15:02:03 INFO common.AbstractJob: Command line arguments: 
{--dictionaryType=[text], 
--distanceMeasure=[org.apache.mahout.common.distance.SquaredEuclideanDistanceMeasure],
 --endPhase=[2147483647], --evaluate=null, 
--input=[/user/Data-output/clusters-1-final], --output=[analyze.txt], 
--outputFormat=[TEXT], --startPhase=[0], --tempDir=[temp]}
Exception in thread "main" java.lang.NullPointerException

Does --evaluate require an argument? As input for clusterdump I am using 
the clusters output produced by running Mahout K-Means.
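
To be clear about what I mean by MSE: the mean squared Euclidean distance from each point to its nearest centroid. Below is a minimal sketch in plain Python, not Mahout code; the function name cluster_mse and the sample data are hypothetical, and it assumes points and centroids are already available as plain numeric tuples.

```python
def cluster_mse(points, centroids):
    """Mean squared Euclidean distance from each point to its nearest centroid."""
    if not points:
        raise ValueError("points must not be empty")
    total = 0.0
    for p in points:
        # Squared distance to the closest centroid (the point's assigned cluster).
        total += min(sum((a - b) ** 2 for a, b in zip(p, c)) for c in centroids)
    return total / len(points)


# Hypothetical toy data: three 2-D points and two centroids.
points = [(0.0, 0.0), (0.0, 2.0), (10.0, 10.0)]
centroids = [(0.0, 1.0), (10.0, 10.0)]
print(cluster_mse(points, centroids))
```

This is the number I would like clusterdump (or any other Mahout tool) to report for my K-Means output.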

> Subject: Re: Mahout K-Means - Quality of the clusters
> From: pat.fer...@gmail.com
> Date: Sat, 17 May 2014 09:43:59 -0700
> To: user@mahout.apache.org
> 
> mahout clusterdump --evaluate …
> 
> provides some stats
> 
> On May 15, 2014, at 10:23 PM, hiroshi leon <hiroshi_8...@hotmail.com> wrote:
> 
> Hello everybody,
> 
> Do you know how I can get the MSE of the clusters in Mahout K-Means? 
> I would like to check the quality of the clusters. Thanks!
> 