I took a look at the code, and the only thing that seemed to be required was
the Sequence file. I pulled the part-00000 file from output/clusterPoints/
folder off of my Hadoop cluster, and tried the following command:
bin/mahout clusterdump --seqFileDir part-00000 --output testFile.txt
This resulted in the following error:
no HADOOP_CONF_DIR or HADOOP_HOME set, running locally
Apr 9, 2010 3:00:25 PM org.slf4j.impl.JCLLoggerAdapter error
SEVERE: MahoutDriver failed with args: [--seqFileDir, part-00000, --output,
testFile.txt, null]
null
Exception in thread "main" java.lang.NullPointerException
at
org.apache.mahout.utils.clustering.ClusterDumper.printClusters(ClusterDumper.java:121)
at
org.apache.mahout.utils.clustering.ClusterDumper.main(ClusterDumper.java:302)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at
org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:172)
Strange, since $HADOOP_HOME is actually set. Thoughts?
Jeff Eastman wrote:
>
> The dictionary file contains a list (not sure how its delimited) of
> element names for the input Vectors and is optional. See the new code in
> trunk/utils in TestClusterDumper for some examples. I need to write test
> sfor meanshift and also fuzzy kmeans to make sure they work but I
> imagine they do. I also need to write tests that include the points, but
> that appears to be done in memory so it likely won't scale to your
> 5-node data set.
>
> Jeff
>
> adam35413 wrote:
>> I have been able to successfully run the kmean and meanshift examples on
>> a
>> 5-node Hadoop cluster. However, when it comes to dealing with the
>> output, I
>> am a bit confused. I found the following page:
>> http://cwiki.apache.org/MAHOUT/viewing-results.html, but when I went to
>> track down the dictionary file I was unable to find it. Do I need to
>> generate the dictionary file separately or manually?
>>
>> Thanks!
>>
>
>
>
--
View this message in context:
http://n3.nabble.com/Dealing-with-kmean-and-meanshift-output-tp708824p709022.html
Sent from the Mahout User List mailing list archive at Nabble.com.