I would investigate all of those 'Unable to add .....' messages first. Checkout the latest code and run a clean build.
On Friday, December 20, 2013 5:58 PM, Sameer Tilak <ssti...@live.com> wrote: Suneel: Yes, I am working off of trunk. I saw that example. In my case the data is numeric -- I assume that means no need for dictionary etc . I am not sure what is going on, but I still get the following errors: ./mahout clusterdump -i /scratch/dummyvectoroutput/clusters-*-final -o /scratch/clusterdump MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath. Warning: $HADOOP_HOME is deprecated. Running on hadoop, using /users/p529444/software/hadoop-1.0.3/bin/hadoop and HADOOP_CONF_DIR=/apps/hadoop/hadoop-conf MAHOUT-JOB: /apps/mahout/trunk/examples/target/mahout-examples-0.9-SNAPSHOT-job.jar Warning: $HADOOP_HOME is deprecated. 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.clustering.ClusterDumper 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainLogistic 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.lucene.Driver 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunAdaptiveLogistic 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SequenceFileDumper 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.PrintResourceOrFile 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.ValidateAdaptiveLogistic 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.WikipediaToSequenceFile 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.ConfusionMatrixDumper 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.regex.RegexConverterDriver 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromMailArchives 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.TrainAdaptiveLogistic 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.VectorDumper 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.RowIdJob 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ClusterQualitySummarizer 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.SplitInput 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.clustering.streaming.tools.ResplitSequenceFiles 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromLuceneStorageDriver 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.MatrixDumper 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.text.SequenceFilesFromDirectory 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.classifier.sgd.RunLogistic 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.ConcatenateVectorsJob 13/12/20 14:57:02 WARN driver.MahoutDriver: Unable to add class: org.apache.mahout.utils.vectors.arff.Driver Unknown program 'clusterdump' chosen. Valid program names are: baumwelch: : Baum-Welch algorithm for unsupervised HMM training canopy: : Canopy clustering cleansvd: : Cleanup and verification of SVD output clusterpp: : Groups Clustering Output In Clusters cvb: : LDA via Collapsed Variation Bayes (0th deriv. approx) cvb0_local: : LDA via Collapsed Variation Bayes, in memory locally. evaluateFactorization: : compute RMSE and MAE of a rating matrix factorization against probes fkmeans: : Fuzzy K-means clustering hmmpredict: : Generate random sequence of observations by given HMM itemsimilarity: : Compute the item-item-similarities for item-based collaborative filtering kmeans: : K-means clustering matrixmult: : Take the product of two matrices parallelALS: : ALS-WR factorization of a rating matrix recommendfactorized: : Compute recommendations using the factorization of a rating matrix recommenditembased: : Compute recommendations using item-based collaborative filtering rowsimilarity: : Compute the pairwise similarities of the rows of a matrix seq2encoded: : Encoded Sparse Vector generation from Text sequence files seq2sparse: : Sparse Vector generation from Text sequence files spectralkmeans: : Spectral k-means clustering splitDataset: : split a rating dataset into training and probe parts ssvd: : Stochastic SVD streamingkmeans: : Streaming k-means clustering svd: : Lanczos Singular Value Decomposition testnb: : Test the Vector-based Bayes classifier trainnb: : Train the Vector-based Bayes classifier transpose: : Take the transpose of a matrix vecdist: : Compute the distances between a set of Vectors (or Cluster or Canopy, they must fit in memory) and a list of Vectors viterbi: : Viterbi decoding of hidden states from given output states sequence > Date: Fri, 20 Dec 2013 14:42:33 -0800 > From: suneel_mar...@yahoo.com > Subject: Re: clusterdump > To: user@mahout.apache.org > > Are you working off of trunk? 'clusterdump' is being used in > examples/bin/cluster-reuters.sh. > > > > > > On Friday, December 20, 2013 5:33 PM, Sameer Tilak <ssti...@live.com> wrote: > > Hi All, > I was able to do the clustering and need some help with viewing the result. I > get the following problem. > > ./mahout clusterdump -i /scratch/dummyvectoroutput/clusters-*-final -d > /scratch/dummyvectorfinalclusters > MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath. > Warning: $HADOOP_HOME is deprecated. > > Running on hadoop, using /users/p529444/software/hadoop-1.0.3/bin/hadoop and > HADOOP_CONF_DIR=/apps/hadoop/hadoop-conf > MAHOUT-JOB: > /apps/mahout/trunk/examples/target/mahout-examples-0.9-SNAPSHOT-job.jar > Warning: $HADOOP_HOME is deprecated. > > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.utils.clustering.ClusterDumper > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.classifier.sgd.TrainLogistic > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.utils.vectors.lucene.Driver > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.classifier.sgd.RunAdaptiveLogistic > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.utils.SequenceFileDumper > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.classifier.sgd.PrintResourceOrFile > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.classifier.sgd.ValidateAdaptiveLogistic > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.text.WikipediaToSequenceFile > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.classifier.ConfusionMatrixDumper > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.utils.regex.RegexConverterDriver > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.text.SequenceFilesFromMailArchives > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.classifier.sgd.TrainAdaptiveLogistic > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.utils.vectors.VectorDumper > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.utils.vectors.RowIdJob > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.clustering.streaming.tools.ClusterQualitySummarizer > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.utils.SplitInput > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.clustering.streaming.tools.ResplitSequenceFiles > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.text.SequenceFilesFromLuceneStorageDriver > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.utils.MatrixDumper > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.text.SequenceFilesFromDirectory > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.classifier.sgd.RunLogistic > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.utils.ConcatenateVectorsJob > 13/12/20 14:21:56 WARN driver.MahoutDriver: Unable to add class: > org.apache.mahout.utils.vectors.arff.Driver > Unknown program 'clusterdump' chosen. > Valid program names are: > baumwelch: : Baum-Welch algorithm for unsupervised HMM training > canopy: : Canopy clustering > cleansvd: : Cleanup and verification of SVD output > clusterpp: : Groups Clustering Output In Clusters > cvb: : LDA via Collapsed Variation Bayes (0th deriv. approx) > cvb0_local: : LDA via Collapsed Variation Bayes, in memory locally. > evaluateFactorization: : compute RMSE and MAE of a rating matrix >factorization against probes > fkmeans: : Fuzzy K-means clustering > hmmpredict: : Generate random sequence of observations by given HMM > itemsimilarity: : Compute the item-item-similarities for item-based >collaborative filtering > kmeans: : K-means clustering > matrixmult: : Take the product of two matrices > parallelALS: : ALS-WR factorization of a rating matrix > recommendfactorized: : Compute recommendations using the factorization of a >rating matrix > recommenditembased: : Compute recommendations using item-based >collaborative filtering > rowsimilarity: : Compute the pairwise similarities of the rows of a matrix > seq2encoded: : Encoded Sparse Vector generation from Text sequence files > seq2sparse: : Sparse Vector generation from Text sequence files > spectralkmeans: : Spectral k-means clustering > splitDataset: : split a rating dataset into training and probe parts > ssvd: : Stochastic SVD > streamingkmeans: : Streaming k-means clustering > svd: : Lanczos Singular Value Decomposition > testnb: : Test the Vector-based Bayes classifier > trainnb: : Train the Vector-based Bayes classifier > transpose: : Take the transpose of a matrix > vecdist: : Compute the distances between a set of Vectors (or Cluster or >Canopy, they must fit in memory) and a list of Vectors > viterbi: : Viterbi decoding of hidden states from given output states >sequence