Hie,
    I am quite new to the  mahout. I have successfully installed it and
want to run kmeans my own parameters and files. when i run kmeans with
following parameters i get the error ('clusters' is a fake folder)

$MAHOUT_HOME/bin/mahout kmeans -i testdata/tfidf-vectors/ -o output2 -c
clusters -dm org.apache.mahout.common.distance.CosineDistanceMeasure -x 10
-ow -cd .1 -k 5 --clustering -cl


Running on hadoop, using Downloads/hadoop-1.2.1/bin/hadoop and
HADOOP_CONF_DIR=
MAHOUT-JOB:
/usr/local/mahout/mahout-distribution-0.9/examples/target/mahout-examples-0.9-job.jar
14/07/02 12:09:00 INFO common.AbstractJob: Command line arguments:
{--clustering=null, --clusters=[clusters], --convergenceDelta=[.1],
--distanceMeasure=[org.apache.mahout.common.distance.CosineDistanceMeasure],
--endPhase=[2147483647], --input=[testdata/tfidf-vectors/], --maxIter=[10],
--method=[mapreduce], --numClusters=[5], --output=[output2],
--overwrite=null, --startPhase=[0], --tempDir=[temp]}
14/07/02 12:09:01 INFO common.HadoopUtil: Deleting clusters
14/07/02 12:09:03 INFO util.NativeCodeLoader: Loaded the native-hadoop
library
14/07/02 12:09:03 INFO zlib.ZlibFactory: Successfully loaded & initialized
native-zlib library
14/07/02 12:09:03 INFO compress.CodecPool: Got brand-new compressor
14/07/02 12:09:04 INFO kmeans.RandomSeedGenerator: Wrote 5 Klusters to
clusters/part-randomSeed
14/07/02 12:09:07 INFO kmeans.KMeansDriver: Input: testdata/tfidf-vectors
Clusters In: clusters/part-randomSeed Out: output2
14/07/02 12:09:07 INFO kmeans.KMeansDriver: convergence: 0.1 max
Iterations: 10
14/07/02 12:09:07 INFO compress.CodecPool: Got brand-new decompressor
Exception in thread "main" java.lang.IllegalStateException: No input
clusters found in clusters/part-randomSeed. Check your -c argument.
    at
org.apache.mahout.clustering.kmeans.KMeansDriver.buildClusters(KMeansDriver.java:206)
    at
org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:140)
    at
org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:103)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at
org.apache.mahout.clustering.kmeans.KMeansDriver.main(KMeansDriver.java:47)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at
org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
    at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
    at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:606)
    at org.apache.hadoop.util.RunJar.main(RunJar.java:160)


Regards,
Deepti Mittal

Reply via email to