Hi, sorry for inconvenience, I have reopened a issue [1] to fix this.
The problem exists only when K = N. (amount of centers is equal to the amount of vectors) The following parameter should work: file:///home/hadoop/file/kmeans/kmeansinput file:///home/hadoop/file/kmeansoutput 5 5 -g *6* 5 I will fix this asap. Kind regards, Martin [1] https://issues.apache.org/jira/browse/HAMA-834 2014/1/9 步青云 <[email protected]> > Hi, > I'm trying to run the Kmeans example of Hama.The args is > "file:///home/hadoop/file/kmeans/kmeansinput > file:///home/hadoop/file/kmeansoutput 5 5 -g 5 5". What's more , the attach > is "kmeansinput.txt".But I got the problem as follows: > > 14/01/09 16:38:52 WARN fs.FSInputChecker: Problem opening checksum file: > file:/home/hadoop/file/kmeans/center/cen.seq. Ignoring exception: > java.io.EOFException > at java.io.DataInputStream.readFully(DataInputStream.java:197) > at java.io.DataInputStream.readFully(DataInputStream.java:169) > at > org.apache.hadoop.fs.ChecksumFileSystem$ChecksumFSInputChecker.<init>(ChecksumFileSystem.java:134) > at > org.apache.hadoop.fs.ChecksumFileSystem.open(ChecksumFileSystem.java:283) > at > org.apache.hadoop.io.SequenceFile$Reader.openFile(SequenceFile.java:1437) > at org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1424) > at org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1417) > at org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1412) > at org.apache.hama.ml.kmeans.KMeansBSP.setup(KMeansBSP.java:90) > at > org.apache.hama.bsp.LocalBSPRunner$BSPRunner.run(LocalBSPRunner.java:255) > at > org.apache.hama.bsp.LocalBSPRunner$BSPRunner.call(LocalBSPRunner.java:286) > at org.apache.hama.bsp.LocalBSPRunner$BSPRunner.call(LocalBSPRunner.java:1) > at java.util.concurrent.FutureTask.run(FutureTask.java:262) > at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) > at java.util.concurrent.FutureTask.run(FutureTask.java:262) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > at java.lang.Thread.run(Thread.java:724) > > And "cen.seq" is empty.There might be old checksum files in your > HDFS or local filesystem.But I don't how to clean them up. > Could you help me to solve this problem? > Thanks. > Best wishes. > Ping Liu > 01-09,2014 >
