Try examples /bin/cluster-reuters.sh Sent from my iPhone
> On Jan 22, 2014, at 9:56 AM, Sznajder ForMailingList > <bs4mailingl...@gmail.com> wrote: > > Hi, > > I wished to run the mahout example for Kmeans algorithm. > > I suppose that it is: > org.apache.mahout.clustering.syntheticcontrol.kmeans.Job > > (1) Is it right? > > > It looks for a /testdata/ directory. I did not find it > > (2) Where is it, please? > > > I thought to use the reuters data set described in Manning book and I > extracted it to my disk and pointed to this directory in the main method. > > However, I get the following, when running the Job: > > java.lang.NumberFormatException: For input string: "amex" > at sun.misc.FloatingDecimal.readJavaFormatString(Unknown Source) > at java.lang.Double.valueOf(Unknown Source) > at > org.apache.mahout.clustering.conversion.InputMapper.map(InputMapper.java:48) > at > org.apache.mahout.clustering.conversion.InputMapper.map(InputMapper.java:1) > at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144) > at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:621) > at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305) > at > org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:177) > > > (3) What do I do wrong? > > Best regards > Benjamin