Try examples /bin/cluster-reuters.sh

Sent from my iPhone

> On Jan 22, 2014, at 9:56 AM, Sznajder ForMailingList 
> <bs4mailingl...@gmail.com> wrote:
> 
> Hi,
> 
> I wished to run the mahout example for Kmeans algorithm.
> 
> I suppose that it is:
> org.apache.mahout.clustering.syntheticcontrol.kmeans.Job
> 
> (1) Is it right?
> 
> 
> It looks for a /testdata/ directory. I did not find it
> 
> (2) Where is it, please?
> 
> 
> I thought to use the reuters data set described in Manning book and I
> extracted it to my disk and pointed to this directory in the main method.
> 
> However, I get the following, when running the Job:
> 
> java.lang.NumberFormatException: For input string: "amex"
>    at sun.misc.FloatingDecimal.readJavaFormatString(Unknown Source)
>    at java.lang.Double.valueOf(Unknown Source)
>    at
> org.apache.mahout.clustering.conversion.InputMapper.map(InputMapper.java:48)
>    at
> org.apache.mahout.clustering.conversion.InputMapper.map(InputMapper.java:1)
>    at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
>    at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:621)
>    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
>    at
> org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:177)
> 
> 
> (3) What do I do wrong?
> 
> Best regards
> Benjamin

Reply via email to