[ https://issues.apache.org/jira/browse/MAHOUT-524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13107633#comment-13107633 ]
Lance Norskog commented on MAHOUT-524: -------------------------------------- Possibly a little help. When run from the command line via mvn exec, this is the error log. Note that a) an exception happens in an early m/r pass, and b) the exception is ignored by the full job executor. (MacOS X "Kitty Liver") _lance$ MAVENOPTS=Xmx1000m mvn -q exec:java -Dexec.mainClass="org.apache.mahout.clustering.display.DisplaySpectralKMeans"_ {code} SLF4J: Class path contains multiple SLF4J bindings. SLF4J: Found binding in [jar:file:/Users/lancenorskog/.m2/repository/org/slf4j/slf4j-log4j12/1.6.1/slf4j-log4j12-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class] SLF4J: Found binding in [jar:file:/Users/lancenorskog/.m2/repository/org/slf4j/slf4j-jcl/1.6.1/slf4j-jcl-1.6.1.jar!/org/slf4j/impl/StaticLoggerBinder.class] SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation. 11/09/18 22:25:26 INFO common.HadoopUtil: Deleting samples 11/09/18 22:25:26 INFO common.HadoopUtil: Deleting output 11/09/18 22:25:26 INFO display.DisplayClustering: Generating 500 samples m=[1.0, 1.0] sd=3.0 11/09/18 22:25:26 INFO display.DisplayClustering: Generating 300 samples m=[1.0, 0.0] sd=0.5 11/09/18 22:25:26 INFO display.DisplayClustering: Generating 300 samples m=[0.0, 2.0] sd=0.1 11/09/18 22:25:28 WARN mapred.JobClient: Use GenericOptionsParser for parsing the arguments. Applications should implement Tool for the same. 11/09/18 22:25:28 WARN mapred.JobClient: No job jar file set. User classes may not be found. See JobConf(Class) or JobConf#setJar(String). 11/09/18 22:25:28 INFO input.FileInputFormat: Total input paths to process : 1 11/09/18 22:25:28 INFO mapred.JobClient: Running job: job_local_0001 11/09/18 22:25:28 INFO mapred.MapTask: io.sort.mb = 100 *11/09/18 22:25:29 WARN mapred.LocalJobRunner: job_local_0001 java.lang.OutOfMemoryError: Java heap space at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.<init>(MapTask.java:949) at org.apache.hadoop.mapred.MapTask$NewOutputCollector.<init>(MapTask.java:674) at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:756) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)* 11/09/18 22:25:29 INFO mapred.JobClient: map 0% reduce 0% 11/09/18 22:25:29 INFO mapred.JobClient: Job complete: job_local_0001 {code} > DisplaySpectralKMeans example fails > ----------------------------------- > > Key: MAHOUT-524 > URL: https://issues.apache.org/jira/browse/MAHOUT-524 > Project: Mahout > Issue Type: Bug > Components: Clustering > Affects Versions: 0.4, 0.5 > Reporter: Jeff Eastman > Assignee: Shannon Quinn > Labels: clustering, k-means, visualization > Fix For: 0.6 > > Attachments: aff.txt, raw.txt, spectralkmeans.png > > > I've committed a new display example that attempts to push the standard > mixture of models data set through spectral k-means. After some tweaking of > configuration arguments and a bug fix in EigenCleanupJob it runs spectral > k-means to completion. The display example is expecting 2-d clustered points > and the example is producing 5-d points. Additional I/O work is needed before > this will play with the rest of the clustering algorithms. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira