Yeah. it would seem CDH releases of Mahout produce some sort of cut-down version of such. I suggest to switch to official release tarbal (or write to Cloudera support about it).
On Wed, Mar 5, 2014 at 8:38 AM, Andrew Musselman <andrew.mussel...@gmail.com > wrote: > I'm not sure about this either but I think these are all the changes to > Mahout in CDH 4.6.0: > http://archive.cloudera.com/cdh4/cdh/4/mahout-0.7-cdh4.6.0.CHANGES.txt > > MAHOUT-1291 > > MAHOUT-1033 > > MAHOUT-1142 > > > > On Wed, Mar 5, 2014 at 8:30 AM, Suneel Marthi <suneel_mar...@yahoo.com > >wrote: > > > Not sure if the CDH4 patches on top of 0.7 has fixes for M-1067 and > M-1098 > > which address the issues u r seeing. > > > > > > > > The second part of the issue u r seeing with Mahout 0.9 distro seems to > be > > related to how u set it up on CDH4. I apologize for not being helpful > here > > as I am not a CDH4 user or expert. > > > > Sean? > > > > > > > > > > On Wednesday, March 5, 2014 10:23 AM, Kevin Moulart < > > kevinmoul...@gmail.com> wrote: > > > > Previous mail sent only to Suneel : (my bad sorry) > > > > According to my stacktrace it seems that I am running mahout 0.7 indeed. > > > That's the version provided by Cloudera when I install mahout using > yum. > > > But according to Sean Owen, it really is a 0.8 inside... > > > Anyway I tried with the compiled version and it didn't work : > > > Running on hadoop, using > /opt/cloudera/parcels/CDH/lib/hadoop/bin/hadoop > > > and HADOOP_CONF_DIR= > > > Exception in thread "main" java.lang.NoSuchMethodError: > > > org.apache.hadoop.util.ProgramDriver.driver([Ljava/lang/String;)V > > > at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:122) > > > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > > > at > > > > > > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) > > > at > > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > > > at java.lang.reflect.Method.invoke(Method.java:606) > > > at org.apache.hadoop.util.RunJar.main(RunJar.java:208) > > > > MAHOUT-JOB: > > > > /home/cacf/Downloads/mahout-distribution-0.9/mahout-examples-0.9-job.jar > > > > > > > And now I changed the conf directory of mahout 0.9 to be linked to the > one > > used by the existing working mahout and the trace changes : > > > > MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath. > > Running on hadoop, using /opt/cloudera/parcels/CDH/lib/hadoop/bin/hadoop > > and HADOOP_CONF_DIR=/etc/hadoop/conf > > MAHOUT-JOB: > > > > > /home/myCompany/Downloads/mahout-distribution-0.9/mahout-examples-0.9-job.jar > > 14/03/05 16:16:23 WARN driver.MahoutDriver: Unable to add class: > > org.apache.mahout.clustering.meanshift.MeanShiftCanopyDriver > > java.lang.ClassNotFoundException: > > org.apache.mahout.clustering.meanshift.MeanShiftCanopyDriver > > at java.net.URLClassLoader$1.run(URLClassLoader.java:366) > > at java.net.URLClassLoader$1.run(URLClassLoader.java:355) > > at java.security.AccessController.doPrivileged(Native Method) > > at java.net.URLClassLoader.findClass(URLClassLoader.java:354) > > at java.lang.ClassLoader.loadClass(ClassLoader.java:425) > > at java.lang.ClassLoader.loadClass(ClassLoader.java:358) > > at java.lang.Class.forName0(Native Method) > > at java.lang.Class.forName(Class.java:190) > > at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) > > at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:118) > > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > > at > > > > > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) > > at > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > > at java.lang.reflect.Method.invoke(Method.java:606) > > at org.apache.hadoop.util.RunJar.main(RunJar.java:208) > > 14/03/05 16:16:23 WARN driver.MahoutDriver: Unable to add class: > > org.apache.mahout.clustering.spectral.eigencuts.EigencutsDriver > > java.lang.ClassNotFoundException: > > org.apache.mahout.clustering.spectral.eigencuts.EigencutsDriver > > at java.net.URLClassLoader$1.run(URLClassLoader.java:366) > > at java.net.URLClassLoader$1.run(URLClassLoader.java:355) > > at java.security.AccessController.doPrivileged(Native Method) > > at java.net.URLClassLoader.findClass(URLClassLoader.java:354) > > at java.lang.ClassLoader.loadClass(ClassLoader.java:425) > > at java.lang.ClassLoader.loadClass(ClassLoader.java:358) > > at java.lang.Class.forName0(Native Method) > > at java.lang.Class.forName(Class.java:190) > > at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) > > at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:118) > > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > > at > > > > > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) > > at > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > > at java.lang.reflect.Method.invoke(Method.java:606) > > at org.apache.hadoop.util.RunJar.main(RunJar.java:208) > > 14/03/05 16:16:23 WARN driver.MahoutDriver: Unable to add class: > > org.apache.mahout.clustering.minhash.MinHashDriver > > java.lang.ClassNotFoundException: > > org.apache.mahout.clustering.minhash.MinHashDriver > > at java.net.URLClassLoader$1.run(URLClassLoader.java:366) > > at java.net.URLClassLoader$1.run(URLClassLoader.java:355) > > at java.security.AccessController.doPrivileged(Native Method) > > at java.net.URLClassLoader.findClass(URLClassLoader.java:354) > > at java.lang.ClassLoader.loadClass(ClassLoader.java:425) > > at java.lang.ClassLoader.loadClass(ClassLoader.java:358) > > at java.lang.Class.forName0(Native Method) > > at java.lang.Class.forName(Class.java:190) > > at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) > > at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:118) > > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > > at > > > > > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) > > at > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > > at java.lang.reflect.Method.invoke(Method.java:606) > > at org.apache.hadoop.util.RunJar.main(RunJar.java:208) > > 14/03/05 16:16:23 WARN driver.MahoutDriver: Unable to add class: > > org.apache.mahout.clustering.dirichlet.DirichletDriver > > java.lang.ClassNotFoundException: > > org.apache.mahout.clustering.dirichlet.DirichletDriver > > at java.net.URLClassLoader$1.run(URLClassLoader.java:366) > > at java.net.URLClassLoader$1.run(URLClassLoader.java:355) > > at java.security.AccessController.doPrivileged(Native Method) > > at java.net.URLClassLoader.findClass(URLClassLoader.java:354) > > at java.lang.ClassLoader.loadClass(ClassLoader.java:425) > > at java.lang.ClassLoader.loadClass(ClassLoader.java:358) > > at java.lang.Class.forName0(Native Method) > > at java.lang.Class.forName(Class.java:190) > > at org.apache.mahout.driver.MahoutDriver.addClass(MahoutDriver.java:237) > > at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:118) > > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > > at > > > > > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) > > at > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > > at java.lang.reflect.Method.invoke(Method.java:606) > > at org.apache.hadoop.util.RunJar.main(RunJar.java:208) > > Exception in thread "main" java.lang.NoSuchMethodError: > > org.apache.hadoop.util.ProgramDriver.driver([Ljava/lang/String;)V > > at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:122) > > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > > at > > > > > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) > > at > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > > at java.lang.reflect.Method.invoke(Method.java:606) > > at org.apache.hadoop.util.RunJar.main(RunJar.java:208) > > > > Changing the hadoop home to > > /opt/cloudera/parcels/CDH/lib/hadoop/bin/hadoop-mapreduce doesn't change > > the output, nor does > > /opt/cloudera/parcels/CDH/lib/hadoop/bin/hadoop-0.20-mapreduce > > > > Any idea now ? > > > > > > > > 2014-03-05 15:45 GMT+01:00 Suneel Marthi <suneel_mar...@yahoo.com>: > > > > Are u using Mahout 0.7 ? > > > > > > From this line in ur stacktrace that seems to be the case: > > > MAHOUT-JOB: /usr/lib/mahout/mahout-examples-0.7-cdh4.5.0-job.jar > > > > > > You could build Mahout outside of CDH from Mahout trunk and put the > jars > > > onto CDH5. > > > I am no Cloudera expert or CDH5 user to help with CDHx build. > > > > > > > > > > > > > > > > > > > > > On Wednesday, March 5, 2014 9:30 AM, Kevin Moulart < > > > kevinmoul...@gmail.com> wrote: > > > Hi and thanks for your help! > > > > > > I had been told that the version of mahout used by Cloudera (CDH 4.6) > was > > > in fact 0.8 with a patch for mr2 support. > > > ( > > > > > > http://mail-archives.apache.org/mod_mbox/mahout-user/201402.mbox/%3CCAEccTywqSAKA_HeX4vTZ-5XPmKtj5b8zMGQUfn5qRsiq=7o=u...@mail.gmail.com%3E > > ) > > > > > > But I tried to install 0.9 on my own, by compiling it with mvn after I > > > changed the pom.xml : > > > > > > - Added cloudera repository : > > > > > > <repository> > > > <id>cloudera-repo</id> > > > <name>Cloudera Repository</name> > > > <url>https://repository.cloudera.com/artifactory/cloudera-repos > > > </url> > > > </repository> > > > > > > - Changed the version of hadoop to use : > > > <hadoop.1.version>2.0.0-mr1-cdh4.6.0</hadoop.1.version> > > > - I tried adding this one too : > > > <hadoop2.version>2.0.0-cdh4.6.0</hadoop2.version> > > > > > > But then I get a lot of errors when Maven begins to compile the core > > > package : > > > https://gist.github.com/kmoulart/9368193 > > > > > > Could you tell me what I did wrong ? > > > > > > > > > 2014-03-04 19:02 GMT+01:00 Suneel Marthi <suneel_mar...@yahoo.com>: > > > > > > The -us option was fixed for Mahout 0.8, seems like u r using Mahout > 0.7 > > > which had this issue (from ur stacktrace, its apparent u r using Mahout > > > 0.7). Please upgrade to the latest mahout version. > > > > > > > > > > > > > > > > > > On Tuesday, March 4, 2014 8:54 AM, Kevin Moulart < > kevinmoul...@gmail.com > > > > > > wrote: > > > > > > Hi, > > > > > > I'm trying to apply a PCA to reduce the dimension of a matrix of 1603 > > > columns and 100.000 to 30.000.000 lines using ssvd with the pca option, > > and > > > I always get a StackOverflowError : > > > > > > Here is my command line : > > > mahout ssvd -i /user/myUser/Echant100k -o /user/myUser/Echant/SVD100 -k > > 100 > > > -pca "true" -U "false" -V "false" -t 3 -ow > > > > > > I also tried to put "-us true" as mentionned in > > > > > > > > > https://cwiki.apache.org/confluence/download/attachments/27832158/SSVD-CLI.pdf?version=18&modificationDate=1381347063000&api=v2but > > > the option is not available anymore. > > > > > > The output of the previous command is : > > > MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath. > > > Running on hadoop, using > /opt/cloudera/parcels/CDH/lib/hadoop/bin/hadoop > > > and HADOOP_CONF_DIR=/etc/hadoop/conf > > > MAHOUT-JOB: /usr/lib/mahout/mahout-examples-0.7-cdh4.5.0-job.jar > > > 14/03/04 14:45:16 INFO common.AbstractJob: Command line arguments: > > > {--abtBlockHeight=[200000], --blockHeight=[10000], --broadcast=[true], > > > --computeU=[false], --computeV=[false], --endPhase=[2147483647], > > > --input=[/user/myUser/Echant100k], --minSplitSize=[-1], > > > --outerProdBlockHeight=[30000], --output=[/user/myUser/Echant/SVD100], > > > --oversampling=[15], --overwrite=null, --pca=[true], --powerIter=[0], > > > --rank=[100], --reduceTasks=[3], --startPhase=[0], --tempDir=[temp], > > > --uHalfSigma=[false], --vHalfSigma=[false]} > > > Exception in thread "main" java.lang.StackOverflowError > > > at > > > > > > > > > org.apache.mahout.math.hadoop.MatrixColumnMeansJob.run(MatrixColumnMeansJob.java:55) > > > at > > > > > > > > > org.apache.mahout.math.hadoop.MatrixColumnMeansJob.run(MatrixColumnMeansJob.java:55) > > > at > > > > > > > > > org.apache.mahout.math.hadoop.MatrixColumnMeansJob.run(MatrixColumnMeansJob.java:55) > > > ... > > > > > > I search online and didn't find a solution to my problem. > > > > > > Can you help me ? > > > > > > Thanks in advance, > > > > > > -- > > > Kévin Moulart > > > > > > > > > > > > > > > -- > > > Kévin Moulart > > > GSM France : +33 7 81 06 10 10 > > > GSM Belgique : +32 473 85 23 85 > > > Téléphone fixe : +32 2 771 88 45 > > > > > > > > > > > > > > > -- > > Kévin Moulart > > GSM France : +33 7 81 06 10 10 > > GSM Belgique : +32 473 85 23 85 > > Téléphone fixe : +32 2 771 88 45 > > >