[ https://issues.apache.org/jira/browse/MAHOUT-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13404266#comment-13404266 ]
Leting Wu edited comment on MAHOUT-1034 at 6/29/12 10:06 PM: ------------------------------------------------------------- I tried Mahout 0.6. Does not look well either. My account is one of the ones on the workstation. I can run sudo to change the global setting. New to Mahout and really need help. Thanks. {noformat} $ ./examples/bin/classify-20newsgroups.sh Please select a number to choose the corresponding task to run 1. naivebayes 2. sgd 3. clean -- cleans up the work area in /tmp/mahout-work-lwu Enter your choice : 1 ok. You chose 1 and we'll use naivebayes creating work directory at /tmp/mahout-work-lwu Downloading 20news-bydate % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 13.7M 100 13.7M 0 0 186k 0 0:01:15 0:01:15 --:--:-- 441k Extracting... Preparing Training Data MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath. Running on hadoop, using HADOOP_HOME=/opt/hadoop HADOOP_CONF_DIR=/opt/hadoop/conf MAHOUT-JOB: /opt/mahout/examples/target/mahout-examples-0.6-job.jar 12/06/29 14:56:50 WARN driver.MahoutDriver: No org.apache.mahout.classifier.bayes.PrepareTwentyNewsgroups.props found on classpath, will use command-line arguments only 12/06/29 14:56:51 INFO driver.MahoutDriver: Program took 1143 ms (Minutes: 0.01905) Preparing Test Data MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath. Running on hadoop, using HADOOP_HOME=/opt/hadoop HADOOP_CONF_DIR=/opt/hadoop/conf MAHOUT-JOB: /opt/mahout/examples/target/mahout-examples-0.6-job.jar 12/06/29 14:56:52 WARN driver.MahoutDriver: No org.apache.mahout.classifier.bayes.PrepareTwentyNewsgroups.props found on classpath, will use command-line arguments only 12/06/29 14:56:53 INFO driver.MahoutDriver: Program took 794 ms (Minutes: 0.013233333333333333) DEPRECATED: Use of this script to execute hdfs command is deprecated. Instead use the hdfs command for it. rmr: DEPRECATED: Please use 'rm -r' instead. 12/06/29 14:56:55 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (820233764) connection to localhost/127.0.0.1:8888 from lwu,5,main] java.lang.NullPointerException at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852) at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781) rmr: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888; DEPRECATED: Use of this script to execute hdfs command is deprecated. Instead use the hdfs command for it. rmr: DEPRECATED: Please use 'rm -r' instead. 12/06/29 14:56:56 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (820233764) connection to localhost/127.0.0.1:8888 from lwu,5,main] java.lang.NullPointerException at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852) at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781) rmr: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888; DEPRECATED: Use of this script to execute hdfs command is deprecated. Instead use the hdfs command for it. 12/06/29 14:56:56 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (165149691) connection to localhost/127.0.0.1:8888 from lwu,5,main] java.lang.NullPointerException at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852) at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781) put: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888; {noformat} was (Author: rhinewlt): I tried Mahout 0.6. Does not look well either. My account is one of the ones on the workstation. I can run sudo to change the global setting. {noformat} $ ./examples/bin/classify-20newsgroups.sh Please select a number to choose the corresponding task to run 1. naivebayes 2. sgd 3. clean -- cleans up the work area in /tmp/mahout-work-lwu Enter your choice : 1 ok. You chose 1 and we'll use naivebayes creating work directory at /tmp/mahout-work-lwu Downloading 20news-bydate % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 13.7M 100 13.7M 0 0 186k 0 0:01:15 0:01:15 --:--:-- 441k Extracting... Preparing Training Data MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath. Running on hadoop, using HADOOP_HOME=/opt/hadoop HADOOP_CONF_DIR=/opt/hadoop/conf MAHOUT-JOB: /opt/mahout/examples/target/mahout-examples-0.6-job.jar 12/06/29 14:56:50 WARN driver.MahoutDriver: No org.apache.mahout.classifier.bayes.PrepareTwentyNewsgroups.props found on classpath, will use command-line arguments only 12/06/29 14:56:51 INFO driver.MahoutDriver: Program took 1143 ms (Minutes: 0.01905) Preparing Test Data MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath. Running on hadoop, using HADOOP_HOME=/opt/hadoop HADOOP_CONF_DIR=/opt/hadoop/conf MAHOUT-JOB: /opt/mahout/examples/target/mahout-examples-0.6-job.jar 12/06/29 14:56:52 WARN driver.MahoutDriver: No org.apache.mahout.classifier.bayes.PrepareTwentyNewsgroups.props found on classpath, will use command-line arguments only 12/06/29 14:56:53 INFO driver.MahoutDriver: Program took 794 ms (Minutes: 0.013233333333333333) DEPRECATED: Use of this script to execute hdfs command is deprecated. Instead use the hdfs command for it. rmr: DEPRECATED: Please use 'rm -r' instead. 12/06/29 14:56:55 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (820233764) connection to localhost/127.0.0.1:8888 from lwu,5,main] java.lang.NullPointerException at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852) at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781) rmr: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888; DEPRECATED: Use of this script to execute hdfs command is deprecated. Instead use the hdfs command for it. rmr: DEPRECATED: Please use 'rm -r' instead. 12/06/29 14:56:56 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (820233764) connection to localhost/127.0.0.1:8888 from lwu,5,main] java.lang.NullPointerException at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852) at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781) rmr: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888; DEPRECATED: Use of this script to execute hdfs command is deprecated. Instead use the hdfs command for it. 12/06/29 14:56:56 WARN ipc.Client: Unexpected error reading responses on connection Thread[IPC Client (165149691) connection to localhost/127.0.0.1:8888 from lwu,5,main] java.lang.NullPointerException at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:852) at org.apache.hadoop.ipc.Client$Connection.run(Client.java:781) put: Failed on local exception: java.io.IOException: Broken pipe; Host Details : local host is: "puser-lwu/127.0.0.1"; destination host is: "localhost":8888; {noformat} > ERROR in Navie Bayes Training(trainnb) > -------------------------------------- > > Key: MAHOUT-1034 > URL: https://issues.apache.org/jira/browse/MAHOUT-1034 > Project: Mahout > Issue Type: Bug > Components: Classification > Affects Versions: 0.7 > Environment: Ubuntu 11.04 > Reporter: Leting Wu > Priority: Critical > > When run either examples/classify-20newsgrouops.sh or ash-email-examples.sh, > trainnb always fails: > {noformat} > INFO mapred.JobClient: Task Id : attempt_201206281546_0003_m_000000_0, Status > : FAILED > java.lang.IllegalArgumentException > at > com.google.common.base.Preconditions.checkArgument(Preconditions.java:72) > at > org.apache.mahout.classifier.naivebayes.training.WeightsMapper.setup(WeightsMapper.java:42) > at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142) > at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:647) > at org.apache.hadoop.mapred.MapTask.run(MapTask.java:323) > at org.apache.hadoop.mapred.Child$4.run(Child.java:270) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:396) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1177) > at org.apache.hadoop.mapred.Child.main(Child.java:264) > {noformat} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira