[ https://issues.apache.org/jira/browse/MAHOUT-814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13109689#comment-13109689 ]
Nathan Halko commented on MAHOUT-814: ------------------------------------- Not sure it this is related, but sounds similar. I can't run more than one power iteration, ie q=2 produces 11/09/21 11:25:46 INFO mapred.LocalJobRunner: reduce > reduce 11/09/21 11:25:46 INFO mapred.Task: Task 'attempt_local_0004_r_000000_0' done. 11/09/21 11:25:50 INFO mapred.JobClient: Cleaning up the staging area file:/tmp/hadoop-nathanhalko/mapred/staging/nathanhalko-200181280/.staging/job_local_0005 Exception in thread "main" org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory temp/ABt-job-1 already exists at org.apache.hadoop.mapreduce.lib.output.FileOutputFormat.checkOutputSpecs(FileOutputFormat.java:134) at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:830) at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:791) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:396) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1059) at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:791) at org.apache.hadoop.mapreduce.Job.submit(Job.java:465) at org.apache.mahout.math.hadoop.stochasticsvd.ABtJob.run(ABtJob.java:454) at org.apache.mahout.math.hadoop.stochasticsvd.SSVDSolver.run(SSVDSolver.java:312) at org.apache.mahout.math.hadoop.stochasticsvd.SSVDCli.run(SSVDCli.java:118) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79) at org.apache.mahout.math.hadoop.stochasticsvd.SSVDCli.main(SSVDCli.java:163) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68) at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:188) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.apache.hadoop.util.RunJar.main(RunJar.java:156) for q=0,1 everything works fine. I am running with --overwrite and I rm -rf the temp dir before running. > SSVD local tests should use their own tmp space to avoid collisions > ------------------------------------------------------------------- > > Key: MAHOUT-814 > URL: https://issues.apache.org/jira/browse/MAHOUT-814 > Project: Mahout > Issue Type: Bug > Affects Versions: 0.5 > Reporter: Grant Ingersoll > Assignee: Dmitriy Lyubimov > Priority: Minor > Fix For: 0.6 > > Attachments: MAHOUT-814.patch > > > Running Mahout in an environment with Jenkins also running and am getting: > {quote} > java.io.FileNotFoundException: /tmp/q-temp.seq (Permission denied) > at java.io.FileOutputStream.open(Native Method) > at java.io.FileOutputStream.<init>(FileOutputStream.java:209) > at > org.apache.hadoop.fs.RawLocalFileSystem$LocalFSFileOutputStream.<init>(RawLocalFileSystem.java:187) > at > org.apache.hadoop.fs.RawLocalFileSystem$LocalFSFileOutputStream.<init>(RawLocalFileSystem.java:183) > at > org.apache.hadoop.fs.RawLocalFileSystem.create(RawLocalFileSystem.java:241) > at > org.apache.hadoop.fs.ChecksumFileSystem$ChecksumFSOutputSummer.<init>(ChecksumFileSystem.java:335) > at > org.apache.hadoop.fs.ChecksumFileSystem.create(ChecksumFileSystem.java:368) > at org.apache.hadoop.fs.FileSystem.create(FileSystem.java:528) > at > org.apache.hadoop.io.SequenceFile$BlockCompressWriter.<init>(SequenceFile.java:1198) > at > org.apache.hadoop.io.SequenceFile.createWriter(SequenceFile.java:401) > at > org.apache.hadoop.io.SequenceFile.createWriter(SequenceFile.java:284) > at > org.apache.mahout.math.hadoop.stochasticsvd.qr.QRFirstStep.getTempQw(QRFirstStep.java:263) > at > org.apache.mahout.math.hadoop.stochasticsvd.qr.QRFirstStep.flushSolver(QRFirstStep.java:104) > at > org.apache.mahout.math.hadoop.stochasticsvd.qr.QRFirstStep.map(QRFirstStep.java:175) > at > org.apache.mahout.math.hadoop.stochasticsvd.qr.QRFirstStep.collect(QRFirstStep.java:279) > at > org.apache.mahout.math.hadoop.stochasticsvd.QJob$QMapper.map(QJob.java:142) > at > org.apache.mahout.math.hadoop.stochasticsvd.QJob$QMapper.map(QJob.java:71) > at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144) > at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764) > at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370) > at > org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212) > {quote} > Also seeing the following tests fail: > {quote} > Tests in error: > > testSSVDSolverSparse(org.apache.mahout.math.hadoop.stochasticsvd.LocalSSVDSolverSparseSequentialTest): > Q job unsuccessful. > > testSSVDSolverPowerIterations1(org.apache.mahout.math.hadoop.stochasticsvd.LocalSSVDSolverSparseSequentialTest): > Q job unsuccessful. > > testSSVDSolverPowerIterations1(org.apache.mahout.math.hadoop.stochasticsvd.LocalSSVDSolverDenseTest): > Q job unsuccessful. > > testSSVDSolverDense(org.apache.mahout.math.hadoop.stochasticsvd.LocalSSVDSolverDenseTest): > Q job unsuccessful. > {quote} > I haven't checked all of them, but I suspect they are all due to the same > reason. We should dynamically create a temp area for each test using > temporary directories under the main temp dir. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira