[ 
https://issues.apache.org/jira/browse/MAHOUT-916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13642654#comment-13642654
 ] 

Isabel Drost commented on MAHOUT-916:
-------------------------------------

The problem is that some of our tests run in a local Hadoop cluster. Starting 
such a cluster automatically creates a directory /tmp/hadoop-$USER and stores 
the job configuration there. In multi threaded test execution this can lead to 
race conditions of tests trying to write to the same file at the same time.

To reproduce in a single threaded setting just create /tmp/hadoop-$USER on your 
machine, issue a "chmod 000 /tmp/hadoop-$USER" and try to re-run the test suite.

Setting the option "mapred.local.dir" to point to a test specific location 
avoids part of the issue. In many of our tests this can easily be added to the 
configuration object we hand over to the job anyway (see updated patch for an 
idea of what the changes for core might look like).

There are issues left with some tests executing jobs via the command line 
interface and some data still being stored in the "/tmp/hadoop-$USER" 
directory. Will look into that later this week.

                
> Make Mahout's tests run in parallel
> -----------------------------------
>
>                 Key: MAHOUT-916
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-916
>             Project: Mahout
>          Issue Type: Improvement
>          Components: build
>            Reporter: Grant Ingersoll
>            Assignee: Isabel Drost
>            Priority: Minor
>              Labels: MAHOUT_INTRO_CONTRIBUTE
>         Attachments: MAHOUT-916.patch, MAHOUT-916.patch, MAHOUT-916.patch
>
>
> Maven now supports parallel execution of tests.  We should hook this in to 
> Mahout.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to