[ 
https://issues.apache.org/jira/browse/PIG-2898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ivan A. Veselovsky updated PIG-2898:
------------------------------------

    Patch Info: Patch Available

We provided parallelized mode of the e2e tests execution using 
Parallel::ForkManager.
Two parameters affect the behavior: 
1) file.fork.factor -- max number of subprocesses when running test 
configuration files (.conf);
2) fork.factor -- max number of subprocesses when running tests within one 
.conf file.
Total max number of subprocesses canot exceed the product of the 2 values.
Value <= 1 mean no paralellizing.
Example: ant -Dfork.factor=3 -Dfile.fork.factor=3 ... test-e2e

The attached patch is to be applied to 
http://svn.apache.org/repos/asf/pig/branches/branch-0.9/ branch.

The patch testing procedure gives the following results for the patch:
     [exec] -1 overall.  
     [exec] 
     [exec]     +1 @author.  The patch does not contain any @author tags.
     [exec] 
     [exec]     +1 tests included.  The patch appears to include 24 new or 
modified tests.
     [exec] 
     [exec]     -1 javadoc.  The javadoc tool appears to have generated 1 
warning messages.
     [exec] 
     [exec]     +1 javac.  The applied patch does not increase the total number 
of javac compiler warnings.
     [exec] 
     [exec]     +1 findbugs.  The patch does not introduce any new Findbugs 
warnings.
     [exec] 
     [exec]     +1 release audit.  The applied patch does not increase the 
total number of release audit warnings.
     [exec] 
                
> Multithreaded execution of e2e tests
> ------------------------------------
>
>                 Key: PIG-2898
>                 URL: https://issues.apache.org/jira/browse/PIG-2898
>             Project: Pig
>          Issue Type: Improvement
>          Components: e2e harness
>            Reporter: Andrey Klochkov
>            Assignee: Andrey Klochkov
>
> Today it takes ~19 hours to run the full set of e2e tests in mapred mode. The 
> bottleneck here is the client side, and per our observations it can help a 
> lot if the e2e harness would be able to run tests in parallel threads.
> We prototyped changes in e2e harness allowing to run tests in a configurable 
> number of threads. Preliminary results show more than 6x reduction in 
> execution time when using a small 3-nodes M/R cluster with modest 
> configuration. Going to share a patch shortly.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to