-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/7053/#review11749
-----------------------------------------------------------


General comments:
1) It would be good to rename file.fork.factor to fork.factor.conf.file and 
fork.factor to fork.factor.group for easy understanding of the intention of the 
parameters.

2)      [exec] Results so far, PASSED: 1 FAILED: 0 SKIPPED: 0 ABORTED: 0 FAILED 
DEPENDENCY: 0
 - It would be good to have another context[] before Results, which puts the 
forkname like [nightly.conf-CoGroup] or [nightly.conf] if it is a parallel 
execution. Without it reading results is difficult.

3) dbg and dumpHash
  There are lot of commented dbg and dumpHash calls which need to be 
uncommented manually for debug purposes. Could we have these printed based on a 
-debug option or -De2e.debug=true system property instead of having commented 
code and rename dumpHash to dbgDumpHash. 

Issues:
  - Running test in local mode hangs and does not work . Needs to be fixed. 
  - When running only two tests the time taken was twice sequential mode 
because of the mkdir/rmr. That could be improved. More details in the code 
section.

Performance:
   - Performance improvement is really good when I tested it. Test for mapred 
mode was down to 2.5 hrs from 9 hrs with benchmark results already cached in 
our setup.  




http://svn.apache.org/repos/asf/pig/trunk/test/e2e/harness/TestDriver.pm
<https://reviews.apache.org/r/7053/#comment25316>

    Context of the fork here



http://svn.apache.org/repos/asf/pig/trunk/test/e2e/harness/TestDriver.pm
<https://reviews.apache.org/r/7053/#comment25315>

    Context of the fork here.



http://svn.apache.org/repos/asf/pig/trunk/test/e2e/pig/drivers/TestDriverPig.pm
<https://reviews.apache.org/r/7053/#comment25314>

    This was taking 40 secs for each group in parallel mode as opposed to 6 
secs in sequential mode and I am suspecting it is due to lot of forking. So 
running two tests with tests.to.run option took double the time of sequential 
(5 min to 12 min). That cost is immaterial if we are running the whole suite. 
But reducing it helps as most of the time you will be running a subset for 
testing. 
    
     Would be good to optimize by combining the two hdfs mkdirs into one 
command and use perl mkdir instead of IPC::Run (which internally does fork and 
1 sec sleep).  


- Rohini Palaniswamy


On Sept. 12, 2012, 8:31 a.m., Ivan Veselovsky wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/7053/
> -----------------------------------------------------------
> 
> (Updated Sept. 12, 2012, 8:31 a.m.)
> 
> 
> Review request for pig and Rohini Palaniswamy.
> 
> 
> Description
> -------
> 
> Please see https://issues.apache.org/jira/browse/PIG-2898 for details.
> 
> 
> This addresses bug https://issues.apache.org/jira/browse/PIG-2898.
>     
> https://issues.apache.org/jira/browse/https://issues.apache.org/jira/browse/PIG-2898
> 
> 
> Diffs
> -----
> 
>   http://svn.apache.org/repos/asf/pig/trunk/test/e2e/harness/TestDriver.pm 
> 1383357 
>   http://svn.apache.org/repos/asf/pig/trunk/test/e2e/harness/test_harness.pl 
> 1383357 
>   http://svn.apache.org/repos/asf/pig/trunk/test/e2e/pig/build.xml 1383357 
>   
> http://svn.apache.org/repos/asf/pig/trunk/test/e2e/pig/deployers/ExistingClusterDeployer.pm
>  1383357 
>   
> http://svn.apache.org/repos/asf/pig/trunk/test/e2e/pig/drivers/TestDriverPig.pm
>  1383357 
> 
> Diff: https://reviews.apache.org/r/7053/diff/
> 
> 
> Testing
> -------
> 
> Tested e2e tests execution in both sequential (default) and parallel modes.
> The test run duration measurement data (in dependency on the fork parameters) 
> will be available soon.
> 
> 
> Thanks,
> 
> Ivan Veselovsky
> 
>

Reply via email to