-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/16926/
-----------------------------------------------------------

Review request for pig, Cheolsoo Park, Daniel Dai, Mark Wagner, and Rohini 
Palaniswamy.


Bugs: PIG-3557
    https://issues.apache.org/jira/browse/PIG-3557


Repository: pig-git


Description
-------

Implement LIMIT optimizations in Tez - 
https://issues.apache.org/jira/browse/PIG-3557

1. If the previous Tez vertex has a requestedParallelism of 1 and does not 
start with a POLoad, we don't need to add a second LIMIT vertex (since the 
LIMIT we put at the end of the previous vertex is good enough).

2. If we are not in the "limited order by" case, instead of the regular 
shuffle-sort edge we can use an unsorted shuffle edge.
--This code is added but commented out, since it depends on TEZ-661.

3. I manually verified that the LimitOptimizer can push LIMIT to the 
InputHandler in certain cases (no code changes).


Diffs
-----

  src/org/apache/pig/backend/hadoop/executionengine/tez/TezCompiler.java 
5c6a242 
  test/org/apache/pig/test/data/GoldenFiles/TEZC7.gld 9cf5baf 

Diff: https://reviews.apache.org/r/16926/diff/


Testing
-------

TestTezCompiler unit test updated
ant test-tez passes
e2e tests - same results as in clean tez branch


Thanks,

Alex Bain

Reply via email to