> On Feb. 17, 2014, 5:30 a.m., Rohini Palaniswamy wrote:
> > branches/tez/src/org/apache/pig/backend/hadoop/executionengine/tez/MultiQueryOptimizerTez.java,
> >  line 52
> > <https://reviews.apache.org/r/17878/diff/2/?file=486607#file486607line52>
> >
> >     Can we create a jira to do this later? Currently nested splits work.

Will do. Actually it is very easy to just not to check that in 
MultiqueryOptimizer, but need to think through if there are any complications.


> On Feb. 17, 2014, 5:30 a.m., Rohini Palaniswamy wrote:
> > branches/tez/src/org/apache/pig/impl/builtin/ReadScalarsTez.java, lines 
> > 48-49
> > <https://reviews.apache.org/r/17878/diff/1/?file=481154#file481154line48>
> >
> >     1) Need to handle case where scalar is null here. 
> >     
> >     2) Need to handle case where scalar has more than one record and throw 
> > error. It would be better to do that while writing scalar itself instead of 
> > here as we have better control now. This would avoid cases where wrong 
> > usage of scalars can bring down namenode.

Good catch, I will fix it.


> On Feb. 17, 2014, 5:30 a.m., Rohini Palaniswamy wrote:
> > branches/tez/src/org/apache/pig/backend/hadoop/executionengine/tez/TezCompiler.java,
> >  line 2110
> > <https://reviews.apache.org/r/17878/diff/2/?file=486610#file486610line2110>
> >
> >     You will need these, else there will be hanging operators in the plan

I also changed compile(PhysicalOperator op), which now handles split 
differently, splitter will only be added once


> On Feb. 17, 2014, 5:30 a.m., Rohini Palaniswamy wrote:
> > branches/tez/src/org/apache/pig/backend/hadoop/executionengine/tez/TezLauncher.java,
> >  line 172
> > <https://reviews.apache.org/r/17878/diff/2/?file=486612#file486612line172>
> >
> >     Might be easier if this optimizer is the first one to be called before 
> > Combiner and secondary key optimizer. Just a thought. If there are no 
> > issues with having it later, then fine.

This is follow MR version. In MR, multiquery and combiner optimization are 
mutually exclusive (multiquery MR cannot have combiner), and we find combiner 
is more desired than multiquery, so we run combiner optimization first. 
Fortunately that is not the case for Tez. But I still want to keep the sequence 
of rules triggered to minimize Tez only issue.


> On Feb. 17, 2014, 5:30 a.m., Rohini Palaniswamy wrote:
> > branches/tez/src/org/apache/pig/backend/hadoop/executionengine/tez/TezCompiler.java,
> >  line 2112
> > <https://reviews.apache.org/r/17878/diff/2/?file=486610#file486610line2112>
> >
> >     You will need this as well, else input will be pointing to wrong 
> > physical operataors.

Checked seems to be Ok. Guess that is also due the change of how I generate the 
first split. Splitter's physical plan is composed with PhysicalPlan.connect, 
which should deal with inputs correctly.


> On Feb. 17, 2014, 5:30 a.m., Rohini Palaniswamy wrote:
> > branches/tez/src/org/apache/pig/backend/hadoop/executionengine/tez/TezCompiler.java,
> >  lines 772-773
> > <https://reviews.apache.org/r/17878/diff/2/?file=486610#file486610line772>
> >
> >     It would be easier to keep this here for curTezOp.getSplitParent() != 
> > null (no multiquery)

Not sure if I understand, but seems there is nothing special for filter for 
splitter, can you be more specific?


- Daniel


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/17878/#review34061
-----------------------------------------------------------


On Feb. 17, 2014, 4:25 a.m., Daniel Dai wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/17878/
> -----------------------------------------------------------
> 
> (Updated Feb. 17, 2014, 4:25 a.m.)
> 
> 
> Review request for pig, Cheolsoo Park and Rohini Palaniswamy.
> 
> 
> Bugs: PIG-3757
>     https://issues.apache.org/jira/browse/PIG-3757
> 
> 
> Repository: pig
> 
> 
> Description
> -------
> 
> See PIG-3757
> 
> 
> Diffs
> -----
> 
>   
> branches/tez/src/org/apache/pig/backend/hadoop/executionengine/physicalLayer/expressionOperators/POUserFunc.java
>  1567297 
>   
> branches/tez/src/org/apache/pig/backend/hadoop/executionengine/physicalLayer/plans/PhyPlanVisitor.java
>  1567297 
>   
> branches/tez/src/org/apache/pig/backend/hadoop/executionengine/physicalLayer/relationalOperators/CombinerPackager.java
>  1567297 
>   
> branches/tez/src/org/apache/pig/backend/hadoop/executionengine/tez/MultiQueryOptimizerTez.java
>  PRE-CREATION 
>   
> branches/tez/src/org/apache/pig/backend/hadoop/executionengine/tez/PigProcessor.java
>  1567297 
>   
> branches/tez/src/org/apache/pig/backend/hadoop/executionengine/tez/SecondaryKeyOptimizerTez.java
>  1567297 
>   
> branches/tez/src/org/apache/pig/backend/hadoop/executionengine/tez/TezCompiler.java
>  1567297 
>   
> branches/tez/src/org/apache/pig/backend/hadoop/executionengine/tez/TezDagBuilder.java
>  1567297 
>   
> branches/tez/src/org/apache/pig/backend/hadoop/executionengine/tez/TezLauncher.java
>  1567297 
>   
> branches/tez/src/org/apache/pig/backend/hadoop/executionengine/tez/TezOperator.java
>  1567297 
>   
> branches/tez/src/org/apache/pig/backend/hadoop/executionengine/tez/util/TezCompilerUtil.java
>  1567297 
>   branches/tez/src/org/apache/pig/impl/builtin/ReadScalarsTez.java 
> PRE-CREATION 
>   branches/tez/src/org/apache/pig/newplan/logical/visitor/ScalarVisitor.java 
> 1567297 
> 
> Diff: https://reviews.apache.org/r/17878/diff/
> 
> 
> Testing
> -------
> 
> 
> Thanks,
> 
> Daniel Dai
> 
>

Reply via email to