Cheolsoo Park created PIG-3835:
----------------------------------

             Summary: Optimize union followed by store
                 Key: PIG-3835
                 URL: https://issues.apache.org/jira/browse/PIG-3835
             Project: Pig
          Issue Type: Sub-task
          Components: tez
    Affects Versions: tez-branch
            Reporter: Cheolsoo Park
             Fix For: tez-branch


PIG-3742 implements union using VertexGroup. Currently, union is a blocking 
operator meaning that a new vertex is added for its succeeding operators.

But if there is only one store in the succeeding vertex, MROutput could be 
directly attached to VertexGroup instead of adding a new vertex for it. Then, 
each union source vertex will write directly to the destination, and therefore, 
it will be faster.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to