[ 
https://issues.apache.org/jira/browse/PIG-3775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rohini Palaniswamy reassigned PIG-3775:
---------------------------------------

    Assignee: Rohini Palaniswamy

> Use unsorted shuffle in Union, Orderby, Skewed Join to improve performance in 
> Tez
> ---------------------------------------------------------------------------------
>
>                 Key: PIG-3775
>                 URL: https://issues.apache.org/jira/browse/PIG-3775
>             Project: Pig
>          Issue Type: Sub-task
>          Components: tez
>            Reporter: Rohini Palaniswamy
>            Assignee: Rohini Palaniswamy
>              Labels: GSOC2014
>             Fix For: tez-branch
>
>
> When implementing Pig union, we need to gather data from two or more upstream 
> vertexes without sorting. The vertex itself might consists of several tasks. 
> Same can be done for the partitioner vertex in orderby and skewed join 
> instead of 1-1 edge for some cases of parallelism.
> TEZ-661 has been created to add custom output and input for that in Tez. It 
> is currently not in the Tez team priorities but it is important for us as it 
> will give good performance gains. We can write the custom input/output and 
> contribute it to Tez and make the corresponding changes in Pig. 
> This is a candidate project for Google summer of code 2014. More information 
> about the program can be found at 
> https://cwiki.apache.org/confluence/display/PIG/GSoc2014



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to