[ https://issues.apache.org/jira/browse/PIG-3849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jeff Zhang updated PIG-3849: ---------------------------- Summary: Integrate YSmart into Pig on tez (was: Optimize group by followed by join on the same key) > Integrate YSmart into Pig on tez > -------------------------------- > > Key: PIG-3849 > URL: https://issues.apache.org/jira/browse/PIG-3849 > Project: Pig > Issue Type: Sub-task > Components: tez > Reporter: Rohini Palaniswamy > > e.g Group by followed by join on the same key > This can be done in one vertex with multiple inputs instead of having an > extra vertex to do the join. i.e Currently Vertex 1 (load relation1) -> > Vertex 2 (group by) -> Vertex 4 (join) <- Vertex 3 (load relation 2). This > could be changed to Vertex 1 (load relation1) -> Vertex 2 (group by and join) > <- Vertex 3 (load relation 2) > And idea of this kind of optimization from YSmart that hive already integrate > it. Now pig has already integrate tez, so it would be natural to integrate > YSmart into pig on tez. -- This message was sent by Atlassian JIRA (v6.3.4#6332)