[ https://issues.apache.org/jira/browse/PIG-4657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14696149#comment-14696149 ]
Daniel Dai commented on PIG-4657:
---------------------------------

+1. Do you have any performance numbers with/without the patch for group/distinct?

> [Pig on Tez] Optimize GroupBy and Distinct key comparison
> ---------------------------------------------------------
>
>                 Key: PIG-4657
>                 URL: https://issues.apache.org/jira/browse/PIG-4657
>             Project: Pig
>          Issue Type: Sub-task
>            Reporter: Rohini Palaniswamy
>            Assignee: Rohini Palaniswamy
>             Fix For: 0.16.0
>
>         Attachments: PIG-4657-1.patch
>
>
> While the bytes comparator cannot be used for joins until TEZ-2715 is
> available, it can be used for group by and distinct if they have only one
> Tez input. If there is more than one input due to union optimization
> (OrderedGroupedMergedKVInput), the full comparator still has to be used, as
> OrderedGroupedMergedKVInput uses the comparator to merge the two underlying
> inputs.
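
The selection rule described in the issue can be sketched roughly as follows. This is an illustration only, not the code in PIG-4657-1.patch: the class, enum, and method names are hypothetical, and `compareBytes` is a generic unsigned lexicographic comparison included just to show what comparing serialized keys without deserializing them might look like.

```java
// Hypothetical sketch; none of these names come from Pig or Tez.
final class ComparatorChoice {

    enum Operation { JOIN, GROUP_BY, DISTINCT }

    enum KeyComparator { BYTES, FULL }

    // Rule as stated in the issue description:
    //  - joins keep the full comparator (until TEZ-2715 is available);
    //  - group by / distinct may use the bytes comparator only when
    //    there is exactly one Tez input;
    //  - with more than one input (union optimization), the full
    //    comparator is still required.
    static KeyComparator choose(Operation op, int tezInputCount) {
        if (op == Operation.JOIN) {
            return KeyComparator.FULL;
        }
        return tezInputCount == 1 ? KeyComparator.BYTES : KeyComparator.FULL;
    }

    // Generic byte-wise comparison of two serialized keys: unsigned,
    // lexicographic, shorter array first on a common prefix.
    // An assumption for illustration, not Pig's actual comparator.
    static int compareBytes(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            int x = a[i] & 0xff;
            int y = b[i] & 0xff;
            if (x != y) {
                return x - y;
            }
        }
        return a.length - b.length;
    }
}
```

For example, `ComparatorChoice.choose(ComparatorChoice.Operation.DISTINCT, 2)` falls back to the full comparator, which matches the multi-input case in the description.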