[ 
https://issues.apache.org/jira/browse/TEZ-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prakash Ramachandran updated TEZ-1733:
--------------------------------------
    Attachment: TEZ-1733.3.patch

apologies for the confusing naming of the patch. renamed it. 
[~gopalv]/[~rajesh.balamohan]/[~sseth] can you have a look. 
also created a ticket to track the package naming issue. TEZ-1739

> TezMerger should sort FileChunks on decompressed size
> -----------------------------------------------------
>
>                 Key: TEZ-1733
>                 URL: https://issues.apache.org/jira/browse/TEZ-1733
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.5.2
>            Reporter: Gopal V
>            Assignee: Prakash Ramachandran
>            Priority: Critical
>         Attachments: TEZ-1733.1.patch, TEZ-1733.1.patch, TEZ-1733.2.patch, 
> TEZ-1733.3.patch
>
>
>  MAPREDUCE-3685 fixed the Merger sort order for file chunks to use the 
> decompressed size, to cut-down on CPU and IO costs.
> TezMerger needs an equivalent sorted TreeSet which sorts by the data with-in 
> sizes rather than actual file sizes.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to