[ https://issues.apache.org/jira/browse/TEZ-3769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rajesh Balamohan updated TEZ-3769: ---------------------------------- Attachment: TEZ-3769.3.patch Uploading .3 with review comments addressed. Agreed that unordered writer needs refactoring to reduce the complexity. > Unordered: Fix wrong stats being sent out in the last event, when final merge > is disabled > ----------------------------------------------------------------------------------------- > > Key: TEZ-3769 > URL: https://issues.apache.org/jira/browse/TEZ-3769 > Project: Apache Tez > Issue Type: Bug > Reporter: Rajesh Balamohan > Attachments: TEZ-3769.1.patch, TEZ-3769.2.patch, TEZ-3769.3.patch > > > When final merge is disabled (without pipelining), wrong stats was sent out > in the last event. > It was based on {{numRecordsPerPartition}} which contains the overall > partition data. It should be ideally be based on the spill result and its > buffers. > Also, {{finalSpill}} was unncessarily sending events when no data was present > (i.e, when currentBuffer didn't have any data). This can be optimized to > reduce the number of events being sent across. -- This message was sent by Atlassian JIRA (v6.4.14#64029)