[ https://issues.apache.org/jira/browse/TEZ-2172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14648007#comment-14648007 ]
Saikat commented on TEZ-2172: ----------------------------- rebased the patch after TEZ-2613 has been merged. > FetcherOrderedGrouped using List to store InputAttemptIdentifier can lead to > some inefficiency during remove() operation > ------------------------------------------------------------------------------------------------------------------------ > > Key: TEZ-2172 > URL: https://issues.apache.org/jira/browse/TEZ-2172 > Project: Apache Tez > Issue Type: Improvement > Reporter: Rajesh Balamohan > Assignee: Saikat > Attachments: TEZ-2172.1.patch, TEZ-2172.patch > > > As part of fixing TEZ-2001, FetcherOrderedGrouped stores > InputAttemptIdentifier in List. This can lead to some inefficiency - since > the size of this list can be ~30, and remove() calls can be expensive. > Option 1: by using the spillId in the hashCode - or a wrapping structure for > just this. However, SpillId can not be added to the hashCode as it would > break ShuffleScheduler shuffleInfoEventsMap. > Option 2: consider using Map with an identifier. > Need to consider other options as well. Creating this jira as a placeholder > to fix this issue. -- This message was sent by Atlassian JIRA (v6.3.4#6332)