[ 
https://issues.apache.org/jira/browse/TEZ-2172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14648007#comment-14648007
 ] 

Saikat commented on TEZ-2172:
-----------------------------

rebased the patch after TEZ-2613 has been merged.

> FetcherOrderedGrouped using List to store InputAttemptIdentifier can lead to 
> some inefficiency during remove() operation
> ------------------------------------------------------------------------------------------------------------------------
>
>                 Key: TEZ-2172
>                 URL: https://issues.apache.org/jira/browse/TEZ-2172
>             Project: Apache Tez
>          Issue Type: Improvement
>            Reporter: Rajesh Balamohan
>            Assignee: Saikat
>         Attachments: TEZ-2172.1.patch, TEZ-2172.patch
>
>
> As part of fixing TEZ-2001, FetcherOrderedGrouped stores 
> InputAttemptIdentifier in List.  This can lead to some inefficiency - since 
> the size of this list can be ~30, and remove() calls can be expensive. 
> Option 1:  by using the spillId in the hashCode - or a wrapping structure for 
> just this. However, SpillId can not be added to the hashCode as it would 
> break ShuffleScheduler shuffleInfoEventsMap. 
> Option 2: consider using Map with an identifier. 
> Need to consider other options as well. Creating this jira as a placeholder 
> to fix this issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to