[jira] Commented: (HADOOP-5638) More improvement on block placement performance

Tsz Wo (Nicholas), SZE (JIRA) Wed, 08 Apr 2009 16:00:35 -0700

    [ 
https://issues.apache.org/jira/browse/HADOOP-5638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12697283#action_12697283
 ]


Tsz Wo (Nicholas), SZE commented on HADOOP-5638:
------------------------------------------------

The type of excludedNodes in countNumOfAvailableNodes(..) should be 
Collection<Node>.   Everything else looks good.

> More improvement on block placement performance
> -----------------------------------------------
>
>                 Key: HADOOP-5638
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5638
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: dfs
>            Reporter: Hairong Kuang
>            Assignee: Hairong Kuang
>             Fix For: 0.21.0
>
>         Attachments: excludedList.patch
>
>
> Block placement algorithm currently has an excluded node list, which contains 
> all datanodes that have been visited. This list is implemented as an array 
> list, whose cost of inserting is O(1) but the cost of query "contains" is O( 
> n ), where n is the number of datanodes. This makes the cost of block 
> placement to be O(n*n) when a cluster is full.
> I propose to change the data structure of the excluded node list as a 
> HashMap. So in average, the cost of insertion is O(1) and the cost of query 
> is O(1). This makes the block placement algorithm to be O( n ) in average.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HADOOP-5638) More improvement on block placement performance

Reply via email to