[ 
https://issues.apache.org/jira/browse/HDFS-7034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14129267#comment-14129267
 ] 

Jing Zhao commented on HDFS-7034:
---------------------------------

Several issues appear to make the current TestStorageMover flaky:
# A block report that contains only block deletions is currently sent once 
every (100 * heartbeat_interval). Thus, before we verify the files' storage 
policies, we should manually trigger a block report from all the datanodes.
# Currently {{testHotWarmColdDirs}} contains a DISK-only DataNode, and the 
total number of DNs is 6. For a hot file, if its initial 3 hot replicas are 
all on DNs with both DISK and ARCHIVE storage, we can never make it "pure cold".
# For {{testNoSpaceDisk}}, the creation of the mover.id file may fail because 
of the limited DISK space. Maybe we can add a configuration property that 
disables the mover.id file creation, for use in tests only.
# The current delay for a DN with an error is 10s. In our test, since we have 
only 6 DNs and the block deletion report may be delayed, a DN can be marked 
with an error if its original replica was already removed but the DN itself 
was still selected for block replacement. In this scenario, the long delay may 
cause the migration of other blocks to fail. Thus we should use a smaller delay 
value in our tests.
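
To make the timing in the first item concrete, here is a minimal sketch of the arithmetic. The class and method names are hypothetical and are not Hadoop APIs; the only fact taken from the comment above is the (100 * heartbeat_interval) multiplier, and the heartbeat values used in the note below are example inputs.

```java
/** Hypothetical helper for illustration only; not part of Hadoop. */
final class DeletionReportTiming {

    /** From the comment above: one deletion-only report per 100 heartbeats. */
    static final long HEARTBEATS_PER_DELETION_REPORT = 100L;

    private DeletionReportTiming() {
    }

    /** Returns the interval between deletion-only block reports, in milliseconds. */
    static long deletionReportIntervalMillis(long heartbeatIntervalMillis) {
        if (heartbeatIntervalMillis <= 0) {
            throw new IllegalArgumentException("heartbeat interval must be positive");
        }
        return Math.multiplyExact(HEARTBEATS_PER_DELETION_REPORT, heartbeatIntervalMillis);
    }

    /**
     * Returns true only if waiting waitMillis is long enough that a deletion
     * must have been reported without a manual trigger, that is, the wait
     * covers at least one full deletion report interval.
     */
    static boolean deletionGuaranteedReportedWithin(long heartbeatIntervalMillis,
                                                    long waitMillis) {
        return waitMillis >= deletionReportIntervalMillis(heartbeatIntervalMillis);
    }
}
```

For example, with a 3-second heartbeat the interval works out to 300 seconds, and even with a 1-second heartbeat a 10-second wait is not enough, which is why the test should trigger the block report explicitly instead of waiting for it.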

I will upload a patch with the above fixes later.

> Archival Storage: Fix TestBlockPlacement and TestStorageMover
> -------------------------------------------------------------
>
>                 Key: HDFS-7034
>                 URL: https://issues.apache.org/jira/browse/HDFS-7034
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: balancer, namenode
>            Reporter: Jing Zhao
>            Assignee: Jing Zhao
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
