[ https://issues.apache.org/jira/browse/HDFS-1806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13016506#comment-13016506 ]
Konstantin Boudnik commented on HDFS-1806: ------------------------------------------ Great analysis, Matt! Thanks for looking into this long-standing issue (I believe there's a JIRA for this already). So, if the second loop is merged with the first one and the poll time is shortened significantly (say, 5 ms?) would it solve the problem? > TestBlockReport.blockReport_08() and _09() are timing-dependent and likely to > fail on fast servers > -------------------------------------------------------------------------------------------------- > > Key: HDFS-1806 > URL: https://issues.apache.org/jira/browse/HDFS-1806 > Project: Hadoop HDFS > Issue Type: Bug > Components: data-node, name-node > Affects Versions: 0.22.0 > Reporter: Matt Foley > Attachments: blockReport_08_failure_log.html > > > Method waitForTempReplica() polls every 100ms during block replication, > attempting to "catch" a datanode in the state of having a TEMPORARY replica. > But examination of a current Hudson test failure log shows that the replica > goes from "start" to "TEMPORARY" to "FINALIZED" in only 50ms, so of course > the poll usually misses it. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira