[jira] [Work logged] (HDFS-16598) All datanodes [DatanodeInfoWithStorage[127.0.0.1:57448,DS-1b5f7e33-a2bf-4edc-9122-a74c995a99f5,DISK]] are bad. Aborting...

ASF GitHub Bot (Jira) Sat, 11 Jun 2022 07:30:06 -0700


     [ 
https://issues.apache.org/jira/browse/HDFS-16598?focusedWorklogId=780512&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-780512
 ]


ASF GitHub Bot logged work on HDFS-16598:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 11/Jun/22 14:29
            Start Date: 11/Jun/22 14:29
    Worklog Time Spent: 10m 
      Work Description: ZanderXu commented on PR #4366:
URL: https://github.com/apache/hadoop/pull/4366#issuecomment-1152938046

   > Sure. It is not necessary to check GS when acquire BP/VOLUME lock which is 
totally not related with GS IMO. Thanks.
   
   @Hexiaoqiao @MingXiangLi @slfan1989 I have update the patch, please help me 
review it. Thanks 
   
   About this jira, please refer to UT 
`org.apache.hadoop.hdfs.TestClientProtocolForPipelineRecovery.testPipelineRecoveryOnRestartFailure`.
   
   Tips: this PR is blocked by 
[HDFS-16600](https://github.com/apache/hadoop/pull/4367). After it is merged 
into trunk, I will submit a commit to fix the new lock in HDFS-16600.




Issue Time Tracking
-------------------

    Worklog Id:     (was: 780512)
    Time Spent: 2h 40m  (was: 2.5h)

> All datanodes 
> [DatanodeInfoWithStorage[127.0.0.1:57448,DS-1b5f7e33-a2bf-4edc-9122-a74c995a99f5,DISK]]
>  are bad. Aborting...
> --------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HDFS-16598
>                 URL: https://issues.apache.org/jira/browse/HDFS-16598
>             Project: Hadoop HDFS
>          Issue Type: Bug
>            Reporter: ZanderXu
>            Assignee: ZanderXu
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 2h 40m
>  Remaining Estimate: 0h
>
> org.apache.hadoop.hdfs.testPipelineRecoveryOnRestartFailure failed with the 
> stack like:
> {code:java}
> java.io.IOException: All datanodes 
> [DatanodeInfoWithStorage[127.0.0.1:57448,DS-1b5f7e33-a2bf-4edc-9122-a74c995a99f5,DISK]]
>  are bad. Aborting...
>       at 
> org.apache.hadoop.hdfs.DataStreamer.handleBadDatanode(DataStreamer.java:1667)
>       at 
> org.apache.hadoop.hdfs.DataStreamer.setupPipelineInternal(DataStreamer.java:1601)
>       at 
> org.apache.hadoop.hdfs.DataStreamer.setupPipelineForAppendOrRecovery(DataStreamer.java:1587)
>       at 
> org.apache.hadoop.hdfs.DataStreamer.processDatanodeOrExternalError(DataStreamer.java:1371)
>       at org.apache.hadoop.hdfs.DataStreamer.run(DataStreamer.java:674)
> {code}
> After tracing the root cause, this bug was introduced by 
> [HDFS-16534|https://issues.apache.org/jira/browse/HDFS-16534]. Because the 
> block GS of client may be smaller than DN when pipeline recovery failed.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

[jira] [Work logged] (HDFS-16598) All datanodes [DatanodeInfoWithStorage[127.0.0.1:57448,DS-1b5f7e33-a2bf-4edc-9122-a74c995a99f5,DISK]] are bad. Aborting...

Reply via email to