[ 
https://issues.apache.org/jira/browse/HDDS-343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Elek, Marton updated HDDS-343:
------------------------------
    Attachment: HDDS-343.001.patch

> Containers are stuck in closing state in scm
> --------------------------------------------
>
>                 Key: HDDS-343
>                 URL: https://issues.apache.org/jira/browse/HDDS-343
>             Project: Hadoop Distributed Data Store
>          Issue Type: Bug
>          Components: SCM
>            Reporter: Elek, Marton
>            Assignee: Elek, Marton
>            Priority: Blocker
>             Fix For: 0.2.1
>
>         Attachments: HDDS-343.001.patch
>
>
> Containers could not been closed currently.
> The datanode is closing the containers and sending the CLOSED state in the 
> container report but SCM doesn't register that the state is closed and 
> sending the close command again and again.
> I think the ContainerMapping.processContainerReport should be improved.
> {code}
> scm_1           | --> RPC message request: SCMHeartbeatRequestProto from 
> 172.25.0.2:33912
> scm_1           | datanodeDetails {
> scm_1           |   uuid: "9c8f80bd-9424-4d74-99ef-a2bd58e66d7f"
> scm_1           |   ipAddress: "172.25.0.2"
> scm_1           |   hostName: "365fd1f44f0b"
> scm_1           |   ports {
> scm_1           |     name: "STANDALONE"
> scm_1           |     value: 9859
> scm_1           |   }
> scm_1           |   ports {
> scm_1           |     name: "RATIS"
> scm_1           |     value: 9858
> scm_1           |   }
> scm_1           |   ports {
> scm_1           |     name: "REST"
> scm_1           |     value: 9880
> scm_1           |   }
> scm_1           | }
> scm_1           | nodeReport {
> scm_1           |   storageReport {
> scm_1           |     storageUuid: "DS-61e76107-85c5-437a-95a7-aeb8b3e7827f"
> scm_1           |     storageLocation: "/tmp/hadoop-hadoop/dfs/data"
> scm_1           |     capacity: 491630870528
> scm_1           |     scmUsed: 2708828160
> scm_1           |     remaining: 24263614464
> scm_1           |     storageType: DISK
> scm_1           |     failed: false
> scm_1           |   }
> scm_1           | }
> scm_1           | containerReport {
> scm_1           |   reports {
> scm_1           |     containerID: 1
> scm_1           |     used: 1061158912
> scm_1           |     readCount: 0
> scm_1           |     writeCount: 64
> scm_1           |     readBytes: 0
> scm_1           |     writeBytes: 1061158912
> scm_1           |     state: CLOSED
> scm_1           |   }
> scm_1           |   reports {
> scm_1           |     containerID: 2
> scm_1           |     used: 1048576000
> scm_1           |     readCount: 0
> scm_1           |     writeCount: 64
> scm_1           |     readBytes: 0
> scm_1           |     writeBytes: 1048576000
> scm_1           |     state: CLOSED
> scm_1           |   }
> scm_1           |   reports {
> scm_1           |     containerID: 3
> scm_1           |     used: 511705088
> scm_1           |     readCount: 0
> scm_1           |     writeCount: 32
> scm_1           |     readBytes: 0
> scm_1           |     writeBytes: 511705088
> scm_1           |     state: OPEN
> scm_1           |   }
> scm_1           | }
> scm_1           | commandStatusReport {
> scm_1           | }
> scm_1           | containerActions {
> scm_1           |   containerActions {
> scm_1           |     containerID: 1
> scm_1           |     action: CLOSE
> scm_1           |     reason: CONTAINER_FULL
> scm_1           |   }
> scm_1           |   containerActions {
> scm_1           |     containerID: 2
> scm_1           |     action: CLOSE
> scm_1           |     reason: CONTAINER_FULL
> scm_1           |   }
> scm_1           | }
> scm_1           | 
> scm_1           | --> RPC message response: SCMHeartbeatRequestProto to 
> 172.25.0.2:33912
> scm_1           | datanodeUUID: "9c8f80bd-9424-4d74-99ef-a2bd58e66d7f"
> scm_1           | 
> scm_1           | 2018-08-08 16:22:51 INFO  CloseContainerEventHandler:56 - 
> Close container Event triggered for container : 1
> scm_1           | 2018-08-08 16:22:51 INFO  CloseContainerEventHandler:105 - 
> container with id : 1 is in CLOSING state and need not be closed.
> scm_1           | 2018-08-08 16:22:51 INFO  CloseContainerEventHandler:56 - 
> Close container Event triggered for container : 2
> scm_1           | 2018-08-08 16:22:51 INFO  CloseContainerEventHandler:105 - 
> container with id : 2 is in CLOSING state and need not be closed.
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

Reply via email to