[ 
https://issues.apache.org/jira/browse/AMBARI-22257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16208010#comment-16208010
 ] 

Jayush Luniya commented on AMBARI-22257:
----------------------------------------

+1

> Metrics collector fails to stop after Datanode is stopped in distributed mode
> -----------------------------------------------------------------------------
>
>                 Key: AMBARI-22257
>                 URL: https://issues.apache.org/jira/browse/AMBARI-22257
>             Project: Ambari
>          Issue Type: Bug
>          Components: ambari-metrics
>    Affects Versions: 2.0.0
>            Reporter: Siddharth Wagle
>            Assignee: Siddharth Wagle
>            Priority: Critical
>             Fix For: 2.6.0
>
>         Attachments: AMBARI-22257.patch
>
>
> AMS collector stop failed due to timeout at the ams-hbase regionserver stop. 
> The log contains lots of exceptions related to DN connection issues during 
> the stop. The problem here is that DNs were stopped before the collector. 
> {code}
> 2017-10-17 14:29:10,689 ERROR [Thread-274] hdfs.DFSClient: Failed to close 
> inode 17762
> org.apache.hadoop.ipc.RemoteException(java.io.IOException): File 
> /user/ams/hbase/WALs/ctr-e134-1499953498516-230429-01-000007.hwx.site,61320,1508248489809/ctr-e134-1499953498516-230429-01-000007.hwx.site%2C61320%2C1508248489809.default.1508250548392
>  could only be replicated to 0 nodes instead of minReplication (=1).  There 
> are 3 datanode(s) running and 3 node(s) are excluded in this operation.
>         at 
> org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.chooseTarget4NewBlock(BlockManager.java:1719)
> {code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to