[ https://issues.apache.org/jira/browse/HDFS-11914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Yongjun Zhang updated HDFS-11914: --------------------------------- Labels: supportability (was: ) > Add more diagnosis info for fsimage transfer failure. > ----------------------------------------------------- > > Key: HDFS-11914 > URL: https://issues.apache.org/jira/browse/HDFS-11914 > Project: Hadoop HDFS > Issue Type: Bug > Reporter: Yongjun Zhang > Assignee: Yongjun Zhang > Labels: supportability > Attachments: HDFS-11914.001.patch, HDFS-11914.002.patch > > > Hit a fsimage download problem: > Client tries to download fsimage, and got: > WARN org.apache.hadoop.security.UserGroupInformation: > PriviledgedActionException as:hdfs (auth:SIMPLE) cause:java.io.IOException: > File http://x.y.z:50070/imagetransfer?getimage=1&txid=latest received length > xyz is not of the advertised size abc. > Basically client does not get enough fsimage data and finished prematurely > without any exception thrown, until it finds the size of data received is > smaller than expected. The client then closed the conenction to NN, that > caused NN to report > INFO org.apache.hadoop.hdfs.server.namenode.TransferFsImage: Connection > closed by client > This jira is to add some more information in logs to help debugging the > sitaution. Specifically, report the stack trace when the connection is > closed. And how much data has been sent at that point. etc. > -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org