zengqiuyang created SPARK-9629:
----------------------------------

             Summary:  Client session timed out, have not heard from server in
                 Key: SPARK-9629
                 URL: https://issues.apache.org/jira/browse/SPARK-9629
             Project: Spark
          Issue Type: Bug
          Components: Deploy
    Affects Versions: 1.4.1, 1.4.0
         Environment: spark1.4.1    ./make-distribution.sh --tgz 
-Dhadoop.version=2.5.2 -Dyarn.version=2.5.2 -Phive -Phive-thriftserver  -Pyarn  
zookeeper-3.4.6.tar.gz 
            Reporter: zengqiuyang
            Priority: Critical


the spark  HA   running  every few days , then " Client session timed out" 
appear。
show reconnect but not do it,  and master shutting down.
logs:
 15/08/05 05:32:57 INFO zookeeper.ClientCnxn: Client session timed out, have 
not heard from server in 37753ms for sessionid 0x34ee39684b70005, closing 
socket connection and attempting reconnect
15/08/05 05:32:57 INFO state.ConnectionStateManager: State change: SUSPENDED
15/08/05 05:32:57 WARN state.ConnectionStateManager: There are no 
ConnectionStateListeners registered.
15/08/05 05:32:57 INFO zookeeper.ClientCnxn: Opening socket connection to 
server h5/192.168.0.18:2181. Will not attempt to authenticate using SASL 
(unknown error)
15/08/05 05:32:57 INFO zookeeper.ClientCnxn: Socket connection established to 
h5/192.168.0.18:2181, initiating session
15/08/05 05:32:57 INFO zookeeper.ClientCnxn: Session establishment complete on 
server h5/192.168.0.18:2181, sessionid = 0x34ee39684b70005, negotiated timeout 
= 40000
15/08/05 05:32:57 INFO state.ConnectionStateManager: State change: RECONNECTED
15/08/05 05:32:57 WARN state.ConnectionStateManager: There are no 
ConnectionStateListeners registered.
15/08/05 05:32:58 INFO zookeeper.ClientCnxn: Client session timed out, have not 
heard from server in 37753ms for sessionid 0x34ee39684b70006, closing socket 
connection and attempting reconnect
15/08/05 05:32:58 INFO state.ConnectionStateManager: State change: SUSPENDED
15/08/05 05:32:58 INFO master.ZooKeeperLeaderElectionAgent: We have lost 
leadership
15/08/05 05:32:58 ERROR master.Master: Leadership has been revoked -- master 
shutting down.
15/08/05 05:32:58 INFO util.Utils: Shutdown hook called



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to