Eroma created AIRAVATA-1651:
-------------------------------

             Summary: Zookeeper connection lost error; Experiment failed
                 Key: AIRAVATA-1651
                 URL: https://issues.apache.org/jira/browse/AIRAVATA-1651
             Project: Airavata
          Issue Type: Bug
         Environment: http://test-drive.airavata.org/pga/public
            Reporter: Eroma

Two experiments show the same error message in the log. One experiment FAILED at the experiment level with no job status recorded. The other experiment failed, but its job reached COMPLETE. The error occurs randomly; I was unable to reproduce it.

Error messages retrieved from the log:

2015-03-26 09:33:34,693 [main-SendThread(gw127.iu.xsede.org:9181)] INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server gw127.iu.xsede.org/149.165.228.125:9181
...skipping...
org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /gfac-experiments/gfac-node0/SLM-WRF-Stampede_c0697813-a8f4-4d8a-b0f3-6808f8538b18+IDontNeedaNode_a3b6133f-f8af-435d-9b2a-76838db535f6/org.apache.airavata.gfac.gsissh.handler.GSISSHInputHandler/state
    at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
    at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
    at org.apache.zookeeper.ZooKeeper.setData(ZooKeeper.java:1228)
    at org.apache.airavata.gfac.core.utils.GFacUtils.updatePluginState(GFacUtils.java:1013)
    at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.invokeInFlowHandlers(BetterGfacImpl.java:902)
    at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.launch(BetterGfacImpl.java:690)
    at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.submitJob(BetterGfacImpl.java:481)
    at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.submitJob(BetterGfacImpl.java:210)
    at org.apache.airavata.gfac.core.utils.InputHandlerWorker.call(InputHandlerWorker.java:49)
    at java.util.concurrent.FutureTask.run(FutureTask.java:262)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
    at java.lang.Thread.run(Thread.java:745)

and

Caused by: org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /gfac-experiments/gfac-node0/SLM-Trinity-Stampede_0bd73a38-6931-498f-af7b-d700dc177c43+IDontNeedaNode_db287294-796d-43c1-896d-e3b412b4c8a7/org.apache.airavata.gfac.ssh.handler.AdvancedSCPOutputHandler
    at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
    at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
    at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1003)
    at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1031)
    at org.apache.airavata.gfac.core.utils.GFacUtils.createPluginZnode(GFacUtils.java:935)
    at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.invokeOutFlowHandlers(BetterGfacImpl.java:939)

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)