Eroma created AIRAVATA-1651:
-------------------------------

             Summary: Zookeeper connection lost error; Experiment failed
                 Key: AIRAVATA-1651
                 URL: https://issues.apache.org/jira/browse/AIRAVATA-1651
             Project: Airavata
          Issue Type: Bug
         Environment: http://test-drive.airavata.org/pga/public
            Reporter: Eroma


Two experiment has the same error message in log
One experiment got FAILED at experiment level and no job status recorded.
Other Experiment failed but the job got COMPLETE. Randomely occurs. was unable 
to recreate

error messages retrived from log;
2015-03-26 09:33:34,693 [main-SendThread(gw127.iu.xsede.org:9181)] INFO  
org.apache.zookeeper.ClientCnxn  - Opening socket connection to server 
gw127.iu.xsede.org/149.165.228.125:9181
...skipping...
org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = 
ConnectionLoss for 
/gfac-experiments/gfac-node0/SLM-WRF-Stampede_c0697813-a8f4-4d8a-b0f3-6808f8538b18+IDontNeedaNode_a3b6133f-f8af-435d-9b2a-76838db535f6/org.apache.airavata.gfac.gsissh.handler.GSISSHInputHandler/state
       at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
       at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
       at org.apache.zookeeper.ZooKeeper.setData(ZooKeeper.java:1228)
       at 
org.apache.airavata.gfac.core.utils.GFacUtils.updatePluginState(GFacUtils.java:1013)
       at 
org.apache.airavata.gfac.core.cpi.BetterGfacImpl.invokeInFlowHandlers(BetterGfacImpl.java:902)
       at 
org.apache.airavata.gfac.core.cpi.BetterGfacImpl.launch(BetterGfacImpl.java:690)
       at 
org.apache.airavata.gfac.core.cpi.BetterGfacImpl.submitJob(BetterGfacImpl.java:481)
       at 
org.apache.airavata.gfac.core.cpi.BetterGfacImpl.submitJob(BetterGfacImpl.java:210)
       at 
org.apache.airavata.gfac.core.utils.InputHandlerWorker.call(InputHandlerWorker.java:49)
       at java.util.concurrent.FutureTask.run(FutureTask.java:262)
       at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
       at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
       at java.lang.Thread.run(Thread.java:745)

and 

aused by: org.apache.zookeeper.KeeperException$ConnectionLossException: 
KeeperErrorCode = ConnectionLoss for 
/gfac-experiments/gfac-node0/SLM-Trinity-Stampede_0bd73a38-6931-498f-af7b-d700dc177c43+IDontNeedaNode_db287294-796d-43c1-896d-e3b412b4c8a7/org.apache.airavata.gfac.ssh.handler.AdvancedSCPOutputHandler
       at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
       at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
       at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1003)
       at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1031)
       at 
org.apache.airavata.gfac.core.utils.GFacUtils.createPluginZnode(GFacUtils.java:935)
       at 
org.apache.airavata.gfac.core.cpi.BetterGfacImpl.invokeOutFlowHandlers(BetterGfacImpl.java:939)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to