Dimuthu Upeksha created AIRAVATA-2956:
-----------------------------------------

             Summary: Possible race condition in job monitoring
                 Key: AIRAVATA-2956
                 URL: https://issues.apache.org/jira/browse/AIRAVATA-2956
             Project: Airavata
          Issue Type: Bug
          Components: helix implementation
            Reporter: Dimuthu Upeksha


When the job submission task submits a job to a compute resource, it returns a
job id, which is then saved in a ZooKeeper path for post-workflow execution.
In some cases, however, the job completes before that metadata is saved in
ZooKeeper, and the post workflow then fails.

2018-11-21 18:15:55,783 [main] INFO  o.a.a.h.i.w.PostWorkflowManager  - Processing job result of job id 9839 sent by EmailBasedProducer
2018-11-21 18:15:55,785 [main] WARN  o.a.a.h.i.w.PostWorkflowManager  - Could not find a monitoring register for job id 9839
2018-11-21 18:15:55,785 [main] INFO  o.a.a.h.i.w.PostWorkflowManager  - Status of processing 9839 : false
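
The race described above can be reproduced in isolation. The following is a
minimal sketch, not Airavata code: an in-memory map stands in for the ZooKeeper
monitoring register, and the class name, method names, and retry parameters are
all hypothetical. It contrasts a single lookup, which fails when the job result
arrives before the metadata is written, with a bounded retry, which is one
possible mitigation and not necessarily the fix adopted for this issue.

```java
import java.util.Map;
import java.util.Optional;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch: the map stands in for the ZooKeeper monitoring
// register. None of these names are Airavata or ZooKeeper APIs.
class MonitoringRegisterSketch {

    // job id -> process id, written after the job has been submitted
    private final Map<String, String> register = new ConcurrentHashMap<>();

    // Called by the (simulated) job submission task once it has a job id.
    void saveJobMetadata(String jobId, String processId) {
        register.put(jobId, processId);
    }

    // Single lookup: returns empty if the job result arrives before the write.
    Optional<String> lookup(String jobId) {
        return Optional.ofNullable(register.get(jobId));
    }

    // Assumed mitigation: retry the lookup a bounded number of times,
    // sleeping between attempts, before reporting the register as missing.
    Optional<String> lookupWithRetry(String jobId, int maxAttempts, long delayMillis) {
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            Optional<String> processId = lookup(jobId);
            if (processId.isPresent()) {
                return processId;
            }
            if (attempt < maxAttempts) {
                try {
                    Thread.sleep(delayMillis);
                } catch (InterruptedException e) {
                    // Preserve the interrupt flag and stop retrying.
                    Thread.currentThread().interrupt();
                    return Optional.empty();
                }
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) throws InterruptedException {
        MonitoringRegisterSketch sketch = new MonitoringRegisterSketch();

        // Simulate the metadata write landing after the job result arrives.
        Thread writer = new Thread(() -> {
            try {
                Thread.sleep(200);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                return;
            }
            sketch.saveJobMetadata("9839", "process-1");
        });
        writer.start();

        System.out.println("single lookup found register: "
                + sketch.lookup("9839").isPresent());
        System.out.println("retrying lookup found register: "
                + sketch.lookupWithRetry("9839", 10, 100).isPresent());
        writer.join();
    }
}
```

A retry only narrows the window. An alternative under the same assumptions
would be to buffer early job results and replay them once the register entry
appears, or to write the register entry before the job can report completion.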



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
