[ 
https://issues.apache.org/jira/browse/AIRAVATA-1354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eroma closed AIRAVATA-1354.
---------------------------
       Resolution: Cannot Reproduce
    Fix Version/s: 0.15 

Tested experiments in 
http://dev.test-drive.airavata.org/portal/ultrascan-testing/public/ and 
couldn't reproduce the issue.

> Job monitor for Stampede unknown status
> ---------------------------------------
>
>                 Key: AIRAVATA-1354
>                 URL: https://issues.apache.org/jira/browse/AIRAVATA-1354
>             Project: Airavata
>          Issue Type: Improvement
>          Components: GFac
>            Reporter: Raminderjeet Singh
>            Assignee: Shameera Rathnayaka
>              Labels: Monitoring
>             Fix For: 0.15 
>
>
> We should use the experiment ID to name jobs, giving each a unique identifier,
> and then use that job name to detect whether the job has entered an unknown
> status. If the job is still in the unknown state, we should check the working
> directory for stdout/stderr and take corrective action to resolve the UNKNOWN
> status. The same logic will be useful for job recovery if the Airavata server
> restarts.
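
The recovery logic proposed in the description could look roughly like the sketch below. It is not Airavata code: the status names, the job naming scheme, the `<job_name>.stdout` / `<job_name>.stderr` file layout, and the rule for inferring a final state are all assumptions made for illustration, and the inference rule in particular is deliberately naive.

```python
import os
import tempfile

# Hypothetical status values; Airavata's actual job states may differ.
UNKNOWN, ACTIVE, COMPLETE, FAILED = "UNKNOWN", "ACTIVE", "COMPLETE", "FAILED"


def job_name_for(experiment_id: str) -> str:
    """Derive a job name from the experiment ID (assumed naming scheme).

    Non-alphanumeric characters are replaced with underscores so the
    name is safe to pass to a batch scheduler.
    """
    return "".join(c if c.isalnum() else "_" for c in experiment_id)


def resolve_unknown(status: str, working_dir: str, job_name: str) -> str:
    """Try to replace an UNKNOWN status by inspecting the working directory.

    Assumed policy: a non-empty stderr file means the job failed, a
    stdout file alone means it completed, and no output files means
    the status cannot be resolved yet.
    """
    if status != UNKNOWN:
        return status  # Only UNKNOWN needs corrective action.

    stdout_path = os.path.join(working_dir, job_name + ".stdout")
    stderr_path = os.path.join(working_dir, job_name + ".stderr")

    if os.path.exists(stderr_path) and os.path.getsize(stderr_path) > 0:
        return FAILED
    if os.path.exists(stdout_path):
        return COMPLETE
    return UNKNOWN  # No evidence on disk; leave the status as it is.


if __name__ == "__main__":
    # Simulate a working directory for a job whose status was lost.
    with tempfile.TemporaryDirectory() as work_dir:
        name = job_name_for("ultrascan-exp-001")
        print(resolve_unknown(UNKNOWN, work_dir, name))
        with open(os.path.join(work_dir, name + ".stdout"), "w") as f:
            f.write("run finished\n")
        print(resolve_unknown(UNKNOWN, work_dir, name))
```

Because the job name is derived only from the experiment ID, the same lookup could be repeated after a server restart without any in-memory state. A real monitor would need stronger evidence than file presence, since a running job may already have written to stdout.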



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)