[jira] [Commented] (AIRAVATA-2747) OOM issue in Helix Participant

2018-05-08 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16467931#comment-16467931 ] Dimuthu Upeksha commented on AIRAVATA-2747: --- Moved to SSHJ based ssh adaptor

[jira] [Resolved] (AIRAVATA-2747) OOM issue in Helix Participant

2018-05-08 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2747. --- Resolution: Fixed > OOM issue in Helix Participant > --

[jira] [Commented] (AIRAVATA-2746) Job completed and experiment failed due to error in initializing SSH agent

2018-05-08 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16467935#comment-16467935 ] Dimuthu Upeksha commented on AIRAVATA-2746: --- Fixed in new SSHJ based ssh adap

[jira] [Resolved] (AIRAVATA-2745) Job cancellations in the cluster should cancel the job and experiment in the gateway portal.

2018-05-08 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2745. --- Resolution: Fixed > Job cancellations in the cluster should cancel the job and expe

[jira] [Resolved] (AIRAVATA-2746) Job completed and experiment failed due to error in initializing SSH agent

2018-05-08 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2746. --- Resolution: Fixed > Job completed and experiment failed due to error in initializin

[jira] [Resolved] (AIRAVATA-2743) Experiment in CANCELLED while job is still QUEUED or SUBMITTED and canceling at cluster side

2018-05-08 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2743. --- Resolution: Fixed > Experiment in CANCELLED while job is still QUEUED or SUBMITTED

[jira] [Resolved] (AIRAVATA-2740) Non-existing file transfer has failed the experiment

2018-05-08 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2740. --- Resolution: Fixed > Non-existing file transfer has failed the experiment >

[jira] [Resolved] (AIRAVATA-2733) Improvements to Helix log messages

2018-05-08 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2733. --- Resolution: Fixed > Improvements to Helix log messages > --

[jira] [Resolved] (AIRAVATA-2736) Job submitted and running in HPC while the experiment is tagged as FAILED

2018-05-08 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2736. --- Resolution: Fixed > Job submitted and running in HPC while the experiment is tagged

[jira] [Resolved] (AIRAVATA-2735) When transferring input files, check for the file size and 0 byte files transfers should be restricted

2018-05-08 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2735. --- Resolution: Fixed > When transferring input files, check for the file size and 0 by

[jira] [Resolved] (AIRAVATA-2734) Experiment status in LAUNCEHD while job is in ACTIVE. Experiment status should be EXECUTING.

2018-05-08 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2734. --- Resolution: Fixed > Experiment status in LAUNCEHD while job is in ACTIVE. Experimen

[jira] [Resolved] (AIRAVATA-2737) Too many Zookeeper connections created

2018-05-08 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2737. --- Resolution: Fixed > Too many Zookeeper connections created > --

[jira] [Resolved] (AIRAVATA-2713) In helix test bed the outputs are not displayed in the experiment summary

2018-05-08 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2713. --- Resolution: Fixed > In helix test bed the outputs are not displayed in the experime

[jira] [Updated] (AIRAVATA-2792) Staging seagrid fails to submit a job

2018-05-18 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha updated AIRAVATA-2792: -- Component/s: helix implementation > Staging seagrid fails to submit a job > -

[jira] [Created] (AIRAVATA-2874) Data staging tasks should retry if a file transfer is failed

2018-08-24 Thread Dimuthu Upeksha (JIRA)
Dimuthu Upeksha created AIRAVATA-2874: - Summary: Data staging tasks should retry if a file transfer is failed Key: AIRAVATA-2874 URL: https://issues.apache.org/jira/browse/AIRAVATA-2874 Project: A

[jira] [Commented] (AIRAVATA-2874) Data staging tasks should retry if a file transfer is failed

2018-08-24 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16591652#comment-16591652 ] Dimuthu Upeksha commented on AIRAVATA-2874: --- Fixed and deployed in staging e

[jira] [Resolved] (AIRAVATA-2874) Data staging tasks should retry if a file transfer is failed

2018-08-24 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2874. --- Resolution: Fixed > Data staging tasks should retry if a file transfer is failed >

[jira] [Resolved] (AIRAVATA-2833) Several experiments failed at various stages of job submission due to connection lost

2018-09-21 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2833. --- Resolution: Fixed Added job submission retrying logic > Several experiments faile

[jira] [Resolved] (AIRAVATA-2831) Experiment FAILED with an error on output file staging! But the file referring in the error is actually downloaded and available in storage.

2018-09-21 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2831. --- Resolution: Fixed This should be fixed after data staging retrying implementation

[jira] [Resolved] (AIRAVATA-2826) Helix participant server was stopped and started while experiments are launched and job submissions to Jetstream cluster failed

2018-09-21 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2826. --- Resolution: Fixed > Helix participant server was stopped and started while experim

[jira] [Commented] (AIRAVATA-2826) Helix participant server was stopped and started while experiments are launched and job submissions to Jetstream cluster failed

2018-09-21 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16623942#comment-16623942 ] Dimuthu Upeksha commented on AIRAVATA-2826: --- Added job submission retrying l

[jira] [Resolved] (AIRAVATA-2792) Staging seagrid fails to submit a job

2018-09-21 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2792. --- Resolution: Fixed > Staging seagrid fails to submit a job > --

[jira] [Resolved] (AIRAVATA-2790) File uploading error due to session channel opening error occurred!

2018-09-21 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2790. --- Resolution: Fixed > File uploading error due to session channel opening error occu

[jira] [Resolved] (AIRAVATA-2789) Experiment failed with unexpected error in opening a session channel

2018-09-21 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2789. --- Resolution: Fixed > Experiment failed with unexpected error in opening a session c

[jira] [Resolved] (AIRAVATA-2784) Airavata unable to connect with the compute resource, comet.sdsc.edu

2018-09-21 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2784. --- Resolution: Fixed > Airavata unable to connect with the compute resource, comet.sd

[jira] [Resolved] (AIRAVATA-2786) Job COMPLETED but experiment failed with error message "unknown error occurred when initializing ..... "

2018-09-21 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2786. --- Resolution: Fixed > Job COMPLETED but experiment failed with error message "unknow

[jira] [Resolved] (AIRAVATA-2750) Helix Participant is not picking up tasks after a restart

2018-09-21 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2750. --- Resolution: Fixed > Helix Participant is not picking up tasks after a restart > --

[jira] [Closed] (AIRAVATA-2783) Gateway output file (.tar.gz) not existing when staging out but in real it exists in the working directory

2018-09-21 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha closed AIRAVATA-2783. - Resolution: Fixed Closed as this is no longer an issue as we are deprecating gfac > G

[jira] [Resolved] (AIRAVATA-2689) Distributed email clients to improve email monitoring

2018-09-21 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2689. --- Resolution: Fixed Fixed as a part of new helix implementation. Job monitors were t

[jira] [Resolved] (AIRAVATA-2386) Fix issues with email monitoring

2018-09-21 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2386. --- Resolution: Fixed New job monitors are running based on a state model so the order

[jira] [Created] (AIRAVATA-2940) Sporadic JPA errors when invoking Registry Server APIs

2018-11-12 Thread Dimuthu Upeksha (JIRA)
Dimuthu Upeksha created AIRAVATA-2940: - Summary: Sporadic JPA errors when invoking Registry Server APIs Key: AIRAVATA-2940 URL: https://issues.apache.org/jira/browse/AIRAVATA-2940 Project: Airavata

[jira] [Commented] (AIRAVATA-2940) Sporadic JPA errors when invoking Registry Server APIs

2018-11-12 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16684215#comment-16684215 ] Dimuthu Upeksha commented on AIRAVATA-2940: --- Still couldn't identify the cau

[jira] [Commented] (AIRAVATA-2942) Experiment cancelation request was not processed in Helix

2018-11-16 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16689597#comment-16689597 ] Dimuthu Upeksha commented on AIRAVATA-2942: --- Fixed in [https://github.com/a

[jira] [Resolved] (AIRAVATA-2942) Experiment cancelation request was not processed in Helix

2018-11-16 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2942. --- Resolution: Fixed > Experiment cancelation request was not processed in Helix > -

[jira] [Assigned] (AIRAVATA-2956) Possible race condition in job monitoring

2018-11-24 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha reassigned AIRAVATA-2956: - Assignee: Dimuthu Upeksha > Possible race condition in job monitoring > -

[jira] [Created] (AIRAVATA-2956) Possible race condition in job monitoring

2018-11-24 Thread Dimuthu Upeksha (JIRA)
Dimuthu Upeksha created AIRAVATA-2956: - Summary: Possible race condition in job monitoring Key: AIRAVATA-2956 URL: https://issues.apache.org/jira/browse/AIRAVATA-2956 Project: Airavata Is

[jira] [Commented] (AIRAVATA-2956) Possible race condition in job monitoring

2018-11-25 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16698201#comment-16698201 ] Dimuthu Upeksha commented on AIRAVATA-2956: --- Fixed in [https://github.com/a

[jira] [Resolved] (AIRAVATA-2956) Possible race condition in job monitoring

2018-11-25 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2956. --- Resolution: Fixed Added validation logic into AbstactParser before putting a job s

[jira] [Commented] (AIRAVATA-2962) Issue with experiment cancelation request prior to job submission

2018-12-19 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16725171#comment-16725171 ] Dimuthu Upeksha commented on AIRAVATA-2962: --- Fixed in  [https://github.com/

[jira] [Resolved] (AIRAVATA-2962) Issue with experiment cancelation request prior to job submission

2018-12-19 Thread Dimuthu Upeksha (JIRA)
[ https://issues.apache.org/jira/browse/AIRAVATA-2962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dimuthu Upeksha resolved AIRAVATA-2962. --- Resolution: Fixed > Issue with experiment cancelation request prior to job submiss

<    1   2