machristie commented on pull request #260:
URL: https://github.com/apache/airavata/pull/260#issuecomment-683982167


   @vivekshresta,
   
   Regarding validating internally, I would add the check to launchExperiment 
instead of createExperiment. 
   
   
   > We will check the size limit only after the experiment creation is done.
   
   That's true, but the current approach only checks before the experiment runs 
and so doesn't account for experiment output files.
   
    >  When we know the size limit is exceeded, helix needs to communicate back 
to APIServer for deleting the created
    >   experiment entries and if needed, deleting the experiment directory.
    >   Considering these and after discussing with Dimuthu, I thought this 
might be the better approach when we use 'StorageResourceAdaptor', but this 
does seem to complicate things.
   
   Well, my two cents, but I don't think the experiment needs to be deleted. We 
just need to set a flag in the database that the user is over quote on that 
storage resource and then prevent further file uploads/experiments on that 
storage resource.
   
   >  Also can you please elaborate a little on transient network failure in 
helix.
   
   Sure, Helix's task framework has builtin fault tolerance support, for 
example retrying in the case of failure: 
https://helix.apache.org/0.8.0-docs/tutorial_task_framework.html. By transient 
network failure I mean some sort of transient network failure between Airavata 
and the SSH host that prevents the SSH connection from establishing or 
completing successfully.
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to