[slurm-dev] Re: checkpoint/restart feature in SLURM

2016-03-20 Thread Ralph Castain
I am not aware of any MPI that would allow you to relocate a process while the 
job is running. You have to checkpoint it, terminate it, and then restart the 
entire job with the new node included.

> On Mar 16, 2016, at 9:58 PM, Husen R  wrote:
> 
> Dear Slurm-dev,
> 
> 
> Does checkpoint/restart feature available in SLURM able to relocate MPI 
> application from one node to another node while it is running ?
> 
> For the example, I run MPI application in node A,B and C in a cluster and I 
> want to migrate/relocate process running in node A to other node, let's say 
> to node C while it is running. 
> 
> is there a way to do this with SLURM ? Thank you.
> 
> 
> Regards,
> 
> Husen



[slurm-dev] RE: checkpoint/restart feature in SLURM

2016-03-19 Thread John Hearns
O I'll we k lo

Sent from my Windows Phone

From: Husen R
Sent: ‎17/‎03/‎2016 05:56
To: slurm-dev
Subject: [slurm-dev] checkpoint/restart feature in SLURM

Dear Slurm-dev,


Does checkpoint/restart feature available in SLURM able to relocate MPI 
application from one node to another node while it is running ?

For the example, I run MPI application in node A,B and C in a cluster and I 
want to migrate/relocate process running in node A to other node, let's say to 
node C while it is running.

is there a way to do this with SLURM ? Thank you.


Regards,

Husen


Scanned by MailMarshal - M86 Security's comprehensive email content security 
solution.


Any views or opinions presented in this email are solely those of the author 
and do not necessarily represent those of the company. Employees of XMA Ltd are 
expressly required not to make defamatory statements and not to infringe or 
authorise any infringement of copyright or any other legal right by email 
communications. Any such communication is contrary to company policy and 
outside the scope of the employment of the individual concerned. The company 
will not accept any liability in respect of such communication, and the 
employee responsible will be personally liable for any damages or other 
liability arising. XMA Limited is registered in England and Wales (registered 
no. 2051703). Registered Office: Wilford Industrial Estate, Ruddington Lane, 
Wilford, Nottingham, NG11 7EP