So, the lock file you create is just there to identify the assigned GPU? I 
haven't done anything with prolog stuff before, but I'll take a look.


Date: Wed, 19 Nov 2014 16:02:02 +0000
From: [email protected]
To: [email protected]
Subject: Re: [gridengine users] Requesting a resource OR another resource

On Wed, 19 Nov 2014 13:44:37 +0000
Kevin Taylor <[email protected]> wrote:
 
> I'm not sure if this is possible or not, but thought I'd ask it.
> 
> We have a setup of consumable resources for our GPUs. If a system has two we 
> have a complex called gpu1_free and gpu2_free. They'll be equal to 1 if 
> they're free and zero if they're not. Typically we just request like this:   
> qsub -l gpu1_free=1 job.sh
> 
> Is there a way through qsub to say
> 
> qsub -l gpu1_free=1 OR gpu2_free=1 job.sh
> 
> I know putting multiple -l's will ask for both, but we just want one or the 
> other, whichever.
> 
> Univa has that nice RSMAP feature that would solve our issue, but we haven't 
> worked out finances on that yet so we're seeing if we can just make it work a 
> little easier with what we have.
 
The solution we use on our cluster is as follows: we have a single integer 
complex for all GPUs attached to a node.  The prolog then twiddles lock files 
to allocate one or both GPUs to the job as appropriate and changes permissions 
on the /dev/nv* files to ensure the job can only use the GPU it has been 
assigned (you have to pass a special parameter to the nvidia module to stop 
it setting the permissions itself, though).
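
As a rough illustration of the lock-file idea, here is a minimal sketch in 
Python of how a prolog helper might claim GPUs. It is not the script used on 
the cluster described above. The lock directory, the gpuN.lock file names, and 
the claim_gpus/release_gpus functions are all hypothetical, and the permission 
changes on the /dev/nv* files are left out.

```python
import os
import tempfile


def claim_gpus(lock_dir, num_gpus, wanted, job_id):
    """Claim `wanted` GPUs by creating one lock file per GPU.

    Returns the list of claimed GPU indices. The layout
    <lock_dir>/gpu<N>.lock is an assumption made for this sketch.
    """
    claimed = []
    for idx in range(num_gpus):
        if len(claimed) == wanted:
            break
        path = os.path.join(lock_dir, f"gpu{idx}.lock")
        try:
            # O_CREAT | O_EXCL fails if the file already exists, so two
            # prologs racing for the same GPU cannot both succeed.
            fd = os.open(path, os.O_CREAT | os.O_EXCL | os.O_WRONLY, 0o644)
        except FileExistsError:
            continue  # GPU already held by another job
        with os.fdopen(fd, "w") as fh:
            fh.write(str(job_id))  # record the owner for debugging
        claimed.append(idx)
    if len(claimed) < wanted:
        # Not enough free GPUs: give back what we took and report failure.
        release_gpus(lock_dir, claimed)
        raise RuntimeError("not enough free GPUs on this node")
    return claimed


def release_gpus(lock_dir, indices):
    """Remove the lock files for the given GPU indices (epilog side)."""
    for idx in indices:
        os.remove(os.path.join(lock_dir, f"gpu{idx}.lock"))


if __name__ == "__main__":
    demo_dir = tempfile.mkdtemp()  # stand-in for a real lock directory
    print(claim_gpus(demo_dir, num_gpus=2, wanted=1, job_id="101"))
```

In a setup like this the scheduler's integer complex would only count how many 
GPUs are free on a node, while the lock files decide which device a given job 
gets. A matching epilog would call something like release_gpus when the job 
ends.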
 
-- 
William Hay <[email protected]>

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users                                   
  