[gridengine users] issues with USERNAME variable in batch jobs

2011-11-22 Thread iqtcub
Hi all, We're having an issue in our cluster. We are running SLES10 and SLES11 in the execution hosts(each OS separated with hostgroups, with each hostgroup having its own queue), with LDAP. Our SGE version is GE6.2u5. I've done a simple script that runs 'env'. The problem is that in some no

Re: [gridengine users] issues with USERNAME variable in batch jobs

2011-11-22 Thread Reuti
Am 22.11.2011 um 10:03 schrieb iqtcub: > Hi all, > > We're having an issue in our cluster. We are running SLES10 and SLES11 in the > execution hosts(each OS separated with hostgroups, with each hostgroup having > its own queue), with LDAP. Our SGE version is GE6.2u5. > > I've done a simple scr

Re: [gridengine users] issues with USERNAME variable in batch jobs

2011-11-22 Thread Petter Gustad
From: iqtcub Subject: [gridengine users] issues with USERNAME variable in batch jobs Date: Tue, 22 Nov 2011 10:03:30 +0100 > I've done a simple script that runs 'env'. The problem is that in some > nodes, the USERNAME variable has root as value, while in other nodes > it isn't defined at all. Co

Re: [gridengine users] issues with USERNAME variable in batch jobs

2011-11-22 Thread iqtcub
On 11/22/2011 10:44 AM, Reuti wrote: Am 22.11.2011 um 10:03 schrieb iqtcub: Hi all, We're having an issue in our cluster. We are running SLES10 and SLES11 in the execution hosts(each OS separated with hostgroups, with each hostgroup having its own queue), with LDAP. Our SGE version is GE6.2u

Re: [gridengine users] resource management and over-subscription?

2011-11-22 Thread Reuti
Hi, Am 22.11.2011 um 02:17 schrieb Julie Ashworth: > Thanks Reuti and all for being so responsive on this list. you're welcome. > I have (probably) a newbie question... > > I manage a small compute cluster for a university. I configured > a simple user-based functional share policy, since w

[gridengine users] Why define many PEs?

2011-11-22 Thread mahbube rustaee
Hi , I'm so thankful in advance for your time and your help. suppose we have many users with different request on "how to get slots for running mpi jobs", such some want fillup allocation rule, some users round robin, some 2 slots per host, some 4 slots per host ,... sgeadmin had to define many

[gridengine users] Fwd: backfilling, s_rt

2011-11-22 Thread baf035
Hello all. 1) When s_rt time is reached a job is signaled by default by SIGUSR1. One of our applications is dying by the signal. I changed in a global configuration in execd_params a parameter NOTIFY_KILL=none or NOTIFY_KILL=SIGCONT bur without effect. Applications are still signaled by SIGUSR1. H

Re: [gridengine users] problem in run mpi jobs

2011-11-22 Thread mahbube rustaee
On Mon, Nov 21, 2011 at 1:44 PM, Reuti wrote: > Am 21.11.2011 um 05:30 schrieb mahbube rustaee: > > > On Mon, Nov 21, 2011 at 3:27 AM, Reuti > wrote: > > Hi, > > > > Am 20.11.2011 um 12:37 schrieb mahbube rustaee: > > > > > 1) I run intel mpi jobs. when $NSLOTS<=50 , qsub is ok, but for slots >

Re: [gridengine users] problem in run mpi jobs

2011-11-22 Thread Reuti
Am 22.11.2011 um 13:25 schrieb mahbube rustaee: > On Mon, Nov 21, 2011 at 1:44 PM, Reuti wrote: > Am 21.11.2011 um 05:30 schrieb mahbube rustaee: > > > On Mon, Nov 21, 2011 at 3:27 AM, Reuti wrote: > > Hi, > > > > Am 20.11.2011 um 12:37 schrieb mahbube rustaee: > > > > > 1) I run intel mpi jobs

Re: [gridengine users] Fwd: backfilling, s_rt

2011-11-22 Thread Reuti
Hi, Am 22.11.2011 um 12:58 schrieb baf035: > > Hello all. > > 1) When s_rt time is reached a job is signaled by default by SIGUSR1. One of > our applications is dying by the signal. > I changed in a global configuration in execd_params a parameter > NOTIFY_KILL=none or NOTIFY_KILL=SIGCONT > b

Re: [gridengine users] lock number of cores

2011-11-22 Thread Reuti
Hi, Am 20.11.2011 um 09:46 schrieb mahbube rustaee: > Am 19.11.2011 um 05:57 schrieb mahbube rustaee: > > > Some users wants to get cores less than all core of a host and lock other > > slots. > > How can config GE to do that? > > e.g. a user use 20 slots of a host with 48 core and lock other s

Re: [gridengine users] Why define many PEs?

2011-11-22 Thread Reuti
Hi, Am 22.11.2011 um 12:52 schrieb mahbube rustaee: > I'm so thankful in advance for your time and your help. > > suppose we have many users with different request on "how to get slots for > running mpi jobs", > such some want fillup allocation rule, some users round robin, some 2 slots > pe

Re: [gridengine users] Beware Univa FUD

2011-11-22 Thread Ron Chen
I have been promoting and working on Grid Engine since June 2001: http://beowulf.org/archive/2001-July/004410.html http://beowulf.org/archive/2001-July/004341.html And where was Univa during all those years? Did Univa contribute a single line of code to the SGE cvs? Did Univa answer a single

[gridengine users] SGE (univa 8.0.1) - anyone running SGE with Centrify active directory integration?

2011-11-22 Thread Chris Dagdigian
Hi folks, I'm hands-on with a shiny new cluster running Univa's 8.0.1 release and am having some issues running jobs as a non-root user via an account that lives in Active Directory. The cluster is the standard sort of RHEL 5.7 based system but we are using Centrify and in particular the Ce

Re: [gridengine users] SGE (univa 8.0.1) - anyone running SGE with Centrify active directory integration?

2011-11-22 Thread Reuti
Hi Chris, Am 22.11.2011 um 21:05 schrieb Chris Dagdigian: > I'm hands-on with a shiny new cluster running Univa's 8.0.1 release and am > having some issues running jobs as a non-root user via an account that lives > in Active Directory. isn't Univa offering "Full, Enterprise Class Support"? I

Re: [gridengine users] SGE (univa 8.0.1) - anyone running SGE with Centrify active directory integration?

2011-11-22 Thread Bill Bryce
Hi Chris, I think the best way is to log this as an issue at Univa and we can go from there. Is this cluster for your personal use or are you configuring it on behalf of a customer? You can send an email to supp...@univa.com or login to the support portal http://www.univa.com/support and we

Re: [gridengine users] SGE (univa 8.0.1) - anyone running SGE with Centrify active directory integration?

2011-11-22 Thread Rayson Ho
While I worked with Billl before, and I've started working with Chris even before SGE was opensourced (that was in early 2001), and I don't want to be rude on this list - I just cannot agree more with Reuti!! At the beginning of this year, Univa said that this list can't help users using Oracle Gr

Re: [gridengine users] SGE (univa 8.0.1) - anyone running SGE with Centrify active directory integration?

2011-11-22 Thread Chris Dagdigian
Like many users of the Oracle branded SGE I was hopefully assuming faster and more targeted support would be available from the smart people who inhabit the users- list. Don't we have a history going back (like forever?) of doing that? Univa support is going to be my 2nd stop mainly because

Re: [gridengine users] resource management and over-subscription?

2011-11-22 Thread Julie Ashworth
hi Reuti and all, On 22-11-2011 11.40 +0100, Reuti wrote: > > suspend_threshold. But how does SGE choose which jobs get > > suspended? > > They are just hanging around there, just stopped. No memory, consumable > resource or diskspace will be freed. ---end quoted text--- I have a simple follow

Re: [gridengine users] SGE (univa 8.0.1) - anyone running SGE with Centrify active directory integration?

2011-11-22 Thread bergman
In the message dated: Tue, 22 Nov 2011 15:05:43 EST, The pithy ruminations from Chris Dagdigian on <[gridengine users] SGE (univa 8.0.1) - anyone running SGE with Centrify active directory integration?> were: => => Hi folks, In the spirit of supporting the community of SGE users, rather than sp

Re: [gridengine users] SGE (univa 8.0.1) - anyone running SGE with Centrify active directory integration?

2011-11-22 Thread Rayson Ho
Hi Chris, I *DID NOT* say that all discussions related to Univa Grid Engine had to be banned. As we don't have the Univa Grid Engine source code, we just can't debug the problem. That's basically the same reason Bill asked others to turn to Oracle for help with issues related to Oracle Grid Engine

Re: [gridengine users] SGE (univa 8.0.1) - anyone running SGE with Centrify active directory integration?

2011-11-22 Thread Brooks Davis
On Tue, Nov 22, 2011 at 03:05:43PM -0500, Chris Dagdigian wrote: > > Hi folks, > > I'm hands-on with a shiny new cluster running Univa's 8.0.1 release and > am having some issues running jobs as a non-root user via an account > that lives in Active Directory. > > The cluster is the standard sor

Re: [gridengine users] SGE (univa 8.0.1) - anyone running SGE with Centrify active directory integration?

2011-11-22 Thread Chris Dagdigian
Hi Rayson, Did not mean to imply that it was you who made those statements - I actually thought you were referring to or quoting someone else who had attempted in the past to dictate what the community list can be used for. All I wanted to say was that nobody can dictate how this list is used

Re: [gridengine users] resource management and over-subscription?

2011-11-22 Thread Reuti
Hi Julie, Am 22.11.2011 um 22:10 schrieb Julie Ashworth: > hi Reuti and all, > > On 22-11-2011 11.40 +0100, Reuti wrote: >>> suspend_threshold. But how does SGE choose which jobs get >>> suspended? >> >> They are just hanging around there, just stopped. No memory, consumable >> resource or di

Re: [gridengine users] SGE (univa 8.0.1) - anyone running SGE with Centrify active directory integration?

2011-11-22 Thread Rayson Ho
Hi Chris, Thanks for the clarification. I have been a supporter of open-source Grid Engine for over 8 years, and the last thing I wanted to see is the mailing list fragmented up. As you know OGS' mailing list was not created until Oracle announced its plan to finally shutdown the original site, b

Re: [gridengine users] resource management and over-subscription?

2011-11-22 Thread Julie Ashworth
On 22-11-2011 23.26 +0100, Reuti wrote: > > sorry for the confusion: the "no" was meant for memory + consumable resource > + diskspace (i.e. the $TMPDIR). None of them will be freed. Ah, this is my fault - a bad combination of optimism and reading too quickly ;). > What you can do if your jobs

Re: [gridengine users] SGE (univa 8.0.1) - anyone running SGE with Centrify active directory integration?

2011-11-22 Thread Ron Chen
Chris, 1) There really are differences between Oracle Grid Engine and Univa Grid Engine. First and foremost, Oracle has never used misleading or false information just to get an extra customer to pay for Oracle Grid Engine. If you have not read all the messages in the mail thread started by Dav

Re: [gridengine users] SGE (univa 8.0.1) - anyone running SGE with Centrify active directory integration?

2011-11-22 Thread Beat Rubischon
Hi Chris! On 22.11.11 21:05, Chris Dagdigian wrote: > The user errors I see are familiar ones: > "can't get password entry for user "x". Either user does not exist or > NIS error!" I assume those errors are intermittent and not permanent. Otherwise I'm pretty sure you found them long before your

[gridengine users] qacct in xml?

2011-11-22 Thread Gerard Henry
hello all, for a home app, i need to get the stats of finished jobs. "qacct -j jid" gives me datas not easily usable, i'd prefer to have them in xml. But there is no "-xml" to qacct (or am i missing something?) the DRMAA way is not an option. Is there a way to get datas in another format? i c