Re: [gridengine users] SGE crashes immediately after re-start

2016-03-06 Thread Simon Matthews
Restarting one of the execd nodes solved the issue. I then found some jobs, which I force-deleted, and the problem seems to have gone away. Thanks. Simon On Sun, Mar 6, 2016 at 10:07 AM, Reuti wrote: > Hi, > > On 04.03.2016 at 16:40, Simon Matthews wrote: > >> I am getting this error message: >>

Re: [gridengine users] SoGE 8.1.8 - Max number of grid nodes

2016-03-06 Thread Yuri Burmachenko
Thank you very much Reuti! On Mar 6, 2016, at 8:02 PM, Reuti <re...@staff.uni-marburg.de> wrote: Hi, On 06.03.2016 at 18:48, Yuri Burmachenko wrote: Hello Reuti, Just one last question: is adjusting MAX_DYN_EC a disruptive action or not, and is the effect immediate

Re: [gridengine users] SGE crashes immediately after re-start

2016-03-06 Thread Reuti
Hi, On 04.03.2016 at 16:40, Simon Matthews wrote: > I am getting this error message: > 03/04/2016 07:30:14|listen|sgemaster|E|commlib error: local host name > error (remote rdata host name "turquoise" is not equal to local > resolved host name "h2.sj.bps") > 03/04/2016 > 07:30:23|worker|sgemast

Re: [gridengine users] SoGE 8.1.8 - Max number of grid nodes

2016-03-06 Thread Reuti
Hi, On 06.03.2016 at 18:48, Yuri Burmachenko wrote: > Hello Reuti, > > Just one last question: is adjusting MAX_DYN_EC a disruptive action or not, and > is the effect immediate, or is a restart of the SGE master required? All changes are live and won't interrupt any scheduling. In fact: with SGE you ca

Re: [gridengine users] SoGE 8.1.8 - Max number of grid nodes

2016-03-06 Thread Yuri Burmachenko
Hello Reuti, Just one last question: is adjusting MAX_DYN_EC a disruptive action or not, and is the effect immediate, or is a restart of the SGE master required? Thank You! -Original Message- From: Reuti [mailto:re...@staff.uni-marburg.de] Sent: Sunday, March 06, 2016 7:39 PM To: Yuri Burmachenko

Re: [gridengine users] SoGE 8.1.8 - Max number of grid nodes

2016-03-06 Thread Reuti
On 06.03.2016 at 18:21, Yuri Burmachenko wrote: > Hi Reuti, > > No, we didn't change that setting, but anyway I could not find it in `qconf > -mconf`; see below the contents of this command: Please have a look at `man sge_conf` about it. - Reuti > execd_spool_dir /local/sge_spoo

Re: [gridengine users] SoGE 8.1.8 - Job IDs getting reset very fast 9999999 ==> 1 - 6-7 times in a month

2016-03-06 Thread Reuti
Hi, On 06.03.2016 at 18:04, Yuri Burmachenko wrote: > Hello to distinguished forum members, > > Recently we have found that something is wrong with SGE Job IDs – they are > getting reset very fast: 6-7 times in a month. > We don’t really have so many jobs executed in such a short period of ti

Re: [gridengine users] SoGE 8.1.8 - Max number of grid nodes

2016-03-06 Thread Yuri Burmachenko
Hi Reuti, No, we didn't change that setting, but anyway I could not find it in `qconf -mconf`; see below the contents of this command: execd_spool_dir /local/sge_spool mailer /bin/mail xterm /usr/bin/xterm load_sensor /hom

Re: [gridengine users] SoGE 8.1.8 - Max number of grid nodes

2016-03-06 Thread Reuti
Hi, On 06.03.2016 at 17:56, Yuri Burmachenko wrote: > Hello to distinguished forum members, > > Is there any limit on the number of grid nodes in SoGE 8.1.8? > > We see in the qmaster spool message the following notice: > qmaster will accept max. 950 dynamic event clients There is a setting of MAX_DYN_EC

[gridengine users] SoGE 8.1.8 - Job IDs getting reset very fast 9999999 ==> 1 - 6-7 times in a month

2016-03-06 Thread Yuri Burmachenko
Hello to distinguished forum members, Recently we have found that something is wrong with SGE Job IDs - they are getting reset very fast: 6-7 times in a month. We don't really have so many jobs executed in such a short period of time. We use JobId (via qacct) as a primary key for different home-

[gridengine users] SoGE 8.1.8 - Max number of grid nodes

2016-03-06 Thread Yuri Burmachenko
Hello to distinguished forum members, Is there any limit on the number of grid nodes in SoGE 8.1.8? We see in the qmaster spool message the following notice: qmaster will accept max. 950 dynamic event clients Does this mean that the limit is 950 grid nodes? Is it possible to increase this number? What are ou