Re: [Nagios-users] New user with nagios 3.0.3 hanging.
> Hello, > > My name is Jose Luis and I'm wrinting from Spain. > > I have recently installed Nagios in a Solaris 9 box. > > While I was in version 3.0.1, 1 month or so, everything has been working > smoothly, but now, with version 3.0.3 Nagios has "hanged" twice (once a week > more or less). > > Nagios daemon is running but neither checks are performed nor external > commands processed. So I have to restart it. > > Anyone else have experienced this behaviour? > > Any suggestions? We are thinking of setting up a cron check to detect this > situation (no entries in nagios.log), but perhaps we are experiencing some > kind of bug that should be solved in new releases. > > Thank you very much indded > > Jose Luis > - Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! Studies have shown that voting for your favorite open source project, along with a healthy diet, reduces your potential for chronic lameness and boredom. Vote Now at http://www.sourceforge.net/community/cca08___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] New user with nagios 3.0.3 hanging.
Hello agian, I'm experiencing right now the same problem. And I'm trying to doing some forensic. It's working in a 2 CPU system and main daemon is taking one of them. With truss I just get many many lines like this /2: poll(0xFEF79F88, 1, 500)= 1 /2: poll(0xFEF79F88, 1, 500)= 1 /2: poll(0xFEF79F88, 1, 500)= 1 /2: poll(0xFEF79F88, 1, 500)= 1 /2: poll(0xFEF79F88, 1, 500)= 1 /2: poll(0xFEF79F88, 1, 500)= 1 /2: poll(0xFEF79F88, 1, 500)= 1 /2: poll(0xFEF79F88, 1, 500)= 1 /2: poll(0xFEF79F88, 1, 500)= 1 And the only pool entry in code is in utill.c, function void * command_file_worker_thread(void *arg){ And the output (1) match this #define EPERM 1 /* Not super-user */ But although with this error it should write to nagios.log, nothing is written. I have also checked the external command file # ls -l /usr/local/nagios/var/rw/nagios.cmd prw-rw 1 nagios nagcmd 47675 Jul 14 09:08 /usr/local/nagios/var/rw/nagios.cmd And I've seen than this code is the same than in 3.0.1. Any idea? Trying to stop nagios (/etc/init.d/nagios stop) I can't. I just get Warning - nagios did not exit in a timely manner And 3 nagios processes are still running. So I had to "kill -9" them Thanks a lot Jose Luis 2008/7/11 Jose Luis Garcia <[EMAIL PROTECTED]>: > > Hello, >> >> My name is Jose Luis and I'm wrinting from Spain. >> >> I have recently installed Nagios in a Solaris 9 box. >> >> While I was in version 3.0.1, 1 month or so, everything has been working >> smoothly, but now, with version 3.0.3 Nagios has "hanged" twice (once a week >> more or less). >> >> Nagios daemon is running but neither checks are performed nor external >> commands processed. So I have to restart it. >> >> Anyone else have experienced this behaviour? >> >> Any suggestions? We are thinking of setting up a cron check to detect this >> situation (no entries in nagios.log), but perhaps we are experiencing some >> kind of bug that should be solved in new releases. >> >> Thank you very much indded >> >> Jose Luis >> > > - Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! Studies have shown that voting for your favorite open source project, along with a healthy diet, reduces your potential for chronic lameness and boredom. Vote Now at http://www.sourceforge.net/community/cca08___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] New user with nagios 3.0.3 hanging.
Hello again, Has anybody else experienced this behaviour? I'm suffering it every few days. Thank you very much Jose Luis 2008/7/14 Jose Luis Garcia <[EMAIL PROTECTED]> > Hello agian, > > I'm experiencing right now the same problem. And I'm trying to doing some > forensic. > > It's working in a 2 CPU system and main daemon is taking one of them. > > With truss I just get many many lines like this > > /2: poll(0xFEF79F88, 1, 500)= 1 > /2: poll(0xFEF79F88, 1, 500)= 1 > /2: poll(0xFEF79F88, 1, 500)= 1 > /2: poll(0xFEF79F88, 1, 500)= 1 > /2: poll(0xFEF79F88, 1, 500)= 1 > /2: poll(0xFEF79F88, 1, 500)= 1 > /2: poll(0xFEF79F88, 1, 500)= 1 > /2: poll(0xFEF79F88, 1, 500)= 1 > /2: poll(0xFEF79F88, 1, 500)= 1 > > And the only pool entry in code is in utill.c, function > > void * command_file_worker_thread(void *arg){ > > And the output (1) match this > > #define EPERM 1 /* Not super-user */ > > But although with this error it should write to nagios.log, nothing is > written. > > I have also checked the external command file > > # ls -l /usr/local/nagios/var/rw/nagios.cmd > prw-rw 1 nagios nagcmd 47675 Jul 14 09:08 > /usr/local/nagios/var/rw/nagios.cmd > > And I've seen than this code is the same than in 3.0.1. > > Any idea? > > Trying to stop nagios (/etc/init.d/nagios stop) I can't. I just get > > Warning - nagios did not exit in a timely manner > > And 3 nagios processes are still running. So I had to "kill -9" them > > Thanks a lot > > Jose Luis > > > > 2008/7/11 Jose Luis Garcia <[EMAIL PROTECTED]>: > > >> Hello, >>> >>> My name is Jose Luis and I'm wrinting from Spain. >>> >>> I have recently installed Nagios in a Solaris 9 box. >>> >>> While I was in version 3.0.1, 1 month or so, everything has been working >>> smoothly, but now, with version 3.0.3 Nagios has "hanged" twice (once a week >>> more or less). >>> >>> Nagios daemon is running but neither checks are performed nor external >>> commands processed. So I have to restart it. >>> >>> Anyone else have experienced this behaviour? >>> >>> Any suggestions? We are thinking of setting up a cron check to detect >>> this situation (no entries in nagios.log), but perhaps we are experiencing >>> some kind of bug that should be solved in new releases. >>> >>> Thank you very much indded >>> >>> Jose Luis >>> >> >> > - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] New user with nagios 3.0.3 hanging.
Do you by chance have a pipe output defined, such as the nagios irc bot? And, is that bot not running on the other side of the pipe to read what is written through the pipe? If you have a pipe defined for an output, 3.x will hang if nothing is reading from the other side of the pipe. -david Jose Luis Garcia wrote: > Hello again, > > Has anybody else experienced this behaviour? > > I'm suffering it every few days. > > Thank you very much > > Jose Luis - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] New user with nagios 3.0.3 hanging.
Hello David, I think I haven't. How can I know if i have defined a pipe. If I have done I didn't it in purpose, I promise :-) Thanks a lot Jose Luis 2008/8/1 David Ford <[EMAIL PROTECTED]> > Do you by chance have a pipe output defined, such as the nagios irc > bot? And, is that bot not running on the other side of the pipe to read > what is written through the pipe? > > If you have a pipe defined for an output, 3.x will hang if nothing is > reading from the other side of the pipe. > > -david > > > Jose Luis Garcia wrote: > > Hello again, > > > > Has anybody else experienced this behaviour? > > > > I'm suffering it every few days. > > > > Thank you very much > > > > Jose Luis > > - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null