Here is my $0.02:
I have a distributed Nagios2 system with 24,000+ service checks and 4000+
hosts. I rely heavily on NSCA to get the results from the slaves to the master.
My issue seems to be with Nagios since I can't get a Nagios slave to process a
mere thousand service checks using the docum
If you’re using the embedded Perl interpreter a restart is probably better
since the interpreter leaks memory.
If you have a very large solution (thousands of service checks) a restart will
take a considerable amount of time so a HUP would probably be wise in that
situation.
Jonathan
> -O
See responses inline:
> -Original Message-
> From: Thomas Guyot-Sionnest [mailto:derm...@aei.ca]
> Sent: Sunday, December 13, 2009 9:23 PM
> To: Jonathan Call
> Cc: nagios-user Mailinglist
> Subject: Re: [Nagios-users] Nagios2 process overwhelmed by NSCA
daemon?
>
&
eporting any
errors, or indicating that there is any problem processing passive results.
Nagios just starts to lose the data at a certain point.
Jonathan
> -Original Message-
> From: Greg Pangrazio [mailto:pangr...@gmail.com]
> Sent: Thursday, December 10, 2009 7:26 AM
> To:
I recently added two new slaves to a distributed Nagios system. The
central server now passively processes 17,000+ service checks on 3000+
servers.
It's been over an hour and a half since I brought those new slaves
online and I have about 150 hosts still stuck in 'Pending' and about
1300 services
Any timetable for a STABLE release (i.e. not beta)?
Any timetable for supporting distributed deployments?
Jonathan
> -Original Message-
> From: Taylor Dondich [mailto:tdond...@gmail.com]
> Sent: Monday, October 12, 2009 1:10 PM
> To: nagios-user Mailinglist
> Subject: [Nagios-users] Lila
Have you considered OCP_daemon?
http://wiki.nagios.org/index.php/OCP_Daemon
> -Original Message-
> From: d...@chatham.org [mailto:d...@chatham.org]
> Sent: Tuesday, September 08, 2009 1:00 PM
> To: nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] NSCA speed problem
>
> I have
Since I have a large Nagios distributed system the possibility of a
Nagios process going AWOL on one of my many servers is a serious
concern. Has anyone come up with a sure way to confirm (i.e. a cron job)
that Nagios is processing checks properly?
For example, I had one OCP_daemon process die, a
That sounds very familiar to the locking/contention issue FreeBSD 7.x has with
Nagios 2.x. It has to do with how Nagios and FreeBSD handle threading.
Unfortunately I don’t have any answers on how to fix it. I’ve had to leave my
Nagios deployment on FreeBSD 6/Nagios 2 for the same reason: anythin
age-
> From: Taylor Dondich [mailto:tdond...@gmail.com]
> Sent: Monday, March 09, 2009 10:28 AM
> To: Jonathan Call
> Cc: nagios-user Mailinglist
> Subject: Re: [Nagios-users] Lilac, a Nagios 3.x Configuration Tool,has
> released 1.0 Release Candidate 1.
>
> That's on the
I don't see it mentioned anywhere so I thought I would ask,
Does Lilac support distributed Nagios deployments?
Jonathan
> -Original Message-
> From: Taylor Dondich [mailto:tdond...@gmail.com]
> Sent: Sunday, March 08, 2009 10:12 PM
> To: nagios-user Mailinglist
> Subject: [Nagios-users]
> -Original Message-
> From: Randall R. Saeks [mailto:rsa...@district30.k12.il.us]
> Sent: Friday, January 16, 2009 1:55 PM
> To: nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] Nagios 3.0.6 on 10.5.6 Server
>
> Ever since I upgraded my server running Nagios 3.0.6 to 10.5.6, I
> -Original Message-
> From: Marc Powell [mailto:m...@ena.com]
> Sent: Friday, January 16, 2009 1:20 PM
> To: nagios-users Mailinglist
> Subject: Re: [Nagios-users] Hosts report 'DOWN, HARD' after first
attempt.
>
>
> On Jan 16, 2009, at 12:40 PM, Patrick Morris wrote:
>
> > The max_chec
> -Original Message-
> From: Patrick Morris [mailto:patrick.mor...@hp.com]
> Sent: Friday, January 16, 2009 11:40 AM
> To: Jonathan Call
> Cc: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Hosts report 'DOWN, HARD' after first
attempt
some hosts
this is generating a tone of false "HOST DOWN" notifications. Is there
any way to fix it?
Jonathan Call
This email message is intended for the use of the person to whom it has been
sent, and may contain information that is confidential or legally protected. If
you are n
Does Lilac support distributed configuration? I looked over the site
briefly and did not see any such capability.
Jonathan
> -Original Message-
> From: Taylor Dondich [mailto:tdond...@gmail.com]
> Sent: Wednesday, December 10, 2008 12:45 PM
> To: nagios-user Mailinglist
> Subject: [Nagios
NSCA just doesn't scale well within Nagios.
You will need to try something like the OCP Daemon mentioned here:
http://www.nagioscommunity.org/wiki/index.php/OCP_Daemon
I believe Andreas Ericsson has also written a broker module for NSCA. It
is apparently still in its testing/alpha stages s
-Original Message-
From: Jon Angliss [mailto:[EMAIL PROTECTED]
Sent: Wednesday, September 24, 2008 5:52 PM
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] intermittent CGI failure
On Mon, 15 Sep 2008 12:29:55 -0400 (EDT), [EMAIL PROTECTED] wrote:
>Installed 3.0.3 from
In Nagios 2.x Nagios the Obessive Compulsive Service Processor (OCSP) is
not very robust. Even with a few hundred service checks the OCSP stuff
on the distributed servers bogs down and does not send anything out.
This forced people like me to use tools like OCP_daemon.
Has the OCSP infrastructure
-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Dave
Horsfall
Sent: Monday, September 08, 2008 4:46 PM
To: Nagios Users
Subject: Re: [Nagios-users] Anyone tried Nagios 3.0.3 on FreeBSD yet?
On Mon, 8 Sep 2008, Sean McAfee wrote:
> This has long been in
I noticed the port change a few days ago. Anyone tried it?
Does it behave better than Nagios 2 on FreeBSD 7?
Jonathan Call
Network Engineer - NTT/Verio
(801) 437-7476
This email message is intended for the use of the person to whom it has been
sent, and may contain information that is
instead of libpthread those issues went away.
I've been tempted to try that on the amd64 system but I'm waiting for
Nagios to hang/fail again.
Jonathan
-Original Message-
From: Sean McAfee [mailto:[EMAIL PROTECTED]
Sent: Thursday, September 04, 2008 8:16 AM
To: Jonathan Call
Cc: D
Yes I have. And it is very annoying. A service check goes and
the thread hangs, which makes Nagios hang. The service check,
its thread parent remain as unkillable zombies until the server is
rebooted.
No one has offered any sort of solution other than "Have you tried
Nagios 3?" (Which I have not
and went and nothing happened.
Perhaps this is because I have active checks disabled globally (via
nagios.cfg) on my central server instead of on a per host basis?
-Original Message-
From: Tom Ammon [mailto:[EMAIL PROTECTED]
Sent: Friday, August 15, 2008 12:03 PM
To: Jonathan Call
Cc
Correct me if I'm wrong:
In order to run a distributed system, the central server should have
active service checks disabled. But freshness checking executes the
check command when it doesn't receive a passive response in a timely
manner. This means the freshness check never runs.
How do you get a
I'm running FreeBSD 7 (amd64 at that) and Nagios 2.12.
It ran great for about a month. And then today I found that Nagios had
stopped processing checks and there are a few unkillable processes
lingering.
I remember at least one other person posting something similar to this.
Has anyone found a so
ices that have been acknowledged or
that are in Scheduled Downtime? I can do it with Nagios' status.cgi
using serviceprops=10 and hostsprops=10 variables.
Thank you,
Jonathan Call
This email message is intended for the use of the person to whom it has been
sent, and may contain information
> -Original Message-
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] On Behalf Of Marc Powell
> Sent: Sunday, June 22, 2008 10:31 AM
> To: nagios-user Mailinglist
> Subject: Re: [Nagios-users] FreeBSD Nagios 2.12
>
>
> On Jun 20, 2008, at 12:16 PM, Andrew D wrote:
>
>
have zombie processes that do not respond to
any kill command:
nagios 46134 0.0 0.0 0 0 ?? Z11:43PM 0:00.06
nagios 46133 0.0 0.0 0 8 ?? DE 11:43PM 0:00.02
/usr/local/bin/nagios -d /usr/local/etc/nagios/nagios.cfg
Has anyone else run into this issue?
Jonathan Call
I've got one server that is getting tanked right now. (Load average in
the 50's) Is it worth it to rewrite the many perl scripts I have to use
Embedded Perl and the Nagios::Plugin CPAN module?
I'm speaking in terms of performance and also in terms of future Nagios
releases/compatibility.
Jonatha
I can't find it right now but someone sent out an email saying they
could not get Nagios 2.10 to run under FreeBSD 6.3. I've upgraded two
systems to FreeBSD 6.3, both are running Nagios 2.10 without any
problems. I didn't even have to recompile the port. I do have the
/etc/libmap.conf entries for l
I just installed Nagios 2.10 on a FreeBSD 6.3 server. It just has the
localhost config on it right now but it runs without any problems.
> -Original Message-
> From: Bernd Kuhlen [mailto:[EMAIL PROTECTED]
> Sent: Friday, January 18, 2008 9:24 AM
> To: Jonathan Call
> Cc: M
Did you try my libmap suggestion? I'd be surprised to learn that
something in FreeBSD 6.3 breaks Nagios. There just aren't that many
changes. I'm building 6.3 right now to find out though.
Jonathan
> -Original Message-
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] O
Sounds like the fork/vfork issue with FreeBSD's libpthread and Nagios.
The only solution I know of is to add the following to /etc/libmap.conf
and then do a stop/start of Nagios:
[nagios]
libpthread.so.2 libthr.so.2
libpthread.so libthr.so
This forces Nagios to use an a
> -Original Message-
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] On Behalf Of Alex French
> Sent: Friday, December 28, 2007 9:00 AM
> To: Chris Haulmark; nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Processes hanging - Nagios 3rc1 on FreeBSD
>
Hello Patrick
The Cisco 3560 and 3550 are very different from the Cisco 7200. You
cannot get an actual temperature value from them, just a temperature
state.
You'll need to use this SNMP Table:
ciscoEnvMonTemperatureState .1.3.6.1.4.1.9.9.13.1.3.1.6
Possible states are:
1:normal 2:warning
Are you aware of the fork/vfork issue between Nagios and the FreeBSD
pthread library? This may be causing your problem.
Try using these /etc/libmap.conf entries:
[nagios]
libpthread.so.2 libthr.so.2
libpthread.so libthr.so
You will need to restart Nagios for the settings
Sean;
I have a very large deployment so I use this tool:
http://www.nagioscommunity.org/wiki/index.php/OCP_Daemon
This daemon runs on each of the distributed servers while a normal ncsa
daemon listens on the central server.
Jonathan
> -Original Message-
> From: [EMAIL PROTECTED] [mail
I currently use nagiosweb (http://sourceforge.net/projects/nagiosweb/)
to maintain a Nagios configuration for a central server in mysql. Based
off of certain host groups I want to generate configuration files for
distributed Nagios servers for that central server.
Has anyone written code (for exam
> -Original Message-
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] On Behalf Of Douglas K. Rand
> Sent: Tuesday, June 19, 2007 3:16 PM
> To: Kyle Sexton
> Cc: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Problems with FreeBSD and Nagios
>
> Doug> T
SD and Nagios
>
> On Mon, Jun 18, 2007 at 06:42:18PM -0500, Kyle Sexton wrote:
> > On 12/14/06, Andreas Ericsson <[EMAIL PROTECTED]> wrote:
> > > Jonathan Call wrote:
> > > >
> > > > Given your ideas and some google work I seem to have found
ICMP) it kills
performance on the Nagios server.
Jonathan Call
-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Cli
t;
>
> > -Original Message-
> > From: [EMAIL PROTECTED]
[mailto:nagios-users-
> > [EMAIL PROTECTED] On Behalf Of Jonathan Call
> > Sent: Wednesday, May 02, 2007 10:07 AM
> > To: nagios-users@lists.sourceforge.net
> > Subject: Re: [Nagios-users] Pass
> -Original Message-
> From: Thomas Guyot-Sionnest [mailto:[EMAIL PROTECTED]
> Sent: Tuesday, May 01, 2007 4:29 PM
> To: Jonathan Call
> Cc: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Passive monitoring is running slow?
>
> On 01/05/07 05:15
I have set up a distributed monitoring system per the Nagios documentation.
I initially tested it out by having the distributed server monitor only 24 or
so services on about 8 hosts. There didn't seem to be any problems.
I then cranked it up to 427 services on 81 hosts. I'm watching the distrib
Is there some documentation somewhere that breaks down the possible
variables and options available to status.cgi?
For example, what are all the possible binary operands for
servicestatustypes or style?
I'm trying to create a Current Network Status view that will be more
appropriate for NOC peopl
one that stands out among the others?
Jonathan Call
Network Engineer - NTT/Verio
-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
onder
why they even bother maintaining it in ports?)
> -Original Message-
> From: Andreas Ericsson [mailto:[EMAIL PROTECTED]
> Sent: Thursday, December 14, 2006 2:26 AM
> To: Jonathan Call
> Cc: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Problems with FreeBSD and Na
r host check is hung? Or
how to deal with this problem at all?
Jonathan Call
Network Engineer - NTT/Verio
801.437.7476
-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you&
49 matches
Mail list logo