[Nagios-users] Sending an "all is well" status report

2008-01-24 Thread Patrick Kremer
I finally have Nagios working the way I want it and I'm about to replace our 
old What's Up Gold monitoring software. What's Up does have one feature that I 
like that I can't figure out how to reproduce with Nagios. It has a "recurring 
notification" feature that can send e-mail alerts containing the last few lines 
of the event log and "200 hosts up, 0 down". I like seeing that in my e-mail 
ever hour or two because it let me know that What's Up is still working 
correctly. I'd like a periodic status e-mail from Nagios if it's possible.

I'm not sure if I'm not searching using the correct terminology or if nobody 
really does this. I know I could do something like write a little shell script 
and check nagios.log periodically for auto-save entries, I'm just wondering if 
there is some builtin way.-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Problem with high latencies after going distributed

2008-01-24 Thread Thomas Guyot-Sionnest
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

On 24/01/08 05:13 PM, Frost, Mark {PBG} wrote:
>  
> 
>> -Original Message-
>> From: Thomas Guyot-Sionnest [mailto:[EMAIL PROTECTED] 
>>> I don't know how many people use OCP_Daemon but I had reports 
 >from a few
>>> people that greatly reduced their latency using it and I 
>>> haven't had any
>>> bug reported yet. I believe it's well documented as well, but If you
>>> have any feedback on this I'll be happy to get it.
>>>
 I'm playing with it a bit and have so far had good results. 
>> I'll have
 some
 feedback after I've played with it a bit longer.  Thanks 
>> for writing it
 and
 writing up the docs for it as well!
>> Pass the thanks over to Ethan who sent me a Nagios NSA t-shirt 
>> for it ;)
>>
>> Thomas
> 
> I can see that using the OCP Daemon script cut down on my latencies
> quite a lot.  Unfortunately,
> I'm still seeing some "stale" checks on the master server that I can't
> explain.  I'm starting to
> get the feeling that going distributed isn't all it's cracked up to be.
> I haven't seen mention in
> the docs of the caveats with oc[sh]p and latencies (my books sure don't
> mention it) and even the
> fact that the supplied submit_service_check script in the distribution
> from Ethan is a shell
> script that pipes to send_nsca.  I'm not all that excited about having
> to do a workaround
> for this issue.

Although I haven't been able to reproduce it on a test setup recently,
some time ago when we had a few passive checks running with Cron on all
servers we had problems with checks going stale. All the crons running
at the same time flodded Nagios with data and caused this behavior. I
didn't find the root cause at that time, but I found out that
artificially filling up the pipe while the checks were coming back in
clearly showed up the problem, although all NSCA were successfully
writing to the pipe as Nagios was reading it. I also had much less
passive checks than the default 1024 buckets in the Nagios buffer, to
this couldn't have been the issue. I reported it on Nagios-devel at that
time but haven't got any follow-ups.

I'm not sure if it only affects specific archs or have been fixed since
then, but that could be the problem you're having. OTOH the tests I ran
with OCP_Daemon were pretty hardcore, feeding it with hundreads of megs
of random check results as fast as possible, in different mode of
operation, and checking if all check results came back from the NSCA
daemon, so I'd be surprised it would have anything to do with that.

What you could try is:

1. Shutting off OC[HS]P on all but one server
2. Running -r0 to try to avoid batched results to send_nsca.

And check if it has any impact. Also make sure you have
command_check_interval=-1 in nagios.cfg on the central server.

BTW which version of Nagios and NSCA are you using?

Thomas
-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFHmVGZ6dZ+Kt5BchYRApudAJ48S8n2X24+iRbp6zNcoFTbgSOPVwCfS9IO
Otcnvi7iI6ACqOgiN/Q6ONI=
=m8rQ
-END PGP SIGNATURE-

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Howto monitor intermittent clients

2008-01-24 Thread Thomas Guyot-Sionnest
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

On 24/01/08 02:04 PM, Lars Stavholm wrote:
> Thomas Guyot-Sionnest wrote:
>> If you need to monitor many hosts, you will likely be more comfortable
>> with Nagios 3 as it can schedule host checks instead of running them in
>> serial, which can be a problem if many hosts are down.
> 
> I'm on latest 3.x with just a few hosts.
> 
>> Ideally use check_icmp instead of check_ping and fine-tune the
>> parameters so that it returns within one second on dead hosts. This will
>> make the host check be performed much faster. This is most useful in
>> Nagios 2 due to the limitation mentioned above.
> 
> I wouldn't know how to tweak it, but I'm trying "check_icmp -n 1"
> at the moment, and it seems to take about 1.08 to 1.15 seconds.
> Good enough maybe? Or can you give any further advice on tweaking
> check_icmp?

The important things are the critical thresholds and timeout value. The
timeout you specify is the maximum time the plugin will take before
returning, but it may take more if the number of packet * critical
threshold is higher. Try on an unresponsive IP that doesn't return back
ICMP errors (will depend on your network; you can try IPs like
10.10.10.10, 1.2.3.4, or an unused IP on your *local* network not going
trough the default gateway).

That's not a big deal since you're using Nagios 3, but if it ever find
you're having too many concurrent processes or too running host on
system time, reducing the timeout will certainly help.

Thomas
-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFHmU056dZ+Kt5BchYRArS/AJ9r134P1BggxmF1VPaKNLkkvYYgtgCfftCq
9q2hFw/njEs3275yBwnHWdQ=
=l2lC
-END PGP SIGNATURE-

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Problem with Send_NSCA on Windows

2008-01-24 Thread Anthony Montibello
There is also writing a BAT file to do it If you already have the file
created,
This is How I Test NSCA is running when my test server is up from the
command line.
something like:
send_nsca.exe From C#  Start a Process that is the Bat file,
once again, readup in the msdn library about processes and redirecting the
standard INput  from consol  to a filestream. both consol and filestream
inherit from iostream and thus are accepted as io for processes
I thought I mentioned this in the last message?

tony (Author of NC_Net)
On Jan 24, 2008 1:28 AM, <[EMAIL PROTECTED]> wrote:

>
>
> Hi,
>
> I know how to echo a message on the console . my problem is of
> redirecting the string to Send_Nsca.exe , which needs to be sent to the
> Nagios Server for the passive service check.
>
>
> Any mechanism for doing it will be helpfull.
> When I tried redirection through a file to the  SendNsca.exe, it gave me
> the message "0 packets Sent  to the Nagios Server".
>
> @ Anthony
> thanks for the Pointer to NC_Net. Its a good tool and seems to be doing
> more than I need for at the moment.
>
> Thanks
> Ankit
>
>
>
>
>
>   *"Anthony Montibello" <[EMAIL PROTECTED]>*
>
> 01/24/2008 06:26 AM
>To
> [EMAIL PROTECTED]  cc
> "Nagios Users"   Subject
> Re: [Nagios-users] Problem with Send_NSCA on Windows
>
>
>
>
> Actually C# Does have echo,  try
> System.Console.WriteLine("ECHO TO STANDARD IO");
> you can also write to standard error
> and you can also redirect IO streams,
>
> C# give pleanty of ways for doing this, since it is a full development
> language and the Dot NEt Framework offers pleanty of classes, Go to MSDN and
> read throught the Library. particullary the  IO,Console, and posibly the
> Process class depending on how you want to accomplish things.
>
> Using C# NC_Net can feed an IO into  Win32_Send_NSCA.exe
> I also wrote my own version of SendNSCA in C# (internally in NC_NEt)
> doing that is a bit too complicated to discuss, but to oversimpify things:
> To reimplement SendNSCA just convert the Send_nsca.c into C#
> FOr commiunicating with Win32SEnd_Nsca read up on running processes
> Asyncronous and Syncronously. (spelling may be off)
>
> Have you tried using NC_NEt it provides both Active and Passive checks in
> a single windows plugin client It also has the ability to run other Scripts
> loaded into its script folder.
> IT is availible on sourceforge: 
> *http://sourceforge.net/projects/nc-net*
>
> TOny (Author of NC_Net)
>
> On Jan 23, 2008 8:13 AM, <[EMAIL PROTECTED]<[EMAIL PROTECTED]>>
> wrote:
>
> Hi,
>
> I am trying to monitor windows machines passively.
> For the same, I am writing some code in C#.
>
> The problem is, C# does not support anything like echo command(The way we
> used to do it on Linux).
> I have tried to redirect the output from a file to the send_nsca exe, but
> to no avail.
>
> Can you suggest some formats in which the information can be passed to the
> send_nsca exe ?
>
> Thanks
> Ankit
>
> =-=-=
> Notice: The information contained in this e-mail
> message and/or attachments to it may contain
> confidential or privileged information. If you are
> not the intended recipient, any dissemination, use,
> review, distribution, printing or copying of the
> information contained in this e-mail message
> and/or attachments to it are strictly prohibited. If
> you have received this communication in error,
> please notify us by reply e-mail or telephone and
> immediately and permanently delete the message
> and any attachments. Thank you
>
>
>
>
> -
> This SF.net email is sponsored by: Microsoft
> Defy all challenges. Microsoft(R) Visual Studio 2008.*
> **http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/*
> ___
> Nagios-users mailing list*
> [EMAIL PROTECTED] 
> *
> **https://lists.sourceforge.net/lists/listinfo/nagios-users*
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>
> ForwardSourceID:NT0001D2C6
>
> =-=-=
> Notice: The information contained in this e-mail
> message and/or attachments to it may contain
> confidential or privileged information. If you are
> not the intended recipient, any dissemination, use,
> review, distribution, printing or copying of the
> information contained in this e-mail message
> and/or attachments to it are strictly prohibited. If
> you have received this communication in error,
> please notify us by reply e-mail or telephone and
> immediately and permanently delete the message
> and any attachments. Thank you
>
>
>
>
-
This SF.net email is sponsored by: Microsoft
Defy a

Re: [Nagios-users] Problem with high latencies after going distributed

2008-01-24 Thread Sean McAvoy
Hi Mark,
I have been having similar problems with my distributed setup. The  
OCSP daemon reduced the latency in returning check results greatly,  
but I still am seeing (seemingly) random services go stale. I'm still  
try to track down the problem, recreating it on a small scale has so  
far been unsuccessful. I will let you (and the list) know how my  
investigations go.
As for determining the execution time of a particular check, this can  
be found in retention.dat. The field is check_execution_time=


On 24-Jan-08, at 5:13 PM, Frost, Mark {PBG} wrote:

>
>
>> -Original Message-
>> From: Thomas Guyot-Sionnest [mailto:[EMAIL PROTECTED]
>> Sent: Thursday, January 24, 2008 3:33 AM
>> To: Frost, Mark {PBG}
>> Cc: Nagios Users
>> Subject: Re: [Nagios-users] Problem with high latencies after
>> going distributed
>>
>> -BEGIN PGP SIGNED MESSAGE-
>> Hash: SHA1
>>
>> Some heavily broken intending there (looks like my mail client gets
>> confused)... don't trust the number of ">"!
>>
>> On 23/01/08 10:47 PM, Frost, Mark {PBG} wrote:
>>>
>>>
 -Original Message-
 From: Thomas Guyot-Sionnest [mailto:[EMAIL PROTECTED]
 Sent: Wednesday, January 23, 2008 10:24 PM
 To: Frost, Mark {PBG}
 Cc: Nagios Users
 Subject: Re: [Nagios-users] Problem with high latencies after
 going distributed
>>> I don't think so. I remember an email from Ton Voon some time
>>> ago asking
>>> Ethan why the oc[hs]p command are run serially but I don't recall if
>>> there was a reply or what else was said...
>>>
>>> I believe it's either documented in the official doc or some
>>> user-contributed doc that the oc[hs]p commands should return
>> as soon as
>>> possible. It's usually done in Perl using a fork:
>>>
>>> if (fork==0) {
>>>  # send stuff via NSCA here...
>>> }
>>> exit(0);
>>>
>>>
 I guess what I'm thinking here is that unlike a custom
>> check, I can't
 see most
 people needing to customize the passive check result
>> process.  All the
 solutions I've
 seen seem to include a named pipe.  So why couldn't Nagios support
 making the ocsp/ochp
 "commands" just named pipes instead.   Then instead of a standalone
 send_nsca binary,
 have the nsca source build a send_nscaD binary (I'm making
>> that up) that
 reads from the
 pipe that nagios writes to and sends directly to nsca on the  
 server.
 That sort of
 eliminates the middle-man in the process of reporting passive check
 results.
>>>
 I know, I know, I'm free to write the send_nscaD.c code and
>> send it to
 Ethan :-)
>>
>> Well... I was thinking about partly re-writing nsca as an event-based
>> daemon (supporting only the --single mode, but that would be really
>> scalable) using libevent, allowing to pass along the timestamp
>> (this is
>> a recent feature request) and supporting multi-line responses (for
>> Nagios 3) in the process, and finally suggesting this as a base for a
>> NSCA v3... I'm not even sure if I would have enough time but since my
>> main objective it to learn I wouldn't loose anything trying :).
>>
>> In the unlikely event that I write it, In the same step I could  
>> surely
>> to a C version of OCP_Daemon supporting natively the "NSCA v3"  
>> protocol
>> (it wouldn't be hard)...
>>
>> I'll have to think about it... I quess the only sane separator to  
>> write
>> multiple multi-line results on a pipe would be \000 (NULL), so there
>> would be 3 mode of operation for send_nsca (and two for nsca_sendd
>> (don't you think it sounds better reversed?)):
>> send_nsca: compatible (v2 behavior), Single check (additional  
>> lines are
>> taken as additional output) and multi-check (NULL separated)
>> nsca_sendd: single-line (one check/line, OCP_Daemon style) and
>> multi-line "NULL-separated).
>>
>>> I don't know how many people use OCP_Daemon but I had reports
> from a few
>>> people that greatly reduced their latency using it and I
>>> haven't had any
>>> bug reported yet. I believe it's well documented as well, but If you
>>> have any feedback on this I'll be happy to get it.
>>>
 I'm playing with it a bit and have so far had good results.
>> I'll have
 some
 feedback after I've played with it a bit longer.  Thanks
>> for writing it
 and
 writing up the docs for it as well!
>>
>> Pass the thanks over to Ethan who sent me a Nagios NSA t-shirt
>> for it ;)
>>
>> Thomas
>
> I can see that using the OCP Daemon script cut down on my latencies
> quite a lot.  Unfortunately,
> I'm still seeing some "stale" checks on the master server that I can't
> explain.  I'm starting to
> get the feeling that going distributed isn't all it's cracked up to  
> be.
> I haven't seen mention in
> the docs of the caveats with oc[sh]p and latencies (my books sure  
> don't
> mention it) and even the
> fact that the supplied submit_service_check script in the distribution
> from Ethan is a shell
> script that pipes to send_nsca.  I'm not all that exci

Re: [Nagios-users] Problem with high latencies after going distributed

2008-01-24 Thread Frost, Mark {PBG}
 

>-Original Message-
>From: Thomas Guyot-Sionnest [mailto:[EMAIL PROTECTED] 
>Sent: Thursday, January 24, 2008 3:33 AM
>To: Frost, Mark {PBG}
>Cc: Nagios Users
>Subject: Re: [Nagios-users] Problem with high latencies after 
>going distributed
>
>-BEGIN PGP SIGNED MESSAGE-
>Hash: SHA1
>
>Some heavily broken intending there (looks like my mail client gets
>confused)... don't trust the number of ">"!
>
>On 23/01/08 10:47 PM, Frost, Mark {PBG} wrote:
>>  
>> 
>>> -Original Message-
>>> From: Thomas Guyot-Sionnest [mailto:[EMAIL PROTECTED] 
>>> Sent: Wednesday, January 23, 2008 10:24 PM
>>> To: Frost, Mark {PBG}
>>> Cc: Nagios Users
>>> Subject: Re: [Nagios-users] Problem with high latencies after 
>>> going distributed
>> I don't think so. I remember an email from Ton Voon some time 
>> ago asking
>> Ethan why the oc[hs]p command are run serially but I don't recall if
>> there was a reply or what else was said...
>> 
>> I believe it's either documented in the official doc or some
>> user-contributed doc that the oc[hs]p commands should return 
>as soon as
>> possible. It's usually done in Perl using a fork:
>> 
>> if (fork==0) {
>>  # send stuff via NSCA here...
>> }
>> exit(0);
>> 
>> 
>>> I guess what I'm thinking here is that unlike a custom 
>check, I can't
>>> see most
>>> people needing to customize the passive check result 
>process.  All the
>>> solutions I've
>>> seen seem to include a named pipe.  So why couldn't Nagios support
>>> making the ocsp/ochp
>>> "commands" just named pipes instead.   Then instead of a standalone
>>> send_nsca binary,
>>> have the nsca source build a send_nscaD binary (I'm making 
>that up) that
>>> reads from the
>>> pipe that nagios writes to and sends directly to nsca on the server.
>>> That sort of
>>> eliminates the middle-man in the process of reporting passive check
>>> results.
>> 
>>> I know, I know, I'm free to write the send_nscaD.c code and 
>send it to
>>> Ethan :-)
>
>Well... I was thinking about partly re-writing nsca as an event-based
>daemon (supporting only the --single mode, but that would be really
>scalable) using libevent, allowing to pass along the timestamp 
> (this is
>a recent feature request) and supporting multi-line responses (for
>Nagios 3) in the process, and finally suggesting this as a base for a
>NSCA v3... I'm not even sure if I would have enough time but since my
>main objective it to learn I wouldn't loose anything trying :).
>
>In the unlikely event that I write it, In the same step I could surely
>to a C version of OCP_Daemon supporting natively the "NSCA v3" protocol
>(it wouldn't be hard)...
>
>I'll have to think about it... I quess the only sane separator to write
>multiple multi-line results on a pipe would be \000 (NULL), so there
>would be 3 mode of operation for send_nsca (and two for nsca_sendd
>(don't you think it sounds better reversed?)):
>send_nsca: compatible (v2 behavior), Single check (additional lines are
>taken as additional output) and multi-check (NULL separated)
>nsca_sendd: single-line (one check/line, OCP_Daemon style) and
>multi-line "NULL-separated).
>
>> I don't know how many people use OCP_Daemon but I had reports 
from a few
>> people that greatly reduced their latency using it and I 
>> haven't had any
>> bug reported yet. I believe it's well documented as well, but If you
>> have any feedback on this I'll be happy to get it.
>> 
>>> I'm playing with it a bit and have so far had good results. 
> I'll have
>>> some
>>> feedback after I've played with it a bit longer.  Thanks 
>for writing it
>>> and
>>> writing up the docs for it as well!
>
>Pass the thanks over to Ethan who sent me a Nagios NSA t-shirt 
>for it ;)
>
>Thomas

I can see that using the OCP Daemon script cut down on my latencies
quite a lot.  Unfortunately,
I'm still seeing some "stale" checks on the master server that I can't
explain.  I'm starting to
get the feeling that going distributed isn't all it's cracked up to be.
I haven't seen mention in
the docs of the caveats with oc[sh]p and latencies (my books sure don't
mention it) and even the
fact that the supplied submit_service_check script in the distribution
from Ethan is a shell
script that pipes to send_nsca.  I'm not all that excited about having
to do a workaround
for this issue.

While the OCP_Daemon seems to help me, I'm a little uncomfortable
running it as a solution to our
issue.  First, we don't normally have root access on our boxes so
recreating the FIFOs could be
a problem (or at least a wait).  I'm also concerned about requiring
another process external to
Nagios as part of the process.  If OCP_Daemon dies at some point, my
distributed nodes are hosed.
I had a few issues with correctly starting Nagios and OCP_Daemon in the
right order when playing
with it last night.  Once I got it all going, it worked well but I'm
thinking of having to explain
this to someone here who isn't the Nagios person.

I was thinking of your fork/exec comment above.  What 

Re: [Nagios-users] Determining the success or failure ofexternalcommands

2008-01-24 Thread Marc Powell


> -Original Message-
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] On Behalf Of Scott Sanders
> Sent: Thursday, January 24, 2008 11:01 AM
> To: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Determining the success or failure
> ofexternalcommands
> 

> It seems like some effort was put into better error handling in the
> external command algorithm, but the changes don't seem to actually
have
> any effect. I'm curious as to why this is, as it affects the code I am
> adding to track the progress of an external command from submission to
> completion.

99% of us are nagios users like yourself. We can speculate and some even
have a little inside knowledge about parts of nagios but Ethan, the
primary developer and sole maintainer, doesn't watch this list. If you
have questions about why something is the way it is you'll probably have
better luck on nagios-devel.

--
Marc


-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Just a general concensus

2008-01-24 Thread Brian Loe
what's the view on Opsview? Is it just, more or less, a wrapper for
Nagios? Easier or more difficult to manage than plane Nagios? Addons
like Nagvis work with it (or is Nagvis unnecessary?)?

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Howto monitor intermittent clients

2008-01-24 Thread Lars Stavholm
Thomas Guyot-Sionnest wrote:
> -BEGIN PGP SIGNED MESSAGE-
> Hash: SHA1
> 
> On 19/01/08 05:42 AM, Hugo van der Kooij wrote:
>> Lars Stavholm wrote:
>> | Hi All,
>> |
>> | I know this is a bit of an odd one, but still:
>> |
>> | I'm looking for ways of monitoring intermittent
>> | clients, i.e. client computers that are not always
>> | there, like laptops that come and go or similar,
>> | clients that gets switched off after work hours.
>> |
>> | Ideally, without installing anything on the clients,
>> | so that sort of rules out passive checking. Instead,
>> | I would like to use active checking (e.g. using SNMP),
>> | but configured in a way so that when a host is unreachable
>> | or down, that's OK, and the services needn't be checked.
>> | Whereas if the client host is on line, all defined
>> | services should be checked and issue alarms and so on
>> | if check results in CRITICAL or WARNING, as per usual.
>>
>> Make sure you can ping them. Then setup a service to do that at a
>> relative high rate. And make all other services dependent on that ping
>> service. Then say you do not care about unreachable services.
>>
>> Just make sure that ping reaches a hard state before any service can
>> reach a hard state.
> 
> You don't even need that. Obviously you won't need to know then these
> systems are down, so just set the "notification_options" to "n" in the
> host setup and you'll never get paged when the host is down.

Good one. Implemented.

> Nagios check logic is to make sure a host is up before sending a service
> alert. If the host is down, you get HOST DOWN notification instead of
> any service alert, and obviously id you disable host notifications
> you'll get nothing.

Very good.

> Take note of the following:
> 
> If you need to monitor many hosts, you will likely be more comfortable
> with Nagios 3 as it can schedule host checks instead of running them in
> serial, which can be a problem if many hosts are down.

I'm on latest 3.x with just a few hosts.

> Ideally use check_icmp instead of check_ping and fine-tune the
> parameters so that it returns within one second on dead hosts. This will
> make the host check be performed much faster. This is most useful in
> Nagios 2 due to the limitation mentioned above.

I wouldn't know how to tweak it, but I'm trying "check_icmp -n 1"
at the moment, and it seems to take about 1.08 to 1.15 seconds.
Good enough maybe? Or can you give any further advice on tweaking
check_icmp?

> The tricky part is host coming back up. On some systems the network
> comes up before all services are up so you'll get notifications for
> these services. There's no easy way around that, but personally I'd look
> into event handlers.

Seems like a good idea. I have a feeling that
the new "Adaptive Monitoring" might be useful.
I will try that in the next step of testing.

Thanks for your help, I'll report back to this
list on my progress.

Thanks
/Lars

> You could set-up an event handler for hosts and:
> 
> - - When a "volatile" host goes HARD DOWN, disable notifications for all
> services by sending the appropriate command to the host.
> - - When a "volatile" host goes HARD UP, sleep 1 or 2 minutes then enable
> notifications for all services. You will probably have to fork before
> any sleep in your event handler to avoid blocking Nagios...
> 
> References:
> http://nagios.sourceforge.net/docs/2_0/notifications.html
> http://nagios.sourceforge.net/docs/2_0/checkscheduling.html
> http://nagios.sourceforge.net/docs/2_0/eventhandlers.html
> http://www.nagios.org/developerinfo/externalcommands/commandlist.php
> 
> Thomas
> -BEGIN PGP SIGNATURE-
> Version: GnuPG v1.4.6 (GNU/Linux)
> Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
> 
> iD8DBQFHkmrK6dZ+Kt5BchYRAmtUAKCYE4Zu30G7TM10yUXz9oHyZ0HGWACg6s0Y
> rbSKYe/x2Q/oyU+Riz7GVyQ=
> =jaja
> -END PGP SIGNATURE-
> 
> -
> This SF.net email is sponsored by: Microsoft
> Defy all challenges. Microsoft(R) Visual Studio 2008.
> http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting 
> any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null
> 



-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info wi

Re: [Nagios-users] check host_name or IP

2008-01-24 Thread Marc Powell


> -Original Message-
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] On Behalf Of [EMAIL PROTECTED]
> Sent: Thursday, January 24, 2008 12:46 PM
> To: nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] check host_name or IP
> 
> 99% of the time I can check and IP but sometimes I need to check the
> host_name.
> I tried:
> command_line$USER1$/check_http --hostname=$HOST_NAME$ -u$ARG1$
-r$ARG2
> $

You mean $HOSTNAME$. This also means that host_name in your host{}
definition much be fully qualified or otherwise resolvable.

http://nagios.sourceforge.net/docs/2_0/macros.html#hostname

--
Marc

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] check host_name or IP

2008-01-24 Thread bjlockie
99% of the time I can check and IP but sometimes I need to check the
host_name.
I tried:
command_line$USER1$/check_http --hostname=$HOST_NAME$ -u$ARG1$ -r$ARG2
$
instead of:
command_line$USER1$/check_http --hostname=$HOSTADDRESS$ -u$ARG1$ -r$AR
G2$
but it didn't work.



-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Perf Data Issue

2008-01-24 Thread Joerg Linge
Paul Wilson schrieb:
> Hi All,

Hi Paul,

> Firstly, thanks for everyone involved with Nagios, its a fantastic
> piece of softwarel!
> 
> My problem is that I cannot enable performance data from my monitors.
> I'm running the latest version of nagios2 which I downloaded a week
> ago, and compiled it with the default perfdata setting.
> 
> I have also installed pnp successfully, however it tells me that there
> are no valid rrdfiles in the given host's directory.
> 
> I've set up the follow configuration options in my nagios.cfg:
> 
> process_performance_data=1

OK

> host_perfdata_command=process-host-perfdata
> service_perfdata_command=process-service-perfdata

OK

> And in my commands.cfg
> 
> define command {
>command_nameprocess-host-perfdata
> #   command_line/usr/bin/perl
> /usr/local/nagios/libexec/process_perfdata.pl -d HOSTPERFDATA
> command_line/bin/echo -e
> "$LASTCHECK$\t$HOSTNAME$\t$HOSTSTATE$\t$HOSTATTEMPT$\t$STATETYPE$\t$EXECUTIONTIME$\t$OUTPUT$\t$PERFDATA$"
>>> /usr/local/nagios/var/host-perfdata

Do you use regular host checks with Nagios 2.x ? Tis is not the best 
Idea i think.

> Notice I've commented out the process_perfata.pl line in favour of a
> simple echo to a file, just for testing.
> 
> I have also restarted the nagios process to take into account my changes.
> 
> I have a host set up to monitor with the check_icmp command:
> 
> -rwsr-xr-x 1 root root 30524 2007-10-22 15:17 check_icmp
> 
> A simple check determines that it's returning performance data:
> ./check_icmp -H 128.1.2.181
>   OK - 128.1.2.181: rta 0.451ms, lost
> 0%|rta=0.451ms;200.000;500.000;0; pl=0%;40;80;;

OK, check_icmp looks good.

> And the service command looks like:
> 
> define service{
> use   generic-service
> host_nameVirgin.Invader; All our
> boxes are named after trains
> service_description ICMP Echo Stats
> check_commandcheck_icmp
> }
> 

Is "process_perf_data 1" set in your service definition or templates ?


> Within Nagios I get the status printed without problems, but no
> /usr/local/nagios/var/host-perfdata exists.

Thats OK. PNP dont need this file if your using the PNP Default Mode.

> I've checked books and online articles for a solution but I guess I
> must be doing something fundimentally wrong.

First choose the right mode for your installation.
http://www.pnp4nagios.org/pnp/modes

Configure PNP based on http://www.pnp4nagios.org/pnp/config

and check the function as described on

http://www.pnp4nagios.org/pnp/verify

Kind regards,
Jörg

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] check_http returning success full results on aurlwhich is down

2008-01-24 Thread Marc Powell


> -Original Message-
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] On Behalf Of Darren McMillan
> Sent: Thursday, January 24, 2008 11:13 AM
> To: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] check_http returning success full results
on
> aurlwhich is down
> 
> Hi Mark, All,
> 
> First thanks for the quick response I appreciate your help.
> 
> The test harness is run by a simple batch script on each client
machine,
> if I execute it the harness is on & I can verify that with the
following
> command from command line, using the -s that you suggested.
> 
>  ./check_http  -I 10.77.40.123 -p 2000 -s "QA-testHarness"
> 
> HTTP OK HTTP/1.1 200 OK - 0.340 second response time
> |time=0.339870s;;;0.00 size=5505B;;;0
> 
> If i close the harness I correctly get the following response with the
> same command
> 
> Connection refused
> HTTP CRITICAL - Unable to open TCP socket

Good.

> I've edited my commands.cfg to include the -s
> 
> command_line $USER1$/check_http -H $HOSTADDRESS$ -p $ARG1$ -u $ARG2$
-s
> $ARG3$
> 
> Within my windows.cfg I have altered check command for check_http to
> include the -s.
> 
> check_commandcheck_http!2000!/test/results!QA-testHarness
> 
> Please correct me if I am wrong with either of these two, I'm still
> getting to grips with the proper syntax to use here.

Yes, the ARGx use is correct now. I assume that when it's working, your
test harness actually displays the text 'QA-testHarness' in the page.
 
> So I do a pre-flight check & reboot nagios, then via the web interface
I
> force a service check on check_http with the harness down so I should
get
> an error, however it still passes successfully.
> 
> My apache logs show no errors, and the following for the checks.
> 
> 127.0.0.1 - nagiosadmin [24/Jan/2008:16:56:01 +] "GET /nagios/cgi-
> bin/extinfo.cgi?type=2&host=PONGO&service=HTTP HTTP/1.1" 200 9709
> "http://localhost/nagios/cgi-bin/status.cgi?host=all"; "Mozilla/5.0
(X11;
> U; Linux i686; en-US; rv:1.8.1.3) Gecko/20070417 Fedora/2.0.0.3-4.fc7
> Firefox/2.0.0.3"

You want to check the web server logs on the machine running your test
harness. That's the server returning the 200 OK that nagios is properly
accepting as successful. You'll want to verify that the request is
actually for the host/port/url you think it is.

However, based on what you've shown above and your previous post, I
suspect that you have multiple copies of nagios running, one with an old
configuration and one with the new. Stop nagios and make sure it's
really stopped, then start again.

--
Marc


-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Perf Data Issue

2008-01-24 Thread Paul Wilson
Hi All,

Firstly, thanks for everyone involved with Nagios, its a fantastic
piece of softwarel!

My problem is that I cannot enable performance data from my monitors.
I'm running the latest version of nagios2 which I downloaded a week
ago, and compiled it with the default perfdata setting.

I have also installed pnp successfully, however it tells me that there
are no valid rrdfiles in the given host's directory.

I've set up the follow configuration options in my nagios.cfg:

process_performance_data=1

host_perfdata_command=process-host-perfdata
service_perfdata_command=process-service-perfdata

And in my commands.cfg

define command {
   command_nameprocess-host-perfdata
#   command_line/usr/bin/perl
/usr/local/nagios/libexec/process_perfdata.pl -d HOSTPERFDATA
command_line/bin/echo -e
"$LASTCHECK$\t$HOSTNAME$\t$HOSTSTATE$\t$HOSTATTEMPT$\t$STATETYPE$\t$EXECUTIONTIME$\t$OUTPUT$\t$PERFDATA$"
>> /usr/local/nagios/var/host-perfdata

Notice I've commented out the process_perfata.pl line in favour of a
simple echo to a file, just for testing.

I have also restarted the nagios process to take into account my changes.

I have a host set up to monitor with the check_icmp command:

-rwsr-xr-x 1 root root 30524 2007-10-22 15:17 check_icmp

A simple check determines that it's returning performance data:
./check_icmp -H 128.1.2.181
  OK - 128.1.2.181: rta 0.451ms, lost
0%|rta=0.451ms;200.000;500.000;0; pl=0%;40;80;;

And the service command looks like:

define service{
use   generic-service
host_nameVirgin.Invader; All our
boxes are named after trains
service_description ICMP Echo Stats
check_commandcheck_icmp
}

Within Nagios I get the status printed without problems, but no
/usr/local/nagios/var/host-perfdata exists.

I've checked books and online articles for a solution but I guess I
must be doing something fundimentally wrong.

Any help would be massively appreciated!?

Thanks,
Paul

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] check_http returning success full results on a urlwhich is down

2008-01-24 Thread Darren McMillan
Hi Mark, All,

First thanks for the quick response I appreciate your help.

The test harness is run by a simple batch script on each client machine, if I 
execute it the harness is on & I can verify that with the following command 
from command line, using the -s that you suggested.

 ./check_http  -I 10.77.40.123 -p 2000 -s "QA-testHarness"

HTTP OK HTTP/1.1 200 OK - 0.340 second response time |time=0.339870s;;;0.00 
size=5505B;;;0

If i close the harness I correctly get the following response with the same 
command

Connection refused
HTTP CRITICAL - Unable to open TCP socket

I've edited my commands.cfg to include the -s 

command_line$USER1$/check_http -H $HOSTADDRESS$ -p $ARG1$ -u $ARG2$ -s 
$ARG3$

Within my windows.cfg I have altered check command for check_http to include 
the -s.

check_commandcheck_http!2000!/test/results!QA-testHarness

Please correct me if I am wrong with either of these two, I'm still getting to 
grips with the proper syntax to use here.

So I do a pre-flight check & reboot nagios, then via the web interface I force 
a service check on check_http with the harness down so I should get an error, 
however it still passes successfully.

My apache logs show no errors, and the following for the checks.

127.0.0.1 - nagiosadmin [24/Jan/2008:16:56:01 +] "GET 
/nagios/cgi-bin/extinfo.cgi?type=2&host=PONGO&service=HTTP HTTP/1.1" 200 9709 
"http://localhost/nagios/cgi-bin/status.cgi?host=all"; "Mozilla/5.0 (X11; U; 
Linux i686; en-US; rv:1.8.1.3) Gecko/20070417 Fedora/2.0.0.3-4.fc7 
Firefox/2.0.0.3"
127.0.0.1 - nagiosadmin [24/Jan/2008:16:56:06 +] "GET 
/nagios/cgi-bin/cmd.cgi?cmd_typ=7&host=PONGO&service=HTTP&force_check HTTP/1.1" 
200 3067 
"http://localhost/nagios/cgi-bin/extinfo.cgi?type=2&host=PONGO&service=HTTP"; 
"Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1.3) Gecko/20070417 
Fedora/2.0.0.3-4.fc7 Firefox/2.0.0.3"
127.0.0.1 - nagiosadmin [24/Jan/2008:16:56:08 +] "POST 
/nagios/cgi-bin/cmd.cgi HTTP/1.1" 200 1230 
"http://localhost/nagios/cgi-bin/cmd.cgi?cmd_typ=7&host=PONGO&service=HTTP&force_check";
 "Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1.3) Gecko/20070417 
Fedora/2.0.0.3-4.fc7 Firefox/2.0.0.3"
127.0.0.1 - nagiosadmin [24/Jan/2008:16:56:11 +] "GET 
/nagios/cgi-bin/extinfo.cgi?type=2&host=PONGO&service=HTTP HTTP/1.1" 200 9709 
"http://localhost/nagios/cgi-bin/status.cgi?host=all"; "Mozilla/5.0 (X11; U; 
Linux i686; en-US; rv:1.8.1.3) Gecko/20070417 Fedora/2.0.0.3-4.fc7 
Firefox/2.0.0.3"
127.0.0.1 - nagiosadmin [24/Jan/2008:16:56:16 +] "GET 
/nagios/cgi-bin/status.cgi?host=all HTTP/1.1" 200 100331 
"http://localhost/nagios/side.html"; "Mozilla/5.0 (X11; U; Linux i686; en-US; 
rv:1.8.1.3) Gecko/20070417 Fedora/2.0.0.3-4.fc7 Firefox/2.0.0.3"

Thanks for your time & patience,

Darren.


Marc Powell <[EMAIL PROTECTED]> wrote: 

> -Original Message-
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] On Behalf Of Darren McMillan
> Sent: Wednesday, January 23, 2008 10:51 AM
> To: nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] check_http returning success full results on a
> urlwhich is down
> 
> Hi All,
> 
> I'm currently running Nagios 3.0 on a Fedora 7 box, with the latest
plug-
> in package.
> 
> I'm trying to configure the check_http plug-in to check a test harness
> running on a Windows box which can be accessed via a web interface.
The
> problem is that I have brought the test harness down, so that when you
> manually connect to it via a browser
(http://hostname:2000/test/results)
> it will return a page cannot be displayed error as expected.  However
the
> check_http is still responding with a success "HTTP OK HTTP/1.1 200 OK
-
> 5029 bytes in 0.004 seconds".
> 

> define command{
> command_namecheck_http
> command_line$USER1$/check_http -H $HOSTADDRESS$ -p $ARG1$ -u
> $ARG2$
> }
> 
> I have set this as a service in the windows.cfg as follows
> 
> define service{
> use generic-service
> host_name  PONGO
> service_description HTTP
> check_command
check_http!pongo!2000!/test/results
> notifications_enabled  0
> }

This should be failing loudly. It can't be what nagios is actually
using. The command being executed by nagios would be --

/path/to/plugins/check_http -H  -p pongo -u 2000

$ ./check_http -H 172.27.0.101 -p pongo -u 2000
check_http: Invalid port number - pongo


> Can anyone spot why this is not failing even though I've closed the
URL
> it's trying to access?

It's getting a 200 (successful) response code from your webserver.
You're not testing what you think you're testing.

Check your web server logs to see what nagios really is checking. You
might also be interested in the -s option to verify output from your
webserver.

--
Marc


-
This SF.net email is sponsored by: Microsoft
Defy 

[Nagios-users] Error: Cannot open main configuration file '/usr/local/nagios/etc/nagios.cfg' for reading!

2008-01-24 Thread Scott McCoy
When using the Fedora Quickstart, I get to step 7 and type 
/usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg and I 
get the following error

Error: Cannot open main configuration file 
'/usr/local/nagios/etc/nagios.cfg' for reading!

I have check the permissions on /usr/local/nagios/etc/nagios.cfg and 
nagios  owner's the file.

[EMAIL PROTECTED] smccoy]# ls -l /usr/local/nagios/etc/
total 68
-rw-rw-r-- 1 nagios nagios 10459 Jan 24 09:47 cgi.cfg
-rw-r--r-- 1  root   root  26 Jan 24 09:34 htpasswd.users
-rw-r--r-- 1  nagios nagios 42466 Jan 24 09:54 nagios.cfg
drwxrwxr-x 2 nagios nagios  4096 Jan 24 09:47 objects
-rw-rw 1 nagios nagios  1341 Jan 24 09:47 resource.cfg

Thanks



-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Determining the success or failure of externalcommands

2008-01-24 Thread Scott Sanders
I have been digging though commands.c to see what I can find out. One thing
that stands out is that Nagios 2.x calls process_external_command() which
has a void return, and Nagios 3.x calls process_external_command1() which
returns an int. It seems the 3.x implementation is, in fact, checking for
errors in the command its attempting to process and will return -2 if it
encounters any problems. This would be quite helpful in determining the
status of a submitted command, but unfortunately the return value of both
check_external_command1() and check_external_command2() is never considered
by the calling function. Because of this, commands that are malformed or
have other issues are silently ignored and check_for_external_commands()
will always return OK. It seems like some effort was put into better error
handling in the external command algorithm, but the changes don't seem to
actually have any effect. I'm curious as to why this is, as it affects the
code I am adding to track the progress of an external command from
submission to completion.

Thanks,

-Scott

On 1/23/08, Marc Powell <[EMAIL PROTECTED]> wrote:
>
>
>
> > -Original Message-
> > From: [EMAIL PROTECTED] [mailto:nagios-users-
> > [EMAIL PROTECTED] On Behalf Of Scott Sanders
> > Sent: Wednesday, January 23, 2008 8:39 AM
> > To: nagios-users@lists.sourceforge.net
> > Subject: Re: [Nagios-users] Determining the success or failure of
> > externalcommands
> >
>
>
> > nagios.cmd pipe. In order for the external command functionality to
> work
> > the way I intend, I need to obtain a response from nagios so my server
> can
> > respond back to the client with the results of processing the external
> > command. I am really hoping there is an easy way to do this, as
> replacing
> > the nagios.cmd file with something besides a FIFO pipe requires
> > modification to the nagios source and is something I was trying to
> avoid.
>
> Nagios was never designed to provide status back to a script submitting
> a command to the external command pipe because it is not a real-time
> operation and the commands should always succeed (why wouldn't they?).
> The command may not be read for quite some time after it's submitted,
> depending on what nagios is doing at the time.
>
> As for logging, Nagios assumes that all external commands succeed and
> logs nothing unless the command is malformed. There is additional
> logging that is available by running nagios in DEBUG3 but that still
> doesn't indicate success or failure. commands.c handles external
> commands. I'd suggest you start there and add in what additional logging
> you feel you need.
>
> --
> Marc
>
>
> -
> This SF.net email is sponsored by: Microsoft
> Defy all challenges. Microsoft(R) Visual Studio 2008.
> http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] flap_detection_enabled persistenceacrossreload/restart?

2008-01-24 Thread Tom Throckmorton
On Jan 24 09:42, Marc Powell wrote:
> 
> 
> > -Original Message-
> > From: [EMAIL PROTECTED] [mailto:nagios-users-
> > [EMAIL PROTECTED] On Behalf Of Tom Throckmorton
> > Sent: Thursday, January 24, 2008 8:12 AM
> > To: Marc Powell
> > Cc: nagios-users@lists.sourceforge.net
> > Subject: Re: [Nagios-users] flap_detection_enabled
> > persistenceacrossreload/restart?
> > 
> 
> 
> > Would you (or anyone) mind testing this on your own installation:
> > 
> > - find a host/service for which flapping is enabled
> > - disable flap detection for that host/service; wait until that state
> is
> >   reflected in the extended info
> > - reload the configs
> > - check the state of flap detection - has it reverted to enabled?
> 
> I can confirm that it ignores the retention file setting with
> nagios-2.7. I don't have a more recent version up to test with. 

Thanks.  Bummer - I'm still trying to find something that points to this being
intentional.  I'll ask on nagios-devel...

> > > Not all directives can be overridden by the retention file but
> > > flap_detection_enabled is one of them
> > > (http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#service).
> > 
> > I'm not sure what you mean here - which directives _can't_ be
> overridden
> > by the
> > retention file?
> 
> Any that aren't marked with a *. Click the * or read the Retention Notes
> at the top of the page.

Oh, duh.  *kicks self in head*

http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#retention_notes

> > > Your configuration snippets look accurate, assuming they're applied
> as
> > > you indicate.
> > 
> > They were cut and pasted from live configs, so yeah.
> 
> I meant that the template really was applied to the hosts/services.

Well, assuming any status data that gets fed to the state files and via the NEB
is accurate, then my configs match my live settings.  I'm verifying by seeing
what ends up in the status_file and state_retention_file, as well as the
NDO-fed database.

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Error: Cannot open main configuration file '/usr/local/nagios/etc/nagios.cfg' for reading!

2008-01-24 Thread Marc Powell


> -Original Message-
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] On Behalf Of Scott McCoy
> Sent: Thursday, January 24, 2008 10:41 AM
> To: nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] Error: Cannot open main configuration file
> '/usr/local/nagios/etc/nagios.cfg' for reading!
> 
> When using the Fedora Quickstart, I get to step 7 and type
> /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg and I
> get the following error
> 
> Error: Cannot open main configuration file
> '/usr/local/nagios/etc/nagios.cfg' for reading!
> 
> I have check the permissions on /usr/local/nagios/etc/nagios.cfg and
> nagios  owner's the file.
> 
> [EMAIL PROTECTED] smccoy]# ls -l /usr/local/nagios/etc/
> total 68
> -rw-rw-r-- 1 nagios nagios 10459 Jan 24 09:47 cgi.cfg
> -rw-r--r-- 1  root   root  26 Jan 24 09:34 htpasswd.users
> -rw-r--r-- 1  nagios nagios 42466 Jan 24 09:54 nagios.cfg
> drwxrwxr-x 2 nagios nagios  4096 Jan 24 09:47 objects
> -rw-rw 1 nagios nagios  1341 Jan 24 09:47 resource.cfg

Do you have SELinux enabled? Is it blocking read access? The default
SELinux policy typically only affects the CGI's but you may have a more
restrictive policy enforced. Also, are the permissions on
/usr/local/nagios/etc/ and above 755 or set so that the nagios user has
read/execute access?

--
Marc

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] nagvis error nagvis

2008-01-24 Thread Marc Powell


> -Original Message-
> From: Alex Dehaini [mailto:[EMAIL PROTECTED]
> Sent: Thursday, January 24, 2008 9:29 AM
> To: Marc Powell
> Subject: Re: [Nagios-users] nagvis error nagvis
> 
> I have fixed the problem. I had to enable check_external_command=1 in
> nagios.cfg. After that, everything worked.

I'd say that's a bizarre and non-obvious resolution. I don't understand
how the error you posted would be related. Interesting...

--
Marc

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] flap_detection_enabled persistenceacrossreload/restart?

2008-01-24 Thread Marc Powell


> -Original Message-
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] On Behalf Of Tom Throckmorton
> Sent: Thursday, January 24, 2008 8:12 AM
> To: Marc Powell
> Cc: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] flap_detection_enabled
> persistenceacrossreload/restart?
> 


> Would you (or anyone) mind testing this on your own installation:
> 
> - find a host/service for which flapping is enabled
> - disable flap detection for that host/service; wait until that state
is
>   reflected in the extended info
> - reload the configs
> - check the state of flap detection - has it reverted to enabled?

I can confirm that it ignores the retention file setting with
nagios-2.7. I don't have a more recent version up to test with. 

> > Not all directives can be overridden by the retention file but
> > flap_detection_enabled is one of them
> > (http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#service).
> 
> I'm not sure what you mean here - which directives _can't_ be
overridden
> by the
> retention file?

Any that aren't marked with a *. Click the * or read the Retention Notes
at the top of the page.

> > Your configuration snippets look accurate, assuming they're applied
as
> > you indicate.
> 
> They were cut and pasted from live configs, so yeah.

I meant that the template really was applied to the hosts/services.

--
Marc


-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] nagvis error nagvis

2008-01-24 Thread Masopust, Christian

i have the same error here alltough ndo can write to all tables,
only entries like described below cannot be written to database...
(but when running the described sql from cmd-line it works)

christian

--
"I sense much NT in you, NT leads to Blue Screen. 
Blue Screen leads to downtime, downtime leads to suffering. NT is the path to 
the darkside." 

- Unknown Unix Jedi  

> -Original Message-
> From: [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] On Behalf 
> Of Marc Powell
> Sent: Thursday, January 24, 2008 4:25 PM
> To: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] nagvis error nagvis
> 
> 
> 
> > -Original Message-
> > From: [EMAIL PROTECTED] 
> [mailto:nagios-users-
> > [EMAIL PROTECTED] On Behalf Of Alex Dehaini
> > Sent: Thursday, January 24, 2008 8:34 AM
> > To: Giles Coochey; nagios-users@lists.sourceforge.net
> > Subject: [Nagios-users] nagvis error nagvis
> > 
> > I keep getting this error after installing nagvis
> > 
> 
> 
> > Jan 24 14:24:51 dev ndo2db: Error: mysql_query() failed for 'INSERT
> INTO
> > nagios_configfilevariables SET instance_id='1', configfile_id='4',
> > varname='cfg_file', varvalue='/usr/local/nagios/etc/localhost\.cfg''
> > 
> > 
> > Any idea on how I can fix this. I am running centos 5.0
> 
> Caveat- I don't use nagvis or ndo but... my first guess would 
> be a mysql
> permissions issue. Does the user that ndo is connecting to the mysql
> database as have insert permissions for that table? Are any of the db
> updates working?
> 
> --
> Marc
> 
> 
> --
> ---
> This SF.net email is sponsored by: Microsoft
> Defy all challenges. Microsoft(R) Visual Studio 2008.
> http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS 
> when reporting any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null
> 

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] nagvis error nagvis

2008-01-24 Thread Marc Powell


> -Original Message-
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] On Behalf Of Alex Dehaini
> Sent: Thursday, January 24, 2008 8:34 AM
> To: Giles Coochey; nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] nagvis error nagvis
> 
> I keep getting this error after installing nagvis
> 


> Jan 24 14:24:51 dev ndo2db: Error: mysql_query() failed for 'INSERT
INTO
> nagios_configfilevariables SET instance_id='1', configfile_id='4',
> varname='cfg_file', varvalue='/usr/local/nagios/etc/localhost\.cfg''
> 
> 
> Any idea on how I can fix this. I am running centos 5.0

Caveat- I don't use nagvis or ndo but... my first guess would be a mysql
permissions issue. Does the user that ndo is connecting to the mysql
database as have insert permissions for that table? Are any of the db
updates working?

--
Marc


-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] nagvis error nagvis

2008-01-24 Thread Giles Coochey
Actually, I too get a similar error you yours, on starting NDO:

Jan 23 17:08:09 hostname ndo2db: Error: mysql_query() failed for 'INSERT INTO 
nagios_configfilevariables SET instance_id='1', configfile_id='1', 
varname='cfg_dir', varvalue='/opt/nagios/etc/cfg/hostgroup1''

And I have a working NagVis setup, so I am not sure whether that error is the 
actual cause of your problem.


From: Alex Dehaini [mailto:[EMAIL PROTECTED] 
Sent: 24 January 2008 15:43
To: Giles Coochey; nagios-users@lists.sourceforge.net
Subject: Re: nagvis error nagvis

I have already done that and re-compiled nagios. The problem is how ndo2db is 
putting info into mysql, that is where the issue comes from.

Jan 24 14:33:04 dev ndo2db: Error: mysql_query() failed for 'INSERT INTO 
nagios_configfilevariables SET instance_id='1', configfile_id='7', 
varname='cfg_file', varvalue='/usr/local/nagios/etc/localhost\.cfg''  


On Jan 24, 2008 2:40 PM, Giles Coochey <[EMAIL PROTECTED]> wrote:
http://www.nagvis.org/docs/general/often_reported_errors#nagiosnotrunning


 
From: Alex Dehaini [mailto:[EMAIL PROTECTED]
Sent: 24 January 2008 15:34
To: Giles Coochey; nagios-users@lists.sourceforge.net 
Subject: nagvis error nagvis

I keep getting this error after installing nagvis

The installation went well but when I browse to the nagvis webpage I get this 
error

nagiosNotRunning: NDOMy: Nagios not running (Backend ndomy_1)
NDO reports that Nagios is not running

When I check my logs, this is the error I get.


[EMAIL PROTECTED] etc]# tail /var/log/messages
Jan 24 14:24:51 dev nagios: Event broker module 
'/usr/local/nagios/bin/ndomod.o' deinitialized successfully. 
Jan 24 14:24:51 dev ndo2db: Successfully disconnected from MySQL database
Jan 24 14:24:51 dev nagios: Nagios 2.9 starting... (PID=3113)
Jan 24 14:24:51 dev nagios: LOG VERSION: 2.0
Jan 24 14:24:51 dev nagios: ndomod: NDOMOD 1.4b7 (10-31-2007) Copyright (c) 
2005-2007 Ethan Galstad ([EMAIL PROTECTED])
Jan 24 14:24:51 dev ndo2db: Successfully connected to MySQL database
Jan 24 14:24:51 dev nagios: ndomod: Successfully connected to data sink. 0 
queued items to flush. 
Jan 24 14:24:51 dev nagios: Event broker module 
'/usr/local/nagios/bin/ndomod.o' initialized successfully.
Jan 24 14:24:51 dev nagios: Finished daemonizing... (New PID=3116)
Jan 24 14:24:51 dev ndo2db: Error: mysql_query() failed for 'INSERT INTO 
nagios_configfilevariables SET instance_id='1', configfile_id='4', 
varname='cfg_file', varvalue='/usr/local/nagios/etc/localhost\.cfg'' 


Any idea on how I can fix this. I am running centos 5.0

On Nov 26, 2007 11:20 AM, Giles Coochey < [EMAIL PROTECTED]> wrote:
If you post in plain text, it's much easier for me not to top post. 

This is likely something to do with your socket file.

Check the socket file in ndo2db configuration file, mine is in:

/opt/nagios/var/ndo.sock

It is referenced in ndo2db.cfg with this line:

socket_name=/opt/nagios/var/ndo.sock

if socket exists, it may have been created with insufficient permissions, or 
the directory mask was bad, remove the file, check permissions and restart ndo.

Thanks

Giles


From: [EMAIL PROTECTED] [mailto: [EMAIL PROTECTED] On Behalf Of Alex Dehaini
Sent: 26 November 2007 12:10

To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Integrating nagvis 

Here is the error I get from my logs

Nov 26 11:00:01 development nagios: ndomod: Still unable to connect to data 
sink. 0 items lost, 1655 queued items to flush.

When I browse to the nagvis home page, I get this error. 

Warning: mysql_fetch_array(): supplied argument is not a valid MySQL result 
resource in 
/usr/local/nagios/share/nagvis/nagvis/includes/classes/class.GlobalBackend-ndomy.php
 on line 58

nagiosNotRunning: NDOMy: Nagios not running (Backend ndomy_1) 

NDO reports that Nagios is not running

Any clue what is wrong?
On Nov 26, 2007 11:01 AM, Giles Coochey <[EMAIL PROTECTED]> wrote:


> -Original Message- 
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] On Behalf Of Giorgio Zarrelli
> Sent: 26 November 2007 11:58
> To: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Integrating nagvis 
>
> Hi,
>
> I used:
>
> nagios-3.0b6
> nagvis-1.2b2
>
> What's the problem with the backend? Which operating system?
>
I'm using Nagios 2.9, NdoUtils 1.4.7b7 and NagVis 1.1.2

Not any problems, I just followed the README's and posted minor issues
with descriptions as to the particular problems on the list to resolve
all my problems.

- 
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ 
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/

Re: [Nagios-users] nagvis error nagvis

2008-01-24 Thread Alex Dehaini
I have already done that and re-compiled nagios. The problem is how ndo2db
is putting info into mysql, that is where the issue comes from.
Jan 24 14:33:04 dev ndo2db: Error: mysql_query() failed for 'INSERT INTO
nagios_configfilevariables SET instance_id='1', configfile_id='7',
varname='cfg_file', varvalue='/usr/local/nagios/etc/localhost\.cfg''


On Jan 24, 2008 2:40 PM, Giles Coochey <[EMAIL PROTECTED]> wrote:

> http://www.nagvis.org/docs/general/often_reported_errors#nagiosnotrunning
>
>
> 
> From: Alex Dehaini [mailto:[EMAIL PROTECTED]
> Sent: 24 January 2008 15:34
> To: Giles Coochey; nagios-users@lists.sourceforge.net
> Subject: nagvis error nagvis
>
> I keep getting this error after installing nagvis
>
> The installation went well but when I browse to the nagvis webpage I get
> this error
>
> nagiosNotRunning: NDOMy: Nagios not running (Backend ndomy_1)
> NDO reports that Nagios is not running
>
> When I check my logs, this is the error I get.
>
>
> [EMAIL PROTECTED] etc]# tail /var/log/messages
> Jan 24 14:24:51 dev nagios: Event broker module
> '/usr/local/nagios/bin/ndomod.o' deinitialized successfully.
> Jan 24 14:24:51 dev ndo2db: Successfully disconnected from MySQL database
> Jan 24 14:24:51 dev nagios: Nagios 2.9 starting... (PID=3113)
> Jan 24 14:24:51 dev nagios: LOG VERSION: 2.0
> Jan 24 14:24:51 dev nagios: ndomod: NDOMOD 1.4b7 (10-31-2007) Copyright
> (c) 2005-2007 Ethan Galstad ([EMAIL PROTECTED])
> Jan 24 14:24:51 dev ndo2db: Successfully connected to MySQL database
> Jan 24 14:24:51 dev nagios: ndomod: Successfully connected to data sink. 0
> queued items to flush.
> Jan 24 14:24:51 dev nagios: Event broker module
> '/usr/local/nagios/bin/ndomod.o' initialized successfully.
> Jan 24 14:24:51 dev nagios: Finished daemonizing... (New PID=3116)
> Jan 24 14:24:51 dev ndo2db: Error: mysql_query() failed for 'INSERT INTO
> nagios_configfilevariables SET instance_id='1', configfile_id='4',
> varname='cfg_file', varvalue='/usr/local/nagios/etc/localhost\.cfg''
>
>
> Any idea on how I can fix this. I am running centos 5.0
>
> On Nov 26, 2007 11:20 AM, Giles Coochey < [EMAIL PROTECTED]> wrote:
> If you post in plain text, it's much easier for me not to top post.
>
> This is likely something to do with your socket file.
>
> Check the socket file in ndo2db configuration file, mine is in:
>
> /opt/nagios/var/ndo.sock
>
> It is referenced in ndo2db.cfg with this line:
>
> socket_name=/opt/nagios/var/ndo.sock
>
> if socket exists, it may have been created with insufficient permissions,
> or the directory mask was bad, remove the file, check permissions and
> restart ndo.
>
> Thanks
>
> Giles
>
> 
> From: [EMAIL PROTECTED] [mailto:
> [EMAIL PROTECTED] On Behalf Of Alex Dehaini
> Sent: 26 November 2007 12:10
>
> To: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Integrating nagvis
>
> Here is the error I get from my logs
>
> Nov 26 11:00:01 development nagios: ndomod: Still unable to connect to
> data sink. 0 items lost, 1655 queued items to flush.
>
> When I browse to the nagvis home page, I get this error.
>
> Warning: mysql_fetch_array(): supplied argument is not a valid MySQL
> result resource in
> /usr/local/nagios/share/nagvis/nagvis/includes/classes/class.GlobalBackend-
> ndomy.php on line 58
>
> nagiosNotRunning: NDOMy: Nagios not running (Backend ndomy_1)
>
> NDO reports that Nagios is not running
>
> Any clue what is wrong?
> On Nov 26, 2007 11:01 AM, Giles Coochey <[EMAIL PROTECTED]> wrote:
>
>
> > -Original Message-
> > From: [EMAIL PROTECTED] [mailto:nagios-users-
> > [EMAIL PROTECTED] On Behalf Of Giorgio Zarrelli
> > Sent: 26 November 2007 11:58
> > To: nagios-users@lists.sourceforge.net
> > Subject: Re: [Nagios-users] Integrating nagvis
> >
> > Hi,
> >
> > I used:
> >
> > nagios-3.0b6
> > nagvis-1.2b2
> >
> > What's the problem with the backend? Which operating system?
> >
> I'm using Nagios 2.9, NdoUtils 1.4.7b7 and NagVis 1.1.2
>
> Not any problems, I just followed the README's and posted minor issues
> with descriptions as to the particular problems on the list to resolve
> all my problems.
>
> -
> This SF.net email is sponsored by: Microsoft
> Defy all challenges. Microsoft(R) Visual Studio 2005.
> http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>
>
>
> --
> Alex Dehaini
> Developer
> Site - www.alexdehaini.com
> Email - [EMAIL PROTECTED]
>
>
>
> --
> Alex Dehaini
> Developer
> Site - www.alexdehaini.com
> Email - [EMAIL PROTECTED]
>



-- 
Alex Dehaini
Developer

Re: [Nagios-users] nagvis error nagvis

2008-01-24 Thread Giles Coochey
http://www.nagvis.org/docs/general/often_reported_errors#nagiosnotrunning



From: Alex Dehaini [mailto:[EMAIL PROTECTED] 
Sent: 24 January 2008 15:34
To: Giles Coochey; nagios-users@lists.sourceforge.net
Subject: nagvis error nagvis

I keep getting this error after installing nagvis

The installation went well but when I browse to the nagvis webpage I get this 
error

nagiosNotRunning: NDOMy: Nagios not running (Backend ndomy_1)
NDO reports that Nagios is not running

When I check my logs, this is the error I get.


[EMAIL PROTECTED] etc]# tail /var/log/messages 
Jan 24 14:24:51 dev nagios: Event broker module 
'/usr/local/nagios/bin/ndomod.o' deinitialized successfully.  
Jan 24 14:24:51 dev ndo2db: Successfully disconnected from MySQL database
Jan 24 14:24:51 dev nagios: Nagios 2.9 starting... (PID=3113)
Jan 24 14:24:51 dev nagios: LOG VERSION: 2.0 
Jan 24 14:24:51 dev nagios: ndomod: NDOMOD 1.4b7 (10-31-2007) Copyright (c) 
2005-2007 Ethan Galstad ([EMAIL PROTECTED])
Jan 24 14:24:51 dev ndo2db: Successfully connected to MySQL database 
Jan 24 14:24:51 dev nagios: ndomod: Successfully connected to data sink.  0 
queued items to flush.
Jan 24 14:24:51 dev nagios: Event broker module 
'/usr/local/nagios/bin/ndomod.o' initialized successfully.  
Jan 24 14:24:51 dev nagios: Finished daemonizing... (New PID=3116) 
Jan 24 14:24:51 dev ndo2db: Error: mysql_query() failed for 'INSERT INTO 
nagios_configfilevariables SET instance_id='1', configfile_id='4', 
varname='cfg_file', varvalue='/usr/local/nagios/etc/localhost\.cfg''  


Any idea on how I can fix this. I am running centos 5.0

On Nov 26, 2007 11:20 AM, Giles Coochey < [EMAIL PROTECTED]> wrote:
If you post in plain text, it's much easier for me not to top post.
 
This is likely something to do with your socket file.
 
Check the socket file in ndo2db configuration file, mine is in:
 
/opt/nagios/var/ndo.sock
 
It is referenced in ndo2db.cfg with this line:
 
socket_name=/opt/nagios/var/ndo.sock
 
if socket exists, it may have been created with insufficient permissions, or 
the directory mask was bad, remove the file, check permissions and restart ndo.
 
Thanks
 
Giles
 

From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Alex Dehaini
Sent: 26 November 2007 12:10

To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Integrating nagvis
 
Here is the error I get from my logs

Nov 26 11:00:01 development nagios: ndomod: Still unable to connect to data 
sink.  0 items lost, 1655 queued items to flush.

When I browse to the nagvis home page, I get this error. 

Warning: mysql_fetch_array(): supplied argument is not a valid MySQL result 
resource in 
/usr/local/nagios/share/nagvis/nagvis/includes/classes/class.GlobalBackend-ndomy.php
 on line 58

nagiosNotRunning: NDOMy: Nagios not running (Backend ndomy_1) 

NDO reports that Nagios is not running

Any clue what is wrong?
On Nov 26, 2007 11:01 AM, Giles Coochey <[EMAIL PROTECTED]> wrote: 


> -Original Message-
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] On Behalf Of Giorgio Zarrelli 
> Sent: 26 November 2007 11:58
> To: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Integrating nagvis
>
> Hi,
> 
> I used:
>
> nagios-3.0b6
> nagvis-1.2b2
>
> What's the problem with the backend? Which operating system?
>
I'm using Nagios 2.9, NdoUtils 1.4.7b7 and NagVis 1.1.2

Not any problems, I just followed the README's and posted minor issues
with descriptions as to the particular problems on the list to resolve
all my problems.

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net 
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null



-- 
Alex Dehaini
Developer
Site - www.alexdehaini.com 
Email - [EMAIL PROTECTED] 



-- 
Alex Dehaini
Developer
Site - www.alexdehaini.com
Email - [EMAIL PROTECTED] 

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] nagvis error nagvis

2008-01-24 Thread Alex Dehaini
I keep getting this error after installing nagvis
The installation went well but when I browse to the nagvis webpage I get
this error

nagiosNotRunning: NDOMy: Nagios not running (Backend ndomy_1)NDO reports
that Nagios is not running

When I check my logs, this is the error I get.


[EMAIL PROTECTED] etc]# tail /var/log/messages
Jan 24 14:24:51 dev nagios: Event broker module
'/usr/local/nagios/bin/ndomod.o' deinitialized successfully.
Jan 24 14:24:51 dev ndo2db: Successfully disconnected from MySQL database
Jan 24 14:24:51 dev nagios: Nagios 2.9 starting... (PID=3113)
Jan 24 14:24:51 dev nagios: LOG VERSION: 2.0
Jan 24 14:24:51 dev nagios: ndomod: NDOMOD 1.4b7 (10-31-2007) Copyright (c)
2005-2007 Ethan Galstad ([EMAIL PROTECTED])
Jan 24 14:24:51 dev ndo2db: Successfully connected to MySQL database
Jan 24 14:24:51 dev nagios: ndomod: Successfully connected to data sink.  0
queued items to flush.
Jan 24 14:24:51 dev nagios: Event broker module
'/usr/local/nagios/bin/ndomod.o' initialized successfully.
Jan 24 14:24:51 dev nagios: Finished daemonizing... (New PID=3116)
Jan 24 14:24:51 dev ndo2db: Error: mysql_query() failed for 'INSERT INTO
nagios_configfilevariables SET instance_id='1', configfile_id='4',
varname='cfg_file', varvalue='/usr/local/nagios/etc/localhost\.cfg''


Any idea on how I can fix this. I am running centos 5.0

On Nov 26, 2007 11:20 AM, Giles Coochey <[EMAIL PROTECTED]> wrote:

>  If you post in plain text, it's much easier for me not to top post.
>
>
>
> This is likely something to do with your socket file.
>
>
>
> Check the socket file in ndo2db configuration file, mine is in:
>
>
>
> /opt/nagios/var/ndo.sock
>
>
>
> It is referenced in ndo2db.cfg with this line:
>
>
>
> socket_name=/opt/nagios/var/ndo.sock
>
>
>
> if socket exists, it may have been created with insufficient permissions,
> or the directory mask was bad, remove the file, check permissions and
> restart ndo.
>
>
>
> Thanks
>
>
>
> Giles
>
>
>   --
>
> *From:* [EMAIL PROTECTED] [mailto:
> [EMAIL PROTECTED] *On Behalf Of *Alex Dehaini
> *Sent:* 26 November 2007 12:10
>
> *To:* nagios-users@lists.sourceforge.net
> *Subject:* Re: [Nagios-users] Integrating nagvis
>
>
>
> Here is the error I get from my logs
>
> *Nov 26 11:00:01 development nagios: ndomod: Still unable to connect to
> data sink.  0 items lost, 1655 queued items to flush.
>
> *When I browse to the nagvis home page, I get this error.
>
> *Warning*: mysql_fetch_array(): supplied argument is not a valid MySQL
> result resource in *
> /usr/local/nagios/share/nagvis/nagvis/includes/classes/class.GlobalBackend-
> ndomy.php* on line *58
>
> nagiosNotRunning: NDOMy: Nagios not running (Backend ndomy_1)
>
> NDO reports that Nagios is not running
>
> *Any clue what is wrong?
>
> On Nov 26, 2007 11:01 AM, Giles Coochey <[EMAIL PROTECTED]> wrote:
>
>
>
> > -Original Message-
> > From: [EMAIL PROTECTED] [mailto:nagios-users-
> > [EMAIL PROTECTED] On Behalf Of Giorgio Zarrelli
> > Sent: 26 November 2007 11:58
> > To: nagios-users@lists.sourceforge.net
> > Subject: Re: [Nagios-users] Integrating nagvis
> >
> > Hi,
> >
> > I used:
> >
> > nagios-3.0b6
> > nagvis-1.2b2
> >
> > What's the problem with the backend? Which operating system?
> >
>
> I'm using Nagios 2.9, NdoUtils 1.4.7b7 and NagVis 1.1.2
>
> Not any problems, I just followed the README's and posted minor issues
> with descriptions as to the particular problems on the list to resolve
> all my problems.
>
>
> -
> This SF.net email is sponsored by: Microsoft
> Defy all challenges. Microsoft(R) Visual Studio 2005.
> http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>
>
>
>
> --
> Alex Dehaini
> Developer
> Site - www.alexdehaini.com
> Email - [EMAIL PROTECTED]
>



-- 
Alex Dehaini
Developer
Site - www.alexdehaini.com
Email - [EMAIL PROTECTED]
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] flap_detection_enabled persistence acrossreload/restart?

2008-01-24 Thread Tom Throckmorton
On Jan 23 17:35, Marc Powell wrote:
> > Before I dig further, can anyone verify that the state of host or
> service
> > flap detection (enabled/disabled) should or shouldn't be persistent across
> a
> > reload/restart?  I've combed the docs and archives, and am coming up
> dry.
> 
> The retention file should take precedence. 

Thanks, that's what I thought - it appears to for all but the flapping options.
I should add that this applies to the global flap detection option as well.

Would you (or anyone) mind testing this on your own installation:

- find a host/service for which flapping is enabled
- disable flap detection for that host/service; wait until that state is
  reflected in the extended info
- reload the configs
- check the state of flap detection - has it reverted to enabled?

> Not all directives can be overridden by the retention file but
> flap_detection_enabled is one of them
> (http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#service).

I'm not sure what you mean here - which directives _can't_ be overridden by the
retention file?

> Your configuration snippets look accurate, assuming they're applied as
> you indicate.

They were cut and pasted from live configs, so yeah.

Cheers,

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] cross products: host/host group x service/service group ..?

2008-01-24 Thread Jan Grant
On Thu, 24 Jan 2008, Marc Powell wrote:

> 
> On Jan 24, 2008, at 4:23 AM, Jan Grant wrote:
> 
> > By-and-large, I generate my nagios configs mechanically from the same
> > config that drives app deployment. However, I've just had cause to  
> > write
> > some config by hand and the following occurred to me:
> >
> > I've got several hostgroups, and several service groups. What I'd  
> > really
> > find convenient is a way to say
> >
> > all these services are running on all these hosts
> 
> I think you can get close to what you want. Servicegroups are just for  
> CGI display purposes but in a service definition you can specify  
> hostgroups instead of specific hosts.

That's close enough to make my life easy; thanks!


-- 
jan grant, ISYS, University of Bristol. http://www.bris.ac.uk/
Tel +44 (0)117 3317661   http://ioctl.org/jan/
If it's broken really badly - don't fix it either.

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] cross products: host/host group x service/service group ..?

2008-01-24 Thread Marc Powell

On Jan 24, 2008, at 4:23 AM, Jan Grant wrote:

> By-and-large, I generate my nagios configs mechanically from the same
> config that drives app deployment. However, I've just had cause to  
> write
> some config by hand and the following occurred to me:
>
> I've got several hostgroups, and several service groups. What I'd  
> really
> find convenient is a way to say
>
>   all these services are running on all these hosts

I think you can get close to what you want. Servicegroups are just for  
CGI display purposes but in a service definition you can specify  
hostgroups instead of specific hosts.

http://nagios.sourceforge.net/docs/2_0/templatetricks.html

All Hosts In Multiple Hostgroups: If you want to create identical  
services that are assigned to all hosts in one or more hostgroups, you  
can do so by creating a single service definition. How? The  
hostgroup_name directive allows you to specify the name of one or more  
hostgroups that the service should be created for:

define service{
hostgroup_name  HOSTGROUP1,HOSTGROUP2,...,HOSTGROUPN
service_description SOMESERVICE
other service directives ...
}

The definition above would create a service called SOMESERVICE on all  
hosts that are members of hostgroups HOSTGROUP1 through HOSTGROUPN.  
All the instances of the SOMESERVICE service would be identical (i.e.  
have the same check command, max check attempts, notification period,  
etc.).

-
Marc

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Regading NFS

2008-01-24 Thread Marc Powell

On Jan 24, 2008, at 4:26 AM, Biju Augustine wrote:

> Hi All,
>
> I am getting the alerts for NFS
>
> command.cfg
>
> define command {
>command_name check_nfsmount
>command_line $USER1$/check_nfsmount -H $HOSTADDRESS$
>}
>
>

> Service Critical[01-24-2008 15:20:26] SERVICE ALERT: essentia.in;NFS
> checker;CRITICAL;SOFT;1;(Return code of 127 is out of bounds - plugin
> may be missing)

This error means that either --

- $USER1$ isn't set or is set to the wrong path
- check_nfsmount isn't under $USER1$ path
- check_nfsmount is a script that calls some other program that  
doesn't exist where specified.

--
Marc



-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] RE host checks configuration question

2008-01-24 Thread Marc Powell

On Jan 24, 2008, at 4:59 AM, Cyrille Bollu wrote:

 > OK, I have find the explanation as what means a "check_interval"  
being set to 0 in the documentation of Nagios 3 (we are using 2.9  
here): 0 means *regular* host checks are not performed. Which is good.

The host check behavior can be different under nagios-3. Since you say  
you're using 2.9, I'll respond in that manner.

> I have another question now: Are on-demand host checks scheduled  
> when a service transits in a *soft* not-OK state or when it transits  
> in a *hard* not-ok state?

Soft. The first non-OK service check results in a host check.

http://nagios.sourceforge.net/docs/2_0/networkreachability.html

>
> This is a quite important question since knowing the answer to this  
> question will influence the value I'm gonna set for the host  
> "max_check_attempts" parameter: When "on-demand host checks" are  
> scheduled when a service transits in a *hard* not-OK state, I will  
> set the host "max_check_attempts" to a lower value than when "on- 
> demand host checks" are scheduled when a service transits in a  
> *hard* not-OK state (since I'm more confident in the service check  
> result).

There are other factors that you need to consider. Under 2.x, all host  
checks are performed serially. While a host check is being performed,  
up to max_check_attempts, all other nagios tasks stop completely. This  
can lead to unexpectedly high latencies on problem networks if you  
don't optimize your host check commands to complete as quickly as  
possible and still be confident the host is down. A single ping with 3  
max check attempts for example. Just enough to be sure the host is  
really down.

> Also, there is something else not very clear in Nagios 3  
> documentation:
>
> (from http://nagios.sourceforge.net/docs/3_0/hostchecks.html)
> "Hosts which have their max_check_attempts value set to 1 can cause  
> serious performance problems. The reason? If Nagios needs to  
> determine their true state using the network reachability logic (to  
> see if they're DOWN or UNREACHABLE), it will have to launch serial  
> checks of all of the host's immediate parents. Just to reiterate,  
> those checks are run serially, rather than in parallel, so it can  
> cause a big performance hit. For this reason, I would recommend that  
> you always use a value greater than 1 for the max_check_attempts  
> directives in your host definitions."
>
> Well, I believe the writer of this documentation but I don't  
> understand why setting this parameter to 1 will serialize the host's  
> parents checks. Can someone explain me this point?
>

I don't believe I can answer this any more specifically at this point.  
I haven't examined the code for 3.x much yet.

--
Marc

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Regading NFS

2008-01-24 Thread Biju Augustine
Hi All,

I am getting the alerts for NFS

command.cfg

define command {
command_name check_nfsmount
command_line $USER1$/check_nfsmount -H $HOSTADDRESS$
}

hosts.cfg

efine service{
use generic-service ; Name
of service template to use

host_name   icybox
service_description NFS checker
is_volatile 0
check_period24x7
max_check_attempts  3
normal_check_interval   5
retry_check_interval1
contact_groups  linux-admins
notification_interval   240
notification_period 24x7
notification_optionsc,r
check_command   check_nfsmount

}

Warning : CRITICAL

01-24-2008 15:31:29] Nagios 3.0b6 starting... (PID=13400)
Program End[01-24-2008 15:31:28] Caught SIGTERM, shutting down...
Service Critical[01-24-2008 15:22:26] SERVICE ALERT: essentia.in;NFS
checker;CRITICAL;HARD;3;(Return code of 127 is out of bounds - plugin
may be missing)
Service Critical[01-24-2008 15:21:26] SERVICE ALERT: essentia.in;NFS
checker;CRITICAL;SOFT;2;(Return code of 127 is out of bounds - plugin
may be missing)
Service Critical[01-24-2008 15:20:26] SERVICE ALERT: essentia.in;NFS
checker;CRITICAL;SOFT;1;(Return code of 127 is out of bounds - plugin
may be missing)
Program Start[01-24-2008 15:19:26] Nagios 3.0b6 starting... (PID=12368)
Program End[01-24-2008 15:19:26] Caught SIGTERM, shutting down...

Can you please help me

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] RE host checks configuration question

2008-01-24 Thread Cyrille Bollu
OK, I have find the explanation as what means a "check_interval" being set 
to 0 in the documentation of Nagios 3 (we are using 2.9 here): 0 means 
*regular* host checks are not performed. Which is good.

I have another question now: Are on-demand host checks scheduled when a 
service transits in a *soft* not-OK state or when it transits in a *hard* 
not-ok state?

This is a quite important question since knowing the answer to this 
question will influence the value I'm gonna set for the host 
"max_check_attempts" parameter: When "on-demand host checks" are scheduled 
when a service transits in a *hard* not-OK state, I will set the host 
"max_check_attempts" to a lower value than when "on-demand host checks" 
are scheduled when a service transits in a *hard* not-OK state (since I'm 
more confident in the service check result).

Also, there is something else not very clear in Nagios 3 documentation:

(from http://nagios.sourceforge.net/docs/3_0/hostchecks.html)
"Hosts which have their max_check_attempts value set to 1 can cause 
serious performance problems. The reason? If Nagios needs to determine 
their true state using the network reachability logic (to see if they're 
DOWN or UNREACHABLE), it will have to launch serial checks of all of the 
host's immediate parents. Just to reiterate, those checks are run serially
, rather than in parallel, so it can cause a big performance hit. For this 
reason, I would recommend that you always use a value greater than 1 for 
the max_check_attempts directives in your host definitions."

Well, I believe the writer of this documentation but I don't understand 
why setting this parameter to 1 will serialize the host's parents checks. 
Can someone explain me this point?

Best regards,

Cyrille

> 
> Hi all, 
> 
> My host checks are configured as follow: 
> 
> Host NameMax. Check AttemptsCheck Interval 
> Host Check CommandEnable Active Checks Enable 
> Passive Checks Check Freshness 
> adabase 5 0h 0m 0s check-host-alive
> NoYesNo 
> 
> Does this means that whenever a host check is needed, Nagios may 
> execute the "check-host-alive" command 5 times before a notification
> is sent? Wow, wouldn't be that bad?! 
> 
> And what does the "check interval" of 0 means? Might it take 5 
> minutes to get the notification? Should I set this parameter to "1min"? 
> 
> Could you help me? 
> 
> Best regards, 
> 
> Cyrille
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] cross products: host/host group x service/service group ..?

2008-01-24 Thread Jan Grant
By-and-large, I generate my nagios configs mechanically from the same 
config that drives app deployment. However, I've just had cause to write 
some config by hand and the following occurred to me:

I've got several hostgroups, and several service groups. What I'd really 
find convenient is a way to say

all these services are running on all these hosts

and generate service instances for the product of those two sets. (The 
hosts are part of a farm.)

It's obviously possible to get a lot of the way using service templates, 
but as far as I can tell I still have to write a service definition per 
(host, service template) pair.

I'm aware that there may be design issues I've missed - naming 
considerations or suchlike. I think it'd be desirable to be able to 
specify groups of hosts and groups of service templates and (with a 
small number of additional declarations) create service instances. So 
(a) am I just being dumb and missing an already-extant capability; (b) 
is there a fundamental reason why this doesn't fit in nagios' design; 
(c) is the typical thing in this situation to do what I already do and 
generate the configs mechanically?

Cheers,
jan

-- 
jan grant, ISYS, University of Bristol. http://www.bris.ac.uk/
Tel +44 (0)117 3317661   http://ioctl.org/jan/
Ever see something and think, "I've gotta leverage me some of that?"
Odds are, you were looking at a synergy and didn't even know it.

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] host checks configuration question

2008-01-24 Thread Cyrille Bollu
Hi all,

My host checks are configured as follow:

Host Name   Max. Check Attempts Check Interval  Host Check Command 
Enable Active ChecksEnable Passive Checks   Check Freshness 
adabase 5   0h 0m 0scheck-host-alive 
No  Yes No

Does this means that whenever a host check is needed, Nagios may execute 
the "check-host-alive" command 5 times before a notification is sent? Wow, 
wouldn't be that bad?!

And what does the "check interval" of 0 means? Might it take 5 minutes to 
get the notification? Should I set this parameter to "1min"?

Could you help me?

Best regards,

Cyrille-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] notification problem Nagios 3

2008-01-24 Thread Brandstätter Sigmund
Hi people,

Following Question:
 
We have configured Nagios and so far it works fine and does notifications. 
 
Since we have a lot of systems which belong to different groups (network, unix, 
windows, ...) we tried to configure the notifications so that we can define 
more then one recipient!
 
for example, we have contacts configured in contacts.cfg

example:

define contact{
contact_namenoone
alias   noone
use generic-contact
service_notification_period none
host_notification_periodnone
contactgroups   NO-admins
email   [EMAIL PROTECTED]}

then we have contact groups:

example:

define contact{
contact_namenoone
alias   noone
use generic-contact
service_notification_period none
host_notification_periodnone
contactgroups   NO-admins
email   [EMAIL PROTECTED]
}

we use the following template.cfg entries

define service{
nameprod-service
use generic-service
max_check_attempts  4
normal_check_interval   5
retry_check_interval1
contact_groups  BEREITSCHAFT-admins
register0
}
 
define service{
nametest-service
use generic-service
max_check_attempts  4
normal_check_interval   5
retry_check_interval1
contact_groups  NO-admins
register0
}
 
define service{
nameunix-service
contact_groups  +UNIX-admins
register0
}
 
define service{
namenetzwerk-service
contact_groups  +NETZWERK-admins
register0
}
 
define service{
namedb-service
contact_groups  +DATENBANK-admins
register0
}
 

and in the services.cfg it looks like:

example:

 
define service{
use prod-service,netzwerk-service
hostgroup_name  Netzwerkkomponenten
service_description PING
check_command   check_ping!100.0,20%!500.0,60%
}


The notification only goes to the first contact /tghe one defined for 
prod-service) but not to the others if wanted

The reason for this setup is that the prod-service goes to a blackberry and the 
others should only be additional email notification to theresponsible groups..

anyone got an idea of what we did wrong? cant find a reason.

thanks in advance

cheers
Sigmund





-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] CFG files - Organization

2008-01-24 Thread Ronaldo A. Bueno Filho
Hello guys!

I've installed and configured Nagios 3 and now I am making some tests. Now, I'm 
wondering about the best way to organize the cfg files.

 

I have the following environment:

 

- Several customers

- Local network

- Web monitoring

 

I`m not sure, but I think I can separate it in different directories like 
templates (generic templates), localnetwork (LAN), Customers (with sub folders 
for each customer) and so on.

 

Also, How can I separate the cfg files? 

 

For example: 

- Services file

- Hosts file

- Hostsgroups file

- Etc.

 

I would appreciate ideas regarding the best way of how to organize it!

 

Thanks,


--
Ronaldo A. Bueno Filho-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Problem with high latencies after going distributed

2008-01-24 Thread Thomas Guyot-Sionnest
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Some heavily broken intending there (looks like my mail client gets
confused)... don't trust the number of ">"!

On 23/01/08 10:47 PM, Frost, Mark {PBG} wrote:
>  
> 
>> -Original Message-
>> From: Thomas Guyot-Sionnest [mailto:[EMAIL PROTECTED] 
>> Sent: Wednesday, January 23, 2008 10:24 PM
>> To: Frost, Mark {PBG}
>> Cc: Nagios Users
>> Subject: Re: [Nagios-users] Problem with high latencies after 
>> going distributed
> I don't think so. I remember an email from Ton Voon some time 
> ago asking
> Ethan why the oc[hs]p command are run serially but I don't recall if
> there was a reply or what else was said...
> 
> I believe it's either documented in the official doc or some
> user-contributed doc that the oc[hs]p commands should return as soon as
> possible. It's usually done in Perl using a fork:
> 
> if (fork==0) {
>  # send stuff via NSCA here...
> }
> exit(0);
> 
> 
>> I guess what I'm thinking here is that unlike a custom check, I can't
>> see most
>> people needing to customize the passive check result process.  All the
>> solutions I've
>> seen seem to include a named pipe.  So why couldn't Nagios support
>> making the ocsp/ochp
>> "commands" just named pipes instead.   Then instead of a standalone
>> send_nsca binary,
>> have the nsca source build a send_nscaD binary (I'm making that up) that
>> reads from the
>> pipe that nagios writes to and sends directly to nsca on the server.
>> That sort of
>> eliminates the middle-man in the process of reporting passive check
>> results.
> 
>> I know, I know, I'm free to write the send_nscaD.c code and send it to
>> Ethan :-)

Well... I was thinking about partly re-writing nsca as an event-based
daemon (supporting only the --single mode, but that would be really
scalable) using libevent, allowing to pass along the timestamp  (this is
a recent feature request) and supporting multi-line responses (for
Nagios 3) in the process, and finally suggesting this as a base for a
NSCA v3... I'm not even sure if I would have enough time but since my
main objective it to learn I wouldn't loose anything trying :).

In the unlikely event that I write it, In the same step I could surely
to a C version of OCP_Daemon supporting natively the "NSCA v3" protocol
(it wouldn't be hard)...

I'll have to think about it... I quess the only sane separator to write
multiple multi-line results on a pipe would be \000 (NULL), so there
would be 3 mode of operation for send_nsca (and two for nsca_sendd
(don't you think it sounds better reversed?)):
send_nsca: compatible (v2 behavior), Single check (additional lines are
taken as additional output) and multi-check (NULL separated)
nsca_sendd: single-line (one check/line, OCP_Daemon style) and
multi-line "NULL-separated).

> I don't know how many people use OCP_Daemon but I had reports 
>>>from a few
> people that greatly reduced their latency using it and I 
> haven't had any
> bug reported yet. I believe it's well documented as well, but If you
> have any feedback on this I'll be happy to get it.
> 
>> I'm playing with it a bit and have so far had good results.  I'll have
>> some
>> feedback after I've played with it a bit longer.  Thanks for writing it
>> and
>> writing up the docs for it as well!

Pass the thanks over to Ethan who sent me a Nagios NSA t-shirt for it ;)

Thomas
-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFHmE096dZ+Kt5BchYRAlLvAKDNc3a78bDrzrWp42LbSxeAoDvOTwCdEg8j
d1n0dP53Grg8p4pi7cQTEC8=
=9UP5
-END PGP SIGNATURE-

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null