[Nagios-users] using check_snmp_ifstatus.pl plugin for windows servers

2008-08-25 Thread Dhaval Thakar
hi,

i am using check_snmp_ifstatus.pl downloaded from
http://shawnflynn.com/2008/08/13/nagios-plugincheck_snmp_ifstatuspl/

this is very good tool to monitor interface utilization.
i want to use it in the replacement of mrtg.

this works fine for the linux systems (as in interface i need to mention
only eth0/eth1)

whereas i am unable to configure this tool to check windows interface

i am getting proper result for linux based server
snmpwalk -v 2c -c public 172.20.20.4 ifDescr
IF-MIB::ifDescr.1 = STRING: lo
IF-MIB::ifDescr.2 = STRING: eth0
IF-MIB::ifDescr.3 = STRING: eth1
IF-MIB::ifDescr.4 = STRING: sit0

./check_snmp_ifstatus.pl -H 172.20.20.4 -C public -i eth0
OK: eth0 is UP at 1Gbps. RX=56.87Kbps (0%), TX=443Kbps
(0.04%)|RXbps=56868;85000;98000;0;100
000 TXbps=442973;85000;98000;0;10 RXpct=0%;85;98;0;100
TXpct=0.04%;85;98;0;100 e
lapsed=95s;34;;;

for windows based server
snmpwalk -v 2c -c public 172.20.20.126 ifDescr
IF-MIB::ifDescr.1 = STRING: MS TCP Loopback interface
IF-MIB::ifDescr.2 = STRING: Realtek RTL8139/810x Family Fast Ethernet NIC

./check_snmp_ifstatus.pl -H 172.20.20.126 -C public -i "MS TCP Loopback
interface"
CRITICAL: Could not match MS TCP Loopback interface

./check_snmp_ifstatus.pl -H 172.20.20.126 -C public -i "Realtek
RTL8139/810x Family Fast Ethernet NIC"
CRITICAL: Could not match Realtek RTL8139/810x Family Fast Ethernet NIC

kindly guide me the proper way to utilize this plugin for windows based
servers.

thanks & regards
dhaval thakar




-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Initial problems

2008-08-25 Thread Taylor Dondich
First off, the Nagios output states it is Nagios 3.0.1, not Nagios
2.x.  This is fine, just refer to the Nagios 3 documentation as some
things have changed, such as the first couple of warnings.

First off, it looks like you have a permissions issue on the
/usr/local/nagios/log directory.  Make sure the directory exists, and
it's contents are writable by the Nagios user.

Second, the fatal error states that the object property in
minimal.cfg, line 60 is invalid.  It may be a directive that is no
longer allowed in Nagios 3 or has been renamed (it has a habit of
doing that).  The book you described is great to understand the basics
of Nagios, but try to be aware there are differences and you should
verify it by looking at the Nagios 3 docs at
http://nagios.sourceforge.net/docs/3_0/

Taylor

On Mon, Aug 25, 2008 at 7:30 PM, scott <[EMAIL PROTECTED]> wrote:
> Greetings
>
>   I am just starting to use nagios and it looks promising.  Along with
> the documentation supplied, I am also using the text from the book Pro
> Nagios 2.0 by James Turnbull.
>
>   The version of nagios is 2.0 February 06, 2006 as this came with the
> SuSE Linux 11.0.
>
>   Tried starting nagious and get these messages in the start:
> nagios /usr/local/nagios/etc/nagios.cfg
>
> Nagios 3.0.1
> Copyright (c) 1999-2008 Ethan Galstad (http://www.nagios.org)
> Last Modified: 04-01-2008
> License: GPL
>
> Warning: comment_file variable ignored.  Comments are now stored in the
> status and retention files.
> Warning: downtime_file variable ignored.  Downtime entries are now
> stored in the status and retention files.
> Warning: aggregate_status_updates directive ignored.  All status file
> updates are now aggregated.
> Warning: Cannot open log file '/usr/local/nagios/log/nagios.log' for writing
> Nagios 3.0.1 starting... (PID=16379)
> Warning: Cannot open log file '/usr/local/nagios/log/nagios.log' for writing
> Local time is Mon Aug 25 22:20:44 EDT 2008
> Warning: Cannot open log file '/usr/local/nagios/log/nagios.log' for writing
> Warning: Cannot open log file '/usr/local/nagios/log/nagios.log' for writing
> Warning: Duplicate definition found for command 'notify-by-email'
> (config file '/usr/local/nagios/etc/minimal.cfg', starting on line 59)
> Warning: Cannot open log file '/usr/local/nagios/log/nagios.log' for writing
> Error: Could not add object property in file
> '/usr/local/nagios/etc/minimal.cfg' on line 60.
> Warning: Cannot open log file '/usr/local/nagios/log/nagios.log' for writing
> Bailing out due to one or more errors encountered in the configuration
> files. Run Nagios from the command line with the -v option to verify
> your config before restarting. (PID=16379)
>
>   I've tried using the afore mentioned directory to hold the log files
> as well as /var/log/nagios and /usr/local/nagios/etc but all three fail.
>
>   Suggestions on how to resolve these errors would be appreciated.
>
> -
> This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
> Build the coolest Linux based applications with Moblin SDK & win great prizes
> Grand prize is a trip for two to an Open Source event anywhere in the world
> http://moblin-contest.org/redirect.php?banner_id=100&url=/
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting 
> any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>



-- 
Taylor
Check out my Shortcut with O'Reilly Press:
Network Monitoring with Nagios:
http://oreilly.com/catalog/9780596528195/index.html

-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Initial problems

2008-08-25 Thread scott
Greetings

   I am just starting to use nagios and it looks promising.  Along with 
the documentation supplied, I am also using the text from the book Pro 
Nagios 2.0 by James Turnbull. 

   The version of nagios is 2.0 February 06, 2006 as this came with the 
SuSE Linux 11.0.

   Tried starting nagious and get these messages in the start:
nagios /usr/local/nagios/etc/nagios.cfg

Nagios 3.0.1
Copyright (c) 1999-2008 Ethan Galstad (http://www.nagios.org)
Last Modified: 04-01-2008
License: GPL

Warning: comment_file variable ignored.  Comments are now stored in the 
status and retention files.
Warning: downtime_file variable ignored.  Downtime entries are now 
stored in the status and retention files.
Warning: aggregate_status_updates directive ignored.  All status file 
updates are now aggregated.
Warning: Cannot open log file '/usr/local/nagios/log/nagios.log' for writing
Nagios 3.0.1 starting... (PID=16379)
Warning: Cannot open log file '/usr/local/nagios/log/nagios.log' for writing
Local time is Mon Aug 25 22:20:44 EDT 2008
Warning: Cannot open log file '/usr/local/nagios/log/nagios.log' for writing
Warning: Cannot open log file '/usr/local/nagios/log/nagios.log' for writing
Warning: Duplicate definition found for command 'notify-by-email' 
(config file '/usr/local/nagios/etc/minimal.cfg', starting on line 59)
Warning: Cannot open log file '/usr/local/nagios/log/nagios.log' for writing
Error: Could not add object property in file 
'/usr/local/nagios/etc/minimal.cfg' on line 60.
Warning: Cannot open log file '/usr/local/nagios/log/nagios.log' for writing
Bailing out due to one or more errors encountered in the configuration 
files. Run Nagios from the command line with the -v option to verify 
your config before restarting. (PID=16379)

   I've tried using the afore mentioned directory to hold the log files 
as well as /var/log/nagios and /usr/local/nagios/etc but all three fail.

   Suggestions on how to resolve these errors would be appreciated.

-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] ndoutils Query optimisation

2008-08-25 Thread Simon Finch
Hi Alan,
 
Up to yesterday I had a similar problem with our NDO db on the same spec box as 
yours.
 
Are your tables primarily type innodb like mine ?
I am asking as my NDO db was created by Centreon's install script and not the 
normal way, if so then the following may help.
Your initial post showed no setting for innodb_buffer_pool_size if you are 
using lots of innodb tables then this needs to be as big as you can afford.
Second is innodb_flush_log_at_trx_commit which defaults to 1, which is safe, 
but slow, it commits after every transaction.
 
I have 9 Nagios servers feeding data from 2500 hosts into 1 NDO db, when I 
started Mysql was reported by top as using 300 - 350% of cpu and the whole 
system was virtually unusable, now cpu usage is 10 - 50% with occasional peaks 
of 90% and all is good.
 
These are the settings I am using:
 
key_buffer_size=64M
thread_cache_size=32
innodb_buffer_pool_size=512M
innodb_flush_log_at_trx_commit=0
innodb_log_file_size=32M
read_buffer_size=256k
tmp_table_size=64M
join_buffer_size=8M
query_cache_size=32M
table_cache=1024
I am still fiddling with settings, but these seem good for now. I am not a dba, 
only a sysadmin who had to make it work. Hopefully I can help others with 
similar problems.
 
Thanks and regards,
 
Simon Finch
ESM Team Leader
St George Bank
Australia
 
 

>>> Alan Cooper <[EMAIL PROTECTED]> 26/08/2008 4:39 am >>>
Mikael Fridh wrote:
> On Wed, Aug 20, 2008 at 8:46 PM, Alan Cooper <[EMAIL PROTECTED]> wrote:
>   
>> Thanks for the replies,
>>
>> Unfortunately, all the tables are already indexed so I'm back to the
>> underlying problem is that there are so many joins on big tables so
>> MySQL has to copy to temp tables each time - I'd really like to
>> eliminate the need for this if possible.
>> 
>
> Show the EXPLAIN output as well as SHOW CREATE TABLE for all tables in
> your query.
>
>   

Hi Mikael,

Please find the details requested attached.

Thanks again for your help.

Regards,
Alan


*** This is an automatically appended message from St. George ITP.  If this 
email is SPAM please register it with WebSense MailControl by clicking this 
link 
https://www.mailcontrol.com/sr/zM1O!OPwutLTndxI!oX7UmGwwysDArWcDrzRWHDQJ5mU7sNPLDw9NuS3S+!tH0c3S!CJwybpcpidye7!a4hOmw==
 . ***

**
*   IMPORTANT INFORMATION*
This document should be read only by those persons to whom it is 
addressed and its content is not intended for use by any other 
persons.  If you have received this message in error, please notify 
us immediately.  Please also destroy and delete the message from 
your computer.  Any unauthorised form of reproduction of this message 
is strictly prohibited.

St George Bank Limited AFSL 240997, Advance Asset Management Limited 
AFSL 240902,  St George Life Limited AFSL 240900, ASGARD Capital Management 
Limited 
AFSL 240695 and Securitor Financial Group Limited AFSL 240687 is not liable for 
the proper and complete transmission of the information contained in 
this communication, nor for any delay in its receipt.
**

-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] external command to enable all active service checks

2008-08-25 Thread Mathieu Gagné
This should be the answer:

START_EXECUTING_SVC_CHECKS
http://www.nagios.org/developerinfo/externalcommands/commandinfo.php?command_id=41

Mathieu

Robert Anderson wrote:
> I am setting up a failover nagios instance as described here.
> 
> http://nagios.sourceforge.net/docs/3_0/redundancy.html#scenario_2
> 
> I have looked through the external commands but cannot find one that 
> will enable or disable all active service checks. Am I missing 
> something? I have found external command that apply to a single service, 
> servicegroup or host, but none that apply to all services. Please help.
> 
> Thanks,
> 
> -- 
> Rob Anderson
> [EMAIL PROTECTED] 
> 
> 
> 
> 
> -
> This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
> Build the coolest Linux based applications with Moblin SDK & win great prizes
> Grand prize is a trip for two to an Open Source event anywhere in the world
> http://moblin-contest.org/redirect.php?banner_id=100&url=/
> 
> 
> 
> 
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting 
> any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null


-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] external command to enable all active service checks

2008-08-25 Thread Robert Anderson
I am setting up a failover nagios instance as described here.

http://nagios.sourceforge.net/docs/3_0/redundancy.html#scenario_2

I have looked through the external commands but cannot find one that will
enable or disable all active service checks. Am I missing something? I have
found external command that apply to a single service, servicegroup or host,
but none that apply to all services. Please help.

Thanks,

-- 
Rob Anderson
[EMAIL PROTECTED]
-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] checks, notifications don't work after time period exception

2008-08-25 Thread Jon Angliss
On Mon, 25 Aug 2008 13:38:28 -0400, "Seth Simmons"
<[EMAIL PROTECTED]> wrote:

>The system date is correct and Nagios is showing the correct day and
>time on the status page.  If I stop Nagios, I don't see any other Nagios
>processes running.  The two templates here have not been touched.  I
>looked at both and have 24x7 for all times which is the default.  I just
>modified the time definitions as you have it listed and I'll see what
>happens tonight.  There are also no custom time settings in nagios.cfg;
>all the settings related to time zone, etc are commented out.

I had a similar issue. I tried excluding a time slot between 0300, and
0600 due to large DB loads causing website performance issues
(backups, indexing, and such).  When the start of the exception rolled
around, nagios would just stop checking that service, and would only
restart on a forced manual check, or restart of the nagios service.  I
ended up changing the theory to continue checking 24x7, but only alert
when outside the exception.  Not that it helps the issue, but it
stopped the symptoms.  I never got around to really digging into the
issue any deeper.  It did give me the added benefit of knowing when
the server was actually down during the maintenance window, and when
it was just being slow.  This was handy for management purposes so
they could calculate if they need more head count.
-- 
Jon Angliss


-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] nrpe and check_mailq

2008-08-25 Thread Marc Powell

On Aug 25, 2008, at 5:16 PM, Kevin Freels wrote:

> *AND* I can run check_mailq on the FreeBSD box as a mortal user (me),
> but not on the Linux box. I did notice that version of Perl on the  
> Linux
> box is 5.8.0, while the working FreeBSD is 5.005. Perl on the  
> FreeBSD is
> also chgrp'd to wheel, not root.
>
> Maybe somewhere in there lies the culprit?

The permissions on your mail queue directory for your mailer are  
probably more open on the boxen that work.

> I'm also not comfortable adding a generic user ('nagios') to sudo. I
> realize I can restrict the commands the user can use, but still

The alternative is to add the nagios user to a group that has read  
permissions to your mail queue or generally more open permissions for  
all users on the mail queue in general. The former may be comparable  
to the sudo approach and the latter is definitely less secure than  
either.

--
Marc


-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] nrpe and check_mailq

2008-08-25 Thread Matthew Pounsett


On 25-Aug-2008, at 18:16 , Kevin Freels wrote:


I'm also not comfortable adding a generic user ('nagios') to sudo. I
realize I can restrict the commands the user can use, but still


I'm not sure if this applies to your system or not, but you might be  
able to get away with just adding the 'nagios' user to the correct  
group to be able to read the mail spool directory, possibly /var/spool/ 
mqueue.  Again, not sure if it'll work on your system, but it's  
something to check.


HTH,
  Matt





PGP.sig
Description: This is a digitally signed message part
-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] nrpe and check_mailq

2008-08-25 Thread Kevin Freels
Okay, I started to go down the sudo path until I realized I have other
systems that check_mailq works just fine on without having to be root.
The difference is that the system it works on is a FreeBSD and the
system  that's choking is Linux.

check_mailq on the working system is running as the same user ('nagios')
as the Linux box, so neither of them are root. The perms/owners for
check_mailq are the same for both systems.

*AND* I can run check_mailq on the FreeBSD box as a mortal user (me),
but not on the Linux box. I did notice that version of Perl on the Linux
box is 5.8.0, while the working FreeBSD is 5.005. Perl on the FreeBSD is
also chgrp'd to wheel, not root.

Maybe somewhere in there lies the culprit?

I'm also not comfortable adding a generic user ('nagios') to sudo. I
realize I can restrict the commands the user can use, but still

BTW, thanks for the replies

k
-=-=-=-  

> > If I remember correctly, you need to be root to run mailq on most 
> > systems so you might need to give the nrpe user sudo access for it.
> 
> Yup, most likely permissions problem.
> 
> The easiest way to fix that is by adding it to sudo (nrpe 
> would call "sudo /usr/local/nagios/libexec/check_mailq  -w 
> 300 -c 500"). Make sure it does not have the requiretty 
> option or it will still fail from the daemon.
> 
> - --
> Thomas

-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] %40 in acknowledgement

2008-08-25 Thread Hugo van der Kooij
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Charles Breite wrote:
> We are using Nagios 2.9 and when someone acknowledges an alert it
> populates the Author(Your name) field with
> 
> username%40domainname. This should be [EMAIL PROTECTED]
> 
> Any ideas on how to make the @ appear properly?

My guess would be that your webbrowser is converting the character.

Which webbrowsers have you tried?

Hugo.


- --
[EMAIL PROTECTED]   http://hugo.vanderkooij.org/
PGP/GPG? Use: http://hugo.vanderkooij.org/0x58F19981.asc

A: Yes.
>Q: Are you sure?
>>A: Because it reverses the logical flow of conversation.
>>>Q: Why is top posting frowned upon?

Bored? Click on http://spamornot.org/ and rate those images.

-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.7 (GNU/Linux)

iD8DBQFIsyQHBvzDRVjxmYERAjgAAJ9BfZMHeJVegBLI8/qTwCxZaDizSQCdEl7a
ucV93wGPuJCPQbfKSrpGafc=
=97ka
-END PGP SIGNATURE-

-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] %40 in acknowledgement

2008-08-25 Thread Charles Breite
We are using Nagios 2.9 and when someone acknowledges an alert it
populates the Author(Your name) field with 

username%40domainname. This should be [EMAIL PROTECTED]

Any ideas on how to make the @ appear properly?

Thanks

 

 

 

-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] checks, notifications don't work after time period exception

2008-08-25 Thread Seth Simmons
The system date is correct and Nagios is showing the correct day and
time on the status page.  If I stop Nagios, I don't see any other Nagios
processes running.  The two templates here have not been touched.  I
looked at both and have 24x7 for all times which is the default.  I just
modified the time definitions as you have it listed and I'll see what
happens tonight.  There are also no custom time settings in nagios.cfg;
all the settings related to time zone, etc are commented out.

-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Mark
Young
Sent: Monday, August 25, 2008 11:44 AM
To: Nagios Users Mailinglist
Subject: Re: [Nagios-users] checks,notifications don't work after time
period exception

Hi Seth,

On Aug 25, 2008, at 8:05 AM, Seth Simmons wrote:

> We have a qa group overseas that will work on our customer sites  
> during the US overnight.  To avoid false alerts, I added a time  
> exception so notifications are not sent out between 4am and 5:30am.   
> The problem is, after the exception, Nagios (3.0.3) won't send  
> notifications, neither are checks performed for any sites with an  
> exception.  If a site is in a critical state either shortly after 4  
> or (if they start early) right before 4, checks do not continue  
> after 5:30.  When I look at Nagios later, it shows it in critical  
> and the last check was done at 3:58am with the next check at  
> midnight the next day.

When I start dealing with time problems with Nagios I have a small  
list that I try first just to it out.
* Check date/time of monitoring server and that it is the right  
timezone (UTC or whatever you want it as).
* Check that the Nagios web interface is displaying the time you  
expect it to (top left corner in most cgis).  In the nagios.cfg you  
may have set additional time information in there.
* Stop the nagios process, checking that there are are no other  
running instances left.  'service nagios stop' 'ps aux |grep nagios'
* Restart the nagios process.

Sometimes you can get duplicate Nagios daemons running and they can  
cause many odd problems like this.  Also I hope we are not dealing  
with any time translations with the "overseas" group.


>
> Let me give some more specific examples:
> Server-A is running abc.customer.com for us and our qa group takes  
> the site down at 3:55am, before the 4am exception.  Nagios will show  
> as critical until either midnight the next day, or you force a check  
> on the service.  So, say at 8am I look at it, the service is  
> critical with last check at 3:55am and next scheduled check at 12am  
> tomorrow.  When I force a check, it will continue on normal check  
> schedule and send notice that the service is ok.

So you are saying that "Server-A" is supposed to be checked in the  
timerange 24x7 minus 4:00am-5:30am each day, but when it stops at  
4:00am it will not start checking until the next day, unless you force  
it through an external command to start checking again?  It is  
possible that there could be a bug, but you seem to have a really  
common timeperiod definition type.  I normally suggest that users  
always run the checks 24x7 and then just modify the notification  
periods (like you did with 'Server-B).  But I would try it with a  
simple time definition first.

# Test timeperiod for the recycle service.
define timeperiod{
 timeperiod_name recycle
 alias   recycle
 sunday  00:00-04:00,05:30-24:00
 monday  00:00-04:00,05:30-24:00
 tuesday 00:00-04:00,05:30-24:00
 wednesday   00:00-04:00,05:30-24:00
 thursday00:00-04:00,05:30-24:00
 friday  00:00-04:00,05:30-24:00
 saturday00:00-04:00,05:30-24:00
 }


Also what does your "generic-service" and "local-service" templates  
look like?  There could be some settings that are following you  
through those templates.  Also you may have modified some settings in  
the nagios.cfg that makes changes to how nagios deals with time.


>
> Server-B is also running a site and tomcat is stopped at 4:10am.   
> This service has notification period with the same time period with  
> exceptions from 4am - 5:30am.  After that it will not send  
> notifications.  At 8am it is still doing checks and saying is  
> critical, but when looking at the details it says it has not sent  
> any notifications.  When I force a check it still won't do it.  If I  
> restart Nagios then it does a check it will send first notice.  I  
> don't see anything wrong with my time period so not sure where the  
> issue is.  Not sure if anyone else has noticed this before.

The difference between those are that they are using a different  
service template.  Server-B is using 'local-service'.

>
> Here is what I have for that time period and checks for the above  
> examples:
>
> define timeperiod{
> timeperiod_name   url-monitor
>

Re: [Nagios-users] Checking Windows Services with stupid names.

2008-08-25 Thread Michael Medin
Klaus Umbach skrev:
> 2008/8/25 Michael Medin <[EMAIL PROTECTED] >
>
> I would say: use NRPE instead :)
>
> but perhaps this is a bug, I have seen the reports on and off but
> haven't had time to look into it (since it is so easily solved
> with NRPE)
>
>
> NRPE? To check a Windows-Box? No, that's not really funny...
huh?
Why not?
I'we always thought of NRPE to be vastly superior to the NSClient 
protocol myself...

// Michael Medin
>
> I found out, how it works:Windows Services always have 2 names, the 
> internal one is better, than the description-name.
>
> Cheers
> Klaus
>
> -- 
> First Official Internetdenier

-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] checks, notifications don't work after time period exception

2008-08-25 Thread Mark Young
Hi Seth,

On Aug 25, 2008, at 8:05 AM, Seth Simmons wrote:

> We have a qa group overseas that will work on our customer sites  
> during the US overnight.  To avoid false alerts, I added a time  
> exception so notifications are not sent out between 4am and 5:30am.   
> The problem is, after the exception, Nagios (3.0.3) won’t send  
> notifications, neither are checks performed for any sites with an  
> exception.  If a site is in a critical state either shortly after 4  
> or (if they start early) right before 4, checks do not continue  
> after 5:30.  When I look at Nagios later, it shows it in critical  
> and the last check was done at 3:58am with the next check at  
> midnight the next day.

When I start dealing with time problems with Nagios I have a small  
list that I try first just to it out.
* Check date/time of monitoring server and that it is the right  
timezone (UTC or whatever you want it as).
* Check that the Nagios web interface is displaying the time you  
expect it to (top left corner in most cgis).  In the nagios.cfg you  
may have set additional time information in there.
* Stop the nagios process, checking that there are are no other  
running instances left.  'service nagios stop' 'ps aux |grep nagios'
* Restart the nagios process.

Sometimes you can get duplicate Nagios daemons running and they can  
cause many odd problems like this.  Also I hope we are not dealing  
with any time translations with the "overseas" group.


>
> Let me give some more specific examples:
> Server-A is running abc.customer.com for us and our qa group takes  
> the site down at 3:55am, before the 4am exception.  Nagios will show  
> as critical until either midnight the next day, or you force a check  
> on the service.  So, say at 8am I look at it, the service is  
> critical with last check at 3:55am and next scheduled check at 12am  
> tomorrow.  When I force a check, it will continue on normal check  
> schedule and send notice that the service is ok.

So you are saying that "Server-A" is supposed to be checked in the  
timerange 24x7 minus 4:00am-5:30am each day, but when it stops at  
4:00am it will not start checking until the next day, unless you force  
it through an external command to start checking again?  It is  
possible that there could be a bug, but you seem to have a really  
common timeperiod definition type.  I normally suggest that users  
always run the checks 24x7 and then just modify the notification  
periods (like you did with 'Server-B).  But I would try it with a  
simple time definition first.

# Test timeperiod for the recycle service.
define timeperiod{
 timeperiod_name recycle
 alias   recycle
 sunday  00:00-04:00,05:30-24:00
 monday  00:00-04:00,05:30-24:00
 tuesday 00:00-04:00,05:30-24:00
 wednesday   00:00-04:00,05:30-24:00
 thursday00:00-04:00,05:30-24:00
 friday  00:00-04:00,05:30-24:00
 saturday00:00-04:00,05:30-24:00
 }


Also what does your "generic-service" and "local-service" templates  
look like?  There could be some settings that are following you  
through those templates.  Also you may have modified some settings in  
the nagios.cfg that makes changes to how nagios deals with time.


>
> Server-B is also running a site and tomcat is stopped at 4:10am.   
> This service has notification period with the same time period with  
> exceptions from 4am – 5:30am.  After that it will not send  
> notifications.  At 8am it is still doing checks and saying is  
> critical, but when looking at the details it says it has not sent  
> any notifications.  When I force a check it still won’t do it.  If I  
> restart Nagios then it does a check it will send first notice.  I  
> don’t see anything wrong with my time period so not sure where the  
> issue is.  Not sure if anyone else has noticed this before.

The difference between those are that they are using a different  
service template.  Server-B is using 'local-service'.

>
> Here is what I have for that time period and checks for the above  
> examples:
>
> define timeperiod{
> timeperiod_name   url-monitor
> alias   url-monitor
> sunday 00:00-23:59
> monday   00:00-23:59
> tuesday00:00-23:59
> wednesday00:00-23:59
> thursday  00:00-23:59
> friday00:00-23:59
> saturday  00:00-23:59
> excluderecycle
> }



This is how I would have wrote the timeperiod definitions to make them  
more clear.  I've used the exclude method many times so I am sure that  
it works as you are expecting.

define timeperiod{
 timeperiod_name 24x7
 alias   24 Hours A Day, 7 

Re: [Nagios-users] Checking Windows Services with stupid names.

2008-08-25 Thread Klaus Umbach
2008/8/25 Michael Medin <[EMAIL PROTECTED]>

>  I would say: use NRPE instead :)
>
> but perhaps this is a bug, I have seen the reports on and off but haven't
> had time to look into it (since it is so easily solved with NRPE)
>

NRPE? To check a Windows-Box? No, that's not really funny...

I found out, how it works:Windows Services always have 2 names, the internal
one is better, than the description-name.

Cheers
Klaus

-- 
First Official Internetdenier
-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] checks, notifications don't work after time period exception

2008-08-25 Thread Seth Simmons
We have a qa group overseas that will work on our customer sites during
the US overnight.  To avoid false alerts, I added a time exception so
notifications are not sent out between 4am and 5:30am.  The problem is,
after the exception, Nagios (3.0.3) won't send notifications, neither
are checks performed for any sites with an exception.  If a site is in a
critical state either shortly after 4 or (if they start early) right
before 4, checks do not continue after 5:30.  When I look at Nagios
later, it shows it in critical and the last check was done at 3:58am
with the next check at midnight the next day.

 

Let me give some more specific examples:

Server-A is running abc.customer.com for us and our qa group takes the
site down at 3:55am, before the 4am exception.  Nagios will show as
critical until either midnight the next day, or you force a check on the
service.  So, say at 8am I look at it, the service is critical with last
check at 3:55am and next scheduled check at 12am tomorrow.  When I force
a check, it will continue on normal check schedule and send notice that
the service is ok.

 

Server-B is also running a site and tomcat is stopped at 4:10am.  This
service has notification period with the same time period with
exceptions from 4am - 5:30am.  After that it will not send
notifications.  At 8am it is still doing checks and saying is critical,
but when looking at the details it says it has not sent any
notifications.  When I force a check it still won't do it.  If I restart
Nagios then it does a check it will send first notice.  I don't see
anything wrong with my time period so not sure where the issue is.  Not
sure if anyone else has noticed this before.

 

Here is what I have for that time period and checks for the above
examples:

 

define timeperiod{

timeperiod_name   url-monitor

alias   url-monitor

sunday 00:00-23:59

monday   00:00-23:59

tuesday00:00-23:59

wednesday00:00-23:59

thursday  00:00-23:59

friday00:00-23:59

saturday  00:00-23:59

excluderecycle

}

 

define timeperiod{

timeperiod_name   recycle

alias   recycle

sunday 04:00-05:30

monday   04:00-05:30

tuesday04:00-05:30

wednesday04:00-05:30

thursday  04:00-05:30

friday04:00-05:30

saturday  04:00-05:30

}

 

define command{

command_namecheck_http_abc

command_line$USER1$/check_http -H abc.company.com

}

 

define service{

use
generic-service 

host_name
Server-A

service_descriptionsite abc

is_volatile0

check_period
url-monitor

max_check_attempts2

normal_check_interval 5

retry_check_interval  5

contacts
nagiosadmin

notification_interval   30

notification_period url-monitor

notification_options   w,c,r

check_command check_http_abc

}

 

define service{

use
local-service 

host_name Server-B

service_descriptionHTTP

check_period24x7

max_check_attempts2

normal_check_interval 3

retry_check_interval  5

contacts
nagiosadmin

notification_interval   60

notification_period url-monitor

notification_options   w,c,r

check_command check_http

}

-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/l

[Nagios-users] Checking Windows Services with stupid names.

2008-08-25 Thread Klaus Umbach
Hi,

I have a service on a windows-box to check, which has a "&" in the name.
(With check_nt and nsclient++)

with 'SERVICESTATE -l "Foo & Bar"' it checks for the services 'foo' and
'bar' , \&, \\\& and using ' instead of " did'n't help.

How do I check this?

Cheers
Klaus

-- 
First Official Internetdenier
-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] check_ntp_peer

2008-08-25 Thread Andreas Ericsson
Tijn wrote:
>> Not all servers support NTP Control packets used by check_ntp_peer. Do
>> you get anything with this command?
>>
>> ntpq -p myname
>>
>>   
> Yes thanks, Openntpd doesn't support ntpq. On the Openbsd-misc list 
> there was an aswer from Henning Brauer {= developer of OpenNTPD):
> On 2004 Aug 9, at 4:03 AM, Henning Brauer wrote:
>>  > [I]s support planned for [ntpq -p in] the future?
>>  >
>>  > no.
> 
> 2004 is some years ago..
> 

I'm not surprised. Openntpd is designed to be a much more lightweight
version than bog-standard ntpd. Primarily to not have to haul around
refclock support to each and every installation.

If it's any relief though, ntpq *will* work on all stratum1 servers,
as only vanilla ntpd has refclock support.

-- 
Andreas Ericsson   [EMAIL PROTECTED]
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null