[Nagios-users] Help with large scale planning

2008-05-20 Thread Mark Potter
Hello List,

Been a while since I have been able to post on a regular basis due to
being given the opportunity to seek other employment! I have landed,
gracefully, in a position where I have been tasked with designing a
large scale Nagios installation. The requirements are all client
requirements and pretty necessary. I need a little advice on where to
start. I will describe the environment and then lay out my idea on how I
see the design coming together.

The environment is an HPCC environment and the requirements are based on
that aspect almost exclusively. Initially the monitoring will be for
clustering only and then expanding out to other servers outside of the
HPCC environment.

1. No client installed on compute nodes (there is an HA head node where
a client or a full install could be done).
2. No active checks directly to compute nodes
3. Ganglia is available for node data

That is the majority of the requirements. Ganglia makes things a bit
easier but I am not sure how much easier. It looks like GroundWork could
handle this but I don't see the large scale features available in the
open source version.

The environment is as follows

1. 80 clusters
2. Each cluster has 70-72 compute nodes 


The client wants a single point of monitoring for this environment. I am
looking at the following for a setup:

Using the ganglia plugin from Nagios Exchange to gather and parse the
data, on the HA head node, and having this report back to a main Nagios
server (HA) for the single point of monitoring.

What I don't know is how Nagios 3.x will scale with ~5000-6000 hosts
coming into a single point of monitoring. What cannot happen is the
checks causing any degradation in the HPCC environment. Ganglia is
already in place and accounted for in performance so querying the
ganglia process is allowed but they would prefer to pull this data from
gmetad and not gmond.

Also, from the Nagios management side, I would like to see if there is a
way to automatically add hosts if a new host pops up in the ganglia
data. This is not a deal breaker but will make life so much easier in
the long run. I have likely not given enough information somewhere but I
think there is enough here to get a discussion started.

It's good to be back!
Regards,

Mark L. Potter
eXcellence in IS Solutions, Inc. (X-ISS) 
Office:  713-862-9200  x219
Email : [EMAIL PROTECTED] 
http://www.x-iss.com  

Making IT Work for You
HPC & Enterprise IT Solutions

* HPC Application Acceleration
* Cluster Design, Deploy, Manage, Train
* Linux/Windows Integration
* Remote Management, Backup, Anti-Spam/Virus
* Network Assessments, Design
* Security Audits, Design
* Datacenter Design, Relocation
* Messaging and Collaboration



NOTICE:
This message may contain privileged or otherwise confidential information.
If you are not the intended recipient, please immediately advise the sender
by reply email and delete the message and any attachments without using,
copying or disclosing the contents.



-
This SF.net email is sponsored by: Microsoft 
Defy all challenges. Microsoft(R) Visual Studio 2008. 
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Monitoring Connection attempts [OT Response]

2008-03-05 Thread mark . potter
Best Regards,

Mark L. Potter
Systems Engineer
Academy Sports & Outdoors
1800 N. Mason Rd
Katy, Texas 77449
Office: 281-646-5857
Cell: 281-734-6965


[EMAIL PROTECTED] wrote on 03/05/2008 04:04:16 
PM:

> -BEGIN PGP SIGNED MESSAGE-
> Hash: SHA1
> 
> Matthew Macdonald-Wallace wrote:
> | Hi All,
> |
> | Before I start coding my own plugin to do this, does anyone know of a
> | plugin that monitors the number of external connection attempts over a
> | given period of time for a given service and sends alerts accordingly?
> |
> | I've noticed on a number of servers that we maintain recently that
> | there are unauthorised attempts to connect via SSH/FTP.  These appear
> | in the log files about 2 seconds apart and are obviously automated.
> 
> They are in fact rooted machines running a bot doing the work. these are
> around for years and apparantly people still think that they can put up
> unpatched servers on the internet.
> 
> But unless you will report the server to th owner of the network or
> anything like that I think it will be just adrain of resources without
> any benefit.
> 
> Hugo.
> 
As much as reporting them sounds nice I find only about one in ten reports 
ever has any response and only about one on five of those has anything 
done about it. I use a combination of scripts to simply null route the 
bastards and be done with them.
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Notifications

2008-03-04 Thread mark . potter
[EMAIL PROTECTED] wrote on 03/04/2008 04:21:46 
PM:

> -BEGIN PGP SIGNED MESSAGE-
> Hash: SHA1
> 
> Hi,
> 
> I am unable to get nagios to send out notifications. I have postfix
> and mailx installed. I have checked all my settings and have been
> unable to get it to work. I check the mail log and it appears that the
> notification attempts have been made but I cannot receive the
> notifications.
> 
> Kevin


It truly is vague question day on the list. If the messages are making it 
to the maillog then it is most likely, in fact I am certain, not a Nagios 
issue. I suspect it is somewhere in your mailer configuration. Without 
more information there is precious little I can say. Check the Nagios logs 
to see if they are sent from Nagios, check the maillog to see what is 
happening there. If Nagios is sending them and they are making it to your 
mailer then Nagios is working and your mailer needs troubleshooting.

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] multiple notifications on some alerts

2008-03-04 Thread mark . potter
[EMAIL PROTECTED] wrote on 03/04/2008 02:42:09 
PM:

> We have been getting multiple notifications on some of our alerts
> recently and I have been tasked with trying to find out why this is
> happening.
> 
> I have checked the services.cfg, contacts.cfg and contactgroups.cfg
> files, all appear correct.  I have checked the Exchange Server, the
> distribution group and members are correct.
> 
> Am I missing checking somewhere?
> 
> I have made a search online and have seen that others are reporting this
> also, but did not see a solution.
> 
> Thanks
> Mike Welsh
> 
> 
1. What version of Nagios are you using
2. On what OS it running
3. Attach at least the configs related to your multiple alerts. I would 
include contacts.cfg, contactgroups.cfg, at least one service for which 
you are getting multiple alerts, and so on.

I am sure someone can help you out but we will never solve the problem 
taking shots in the dark.-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Checking contents of a file on MS Windows

2008-03-04 Thread mark . potter
So I have run across one for which I can't figure out a starting point. We 
are using Tivoli to backup our farm including databases. For the most part 
this is easy to monitor as we can just monitor Tivoli. However the 
database backups are not scheduled through Tivoli but rather in their 
respective DBs as scheduled procedures. (I am not a DBA so forgive me if 
my language on that piece is off). The only actual method for determining 
the success or failure of these backups is a log file on the Windows 
server. We are using SNMP for all our Windows monitoring at the moment but 
are open to other solutions. I need to be able to access this file, grep 
out the date and status, and process those results for Nagios. Does anyone 
know of a simple way to do this and if not then does anyone know of a 
complicated method? I am not locked into to any solution and would be 
willing to use active or passive checks, snmp, nc_net, or any other 
solution. Thanks in advance for any replies or advice.

Best Regards,

Mark L. Potter
Systems Engineer
Academy Sports & Outdoors
1800 N. Mason Rd
Katy, Texas 77449
Office: 281-646-5857
Cell: 281-734-6965
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] SNMP/Windows disk usage

2008-02-22 Thread mark . potter
Melanie Pfefer <[EMAIL PROTECTED]> wrote on 02/22/2008 09:16:51 
AM:

> Hello Mark, all,
> Yes true but what to put instead of the hostname(TROY)
> in?
> check_snmp_windows_disk!TROY!95!98!^[CDE]:!
> in case I want to put the hostgroup_name 
> 
> thanks
You would use hostgroup_name in the service definition and this line:

check_snmp_windows_disk!$HOSTADDRESS$!95!98!^[CDE]:

You can use $HOSTNAME$ but I wouldn't as DNS issues will then cause your 
checks to fail.-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] SNMP/Windows disk usage

2008-02-22 Thread mark . potter
Melanie Pfefer <[EMAIL PROTECTED]> wrote on 02/22/2008 05:53:03 
AM:

> also another question please. I had no problem on my
> test machine. On my prod machine I got this error:
> 
> (No output!) 
> 
> according to
> http://nagios.manubulon.com/faq.html#FAQ8, I should
> apply a patch. In that case, how to apply this patch?
> 
> according to
> http://www.nagios.org/faqs/viewfaq.php?faq_id=336, I
> need to add use lib "path-to-utils.pm" to the perl
> script. how to specify that path?
> 
> thanks
> PS: I m using Linux fedora
> 
(No Output) could be a variety of things. First run the command from the 
command line with the -v switch to see the verbose output, as the nagios 
user of course, and from there begin troubleshooting. I doubt it is option 
#2 since I have not had to modify the script on any server. If you have to 
apply a patch I would start with contacting the developer of the script.-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] SNMP/Windows disk usage

2008-02-22 Thread mark . potter
Melanie Pfefer <[EMAIL PROTECTED]> wrote on 02/22/2008 05:34:39 
AM:

> thanks indeed
> 
> Now if I want to use hostgroup_name instead of
> host_name,
> 
> ow to write the check_command line in services.cfg?
> 
> check_command 
> check_snmp_windows_disk!TROY!95!98!^[CDE]:!
> 
> is working for TROY only What should I do to apply it
> on all hosts of a group?
> 
> thx

Just use hostgroup_name instead of host_name. That's in the Nagios docs.-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Fwd: help needed

2008-02-22 Thread mark . potter
[EMAIL PROTECTED] wrote on 02/22/2008 12:14:56 
AM:

> 
> Hi,
> 
> Thanks all for the replies. But hard luck. Ultimately I have enabled
> every single option by putting * against it. Still "View Config" 
> seems to be not working for me.
> 
> nagios_check_command=/usr/local/NAGIOS/libexec/check_nagios /usr/
> local/NAGIOS/var/status.dat 5 '/usr/local/NAGIOS//bin/nagios'
> 
> use_authentication=0
> authorized_for_system_information=*
> authorized_for_configuration_information=*
> authorized_for_system_commands=*
> authorized_for_all_services=*
> authorized_for_all_hosts=*
> authorized_for_all_service_commands=*
> authorized_for_all_host_commands=*
> And still getting the same error. Now what ???
> 
> 
I would change all of that:

use_authentication=1

and then put the same name you used for the http authorization where you 
have the "*" on each line. Restart both apache and nagios in case there is 
any issue with the http auth and go from there. I believe, but could be 
wrong, since you have use_authentication set to 0 that it is not parsing 
anything having to do with authorization afterwards.-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] init script

2008-02-21 Thread mark . potter
[EMAIL PROTECTED] wrote on 02/21/2008 03:53:22 
PM:

> Can someone please send me the init script?
> I didn't install it before I got rid of the source.
> 
> 
Here is one for CentOS: http://www.unixservers.us/content/view/51/69/

Of course I imagine that http:www.google.com has more. As for sending one 
it's sort of hard because you haven't said whether you are running on 
Linux (what distro), BDS, Solaris, etc. I could send you mine but we are a 
SLES shop and I don't know what you run. I hope the CentOS one helps you 
out.
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] check time syncronization

2008-02-21 Thread mark . potter
Hari Sekhon <[EMAIL PROTECTED]> wrote on 02/21/2008 01:52:15 PM:

> [EMAIL PROTECTED] wrote:
> > Another option would to be used check_by_ssh. I am, of course, 
> > assuming they are allowed to use ssh but a machine with no remote 
> > connectivity is a problem to begin with. check_by_ssh isn't quite as 
> > nice as nrpe but it would accomplish the checks in question. One could 

> > also write a pretty simple wrapper to check the time on both servers, 
> > compare it, and account for the lag between the checks. It wouldn't be 

> > pretty but it would work for the most part.
>
> check_by_ssh? I'd avoid that at any cost, ssh is too powerful, it's the 
> equivalent of nrpe with the "don't blame me flag if you get hacked". If 
> you can't use nrpe, then you certainly can't give out ssh access.

This is patently untrue. NRPE opens a new port and introduces new 
processes to an environment. This has to be vetted through all security 
testing and that can take months at some companies only to have it fail 
because they do not understand it. If they are admining Linux boxes 
already I am betting they have ssh running in the environment and properly 
locked down at many levels. SSH may be more powerful than NRPE as far as 
what could happen but it is also running in a lot more places. It is an 
alternative if you can't get NRPE approved. The final statement is false 
as well. "If you can't use nrpe, then you certainly can't give out ssh 
access". I can assure you that there are many environments where the 
security admins are more concerned about introducing new processes that 
use open ports than they are about giving out ssh access when properly 
locked down. It is really very simple to allow ssh access by IP and chroot 
the nagios user making ssh no more of a risk than nrpe and not introducing 
a new "threat" into the farm. The security admins are likely wrong but 
they are also the ones calling the shots in many cases.

> 
> Also, I'm not sure it's worth writing any wrapper, since any which way 
> you'd still need a remote execution mechanism. By the time you have any 
> remote execution mechanism, then surely you should use the standard 
> check_ntp plugin...

You don't need a remote execute mechanism:
HOST-RESOURCES-MIB::hrSystemDate.0
That will pull the system date via snmp without a remote execute 
mechanism. Meaning you could, in theory, pull the date off of two systems 
with one being the ntp server, write some logic in for the lag between the 
two checks, and compare the time without any remote execute mechanism 
whatsoever. It would not be as reliable as anything using a remote execute 
method because the lag between the two checks will vary but it could be 
used as a basic check without too much trouble.

> 
> I think that SNMP, NSCA would be your best bets, but if you can't have 
> anything, there is also one more remote possibility. Timestamping 
> through ICMP. Long shot but it can be done, you'd probably need to write 

> a custom plugin though and if it's a tight environment then likely this 
> would be blocked.

I was referring to writing a wrapped for snmp checks. Amazing that you 
suggest using snmp. I highly doubt nsca can be used if nrpe cannot. SNMP 
or SSH are likely the only options for the scenario as presented. Also I 
don't know if you noticed the OP's email address but I am betting he deals 
with a lot more red tape than most us in getting anything changed, 
installed, and so on.
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] check time syncronization

2008-02-21 Thread mark . potter
[EMAIL PROTECTED] wrote on 02/21/2008 10:40:41 
AM:

> Goltz, Jim (NIH/NLM/LHC) [C] wrote:
> > Sorry. I meant, there's no plugin that lets you check from the Nagios
> > host whether host A is in sync with NTP server B. 
> >
> > NRPE works, but we can't use it here because of security issues.  I 
keep
> > forgetting not everyone is prohibited from running NRPE.
> > 
> If you have ntp on each server, and configure it correctly, you could 
> use check_ntp -H remote_server -w 15 to test if it's out of sync with 
> the Nagios server
> 
> The on the nagios server itself have the same check but against the real 

> ntp server, this way you've got a chain of correct times or not.
> 
> Not ideal, but one possible workaround.
> 
> Other ideas could be SNMP on the remote hosts, or NSCA to feed back to 
> the nagios server...
> 
> -h
> 
> -- 
> Hari Sekhon
> 
Another option would to be used check_by_ssh. I am, of course, assuming 
they are allowed to use ssh but a machine with no remote connectivity is a 
problem to begin with. check_by_ssh isn't quite as nice as nrpe but it 
would accomplish the checks in question. One could also write a pretty 
simple wrapper to check the time on both servers, compare it, and account 
for the lag between the checks. It wouldn't be pretty but it would work 
for the most part.-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] help needed

2008-02-21 Thread mark . potter
[EMAIL PROTECTED] wrote on 02/20/2008 11:49:55 
PM:

> Hi ,
> 
> We are not able to access  "View Config" option tab in Web Access. 
> Getting the CGI permission error. Could someone point out what could
> be the problem/s


Depending on your version here is where you start:

http://nagios.sourceforge.net/docs/3_0/ - 3.x docs
http://nagios.sourceforge.net/docs/2_0/ - 2.x docs

If you are going to get help on the list you will likely need to give at 
least the following information:

1. Nagios version
2. OS and version
3. What troubleshooting you have done thus far

If you want support without doing any of your own troubleshooting then 
nagios.com might be your best bet.-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Memory Usage for Process

2008-02-20 Thread mark . potter
Hari Sekhon <[EMAIL PROTECTED]> wrote on 02/20/2008 12:15:58 PM:

> [EMAIL PROTECTED] wrote:
> >
> > [EMAIL PROTECTED] wrote on 02/20/2008 
> > 11:23:35 AM:
> >
> > > Are there any plugins that will monitor memory use of processes on a
> > > Windows system?
> > > 
> > > Thanks
> > > 
> > > Ken
> > > 
> > > Ken Harford
> >
> >
> > http://www.nagiosexchange.org/Windows.49.0.html?&tx_netnagext_pi1 
> >  >[p_view]=686 
> >
> >
> > That claims to do exactly what you want. I haven't used it so I can't 
> > say more than that.
> Had a quick look at that, it's very short, and you will need snmp on the 

> windows hosts available. I think WMI would be a better choice here if 
> you could write something to do this, otherwise you'd have to go and set 

> up snmp on all the windows devices you wanted to monitor in this way and 

> open up the firewalls too.
> 
> -h
> 
> -- 
> Hari Sekhon
> 
Yeah I tend to forget that not everyone has SNMP turned on. We have HP 
boxes and use HPSIM so SNMP is already enabled on every one of our hosts 
as part of the setup process including secure community strings and so on. 
This is a boon for me because I get to use SNMP for a majority of 
monitoring tasks and don't have to go through change control to get stuff 
installed on the farm. If you don't already have it enabled I can see it 
being a massive chore. Good point Hari and thanks for the response.
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Memory Usage for Process

2008-02-20 Thread mark . potter
[EMAIL PROTECTED] wrote on 02/20/2008 11:23:35 
AM:

> Are there any plugins that will monitor memory use of processes on a
> Windows system?
> 
> Thanks
> 
> Ken
> 
> Ken Harford


http://www.nagiosexchange.org/Windows.49.0.html?&tx_netnagext_pi1
[p_view]=686

That claims to do exactly what you want. I haven't used it so I can't say 
more than that.-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] State Stalking and notifications

2008-02-20 Thread mark . potter
[EMAIL PROTECTED] wrote on 02/20/2008 09:46:12 
AM:

> 
> On more than a few occasions, I've had users ask me if it's possible to
> have alerts sent if the output of a check changes.  That is, if you
> consider that a check might actually have multiple failure conditions it
> could be monitoring.  And example of this might be a single command you
> run that reports about multiple aspects of an application.  The check
> goes critical if one part of that app becomes non-functional, but if a
> second piece goes off-line, the state of the check hasn't changed, so no
> notification is sent.
> 
> I read about "state stalking" and thought that this might be what I was
> looking for.  However, after reading a bit more closely, it appears that
> what state stalking is really about is logging results of checks and not
> about sending additional notifications.
> 
> I understand the philosophy of "well if one part is broken or two parts
> are broken, you have to fix it anyway".  There's also the hack of having
> recurring notifications sent (say every 30 minutes) which should send a
> user the latest check output.  Neither of those options are particularly
> nice for us.  I had thought about writing a custom check for each line
> of output that this command generates, but that seems needlessly
> painful.
> 
> I see that volatile checks are more about notifications, however they
> will send a notification every time the service comes back non-OK (i.e.
> even if the check output is exactly the same) which is not what we want.
> 
> I'm guessing the answer here is "Nagios can't do that", but I thought
> I'd ask anyway.
> 
> Thanks
> 
> Mark
> 

Technically Nagios can't do that. At least not from the vantage point you 
have described. We are much more granular in our monitoring for exactly 
the scenario you have described. At this point we don't combine multiple 
pieces into a single service check unless a department manager 
specifically requests a full overview in a single check. We monitor each 
piece with its own service check so we have complete control over who get 
notified for what, when they get notified, how often they get notified, 
and so on. I would say that Nagios can do what you want but that it is up 
to you to make your checks more granular.
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] SNMP/Windows disk usage

2008-02-20 Thread mark . potter
"Valdinger, Stephen (DOV, MSX)" <[EMAIL PROTECTED]> wrote on 
02/20/2008 07:09:34 AM:

> -Original Message-
> From: Melanie Pfefer [mailto:[EMAIL PROTECTED] 
> Sent: Wednesday, February 20, 2008 8:01 AM
> To: [EMAIL PROTECTED]
> Cc: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] SNMP/Windows disk usage
> 
> Hello again
> 
> I downloaded the perl script and indeed I can use it
> manually:
> ./check_snmp_storage.pl -H TROY -C public -m ^[CDE]:
> -w 80% -c 90%
> 
> D:\ Label:  Serial Number 225e3590:
> 3%used(2587MB/89495MB) C:\ Label:  Serial Number
> 8cfa6a13: 27%used(6644MB/25000MB) (<80%) : OK
> 
> 
> 
> In nagios, I should define a service and a command to
> start using check_snmp_storage.pl. Could you please
> help in what I should add?
> 
> Many thanks
> 
> <>
> 
> Melanie, 
> 
>Here is a sample command and service definition that I use with
> this plugin:
> 
> In commands.cfg:
> 
> define command{
> command_namecheck_c:\
> command_line$USER1$/check_snmp_storage.pl -H $HOSTADDRESS$
> -C $USER3$ -m ^C: -w 85 -c 95
> }
> 
> In services.cfg (or wherever you place your service definitions)
> 
> define service{
>use   generic-service
>host_name   
>service_description   Check C:\ Usage Data
>check_command   check_c:\
>}
> 
> This is how we use it on our network, your needs may be different. 
> Remember when testing to plugin to test as root and to chown
> nagiosuser.nagiosgroup when you put it into libexec for it to work
> correctly. 
> 

Some systems may have issues with the "\" in a command name. I have seen 
it blow things up so there is a point on which care needs to be taken. I 
would also recommend against hard coding -w and -c values where at all 
possible. It makes using the same command for multiple services with 
different needs much more difficult. Also you should always test as the 
nagios user and not as root. Testing as root really does nothing other 
than show permissions and rights issues if the command fails to work as 
the nagios user. The script should be in libexec, owned by the nagios user 
and group, and tested as the nagios user. If that doesn't work then test 
it as root for the same dir with the same ownership to see if it works at 
all.
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] SNMP/Windows disk usage

2008-02-20 Thread mark . potter
Melanie.

1. When you test the script are you testing it as the nagios user?
2. Does the script reside in the libexec folder under the nagios user's 
home directory?

If the answer to the above questions is yes then:

define  command {
command_namecheck_snmp_windows_disk
command_line$USER1$/check_snmp_storage -H $HOSTADDRESS$ -C 
$ARG1$ -q FixedDisk -w $ARG2$ -c $ARG3$ -m $ARG4$ $ARG5$
}
That is my command definition. $ARG5$ is an optional switch for exclusion.

One of the check commands (with security info snipped)

define  service {
host_name  
service_description SNMP - Windows - Disk - Production
is_volatile   0
check_command check_snmp_windows_disk!!95!98!^C:!-e
max_check_attempts  3
normal_check_interval120
retry_check_interval30
active_checks_enabled1
passive_checks_enabled 1
check_period24x7
parallelize_check   1
obsess_over_service 1
check_freshness 0
event_handler_enabled   1
flap_detection_enabled  1
process_perf_data   1
retain_status_information   1
retain_nonstatus_information1
contact_groups  sysalertstest
notification_interval   120
notification_period 24x7
notification_optionsw,u,c,r
notifications_enabled   1
register1
}

That is the check for our production windows servers excluding the C drive 
(the -e is the exclude switch).

I hope this information is helpful.

Melanie Pfefer <[EMAIL PROTECTED]> wrote on 02/20/2008 07:00:30 
AM:

> Hello again
> 
> I downloaded the perl script and indeed I can use it
> manually:
> ./check_snmp_storage.pl -H TROY -C public -m ^[CDE]:
> -w 80% -c 90%
> 
> D:\ Label:  Serial Number 225e3590:
> 3%used(2587MB/89495MB) C:\ Label:  Serial Number
> 8cfa6a13: 27%used(6644MB/25000MB) (<80%) : OK
> 
> 
> 
> In nagios, I should define a service and a command to
> start using check_snmp_storage.pl. Could you please
> help in what I should add?
> 
> Many thanks
> 
> 
> --- [EMAIL PROTECTED] wrote:
> 
> > You will need to actually download the plugin from 
> > http://nagios.manubulon.com/snmp_storage.html as it
> > is not part of the 
> > Nagios plugins package but rather a third party
> > plugin.
> > 
> > Melanie Pfefer <[EMAIL PROTECTED]> wrote on
> > 02/19/2008 03:37:04 
> > PM:
> > 
> > > Hello
> > > 
> > > I added this:
> > > 
> > > define command{
> > > command_namecheck_win_storage
> > > command_line 
> > $USER3$/check_snmp_storage.pl
> > > -H $HOSTADDRESS$ -C $ARG1$ -m $ARG2$ -w $ARG3$ -c
> > > $ARG4$
> > > }
> > > 
> > > 
> > > define service {
> > > use common-service
> > > hostgroup_name  nt-servers
> > > service_description Check Windows
> > drives
> > > check_command 
> > > check_win_storage!public!^[CDE]:!80!90!
> > > }
> > > 
> > > the result was: Return code of 127 is out of
> > bounds -
> > > plugin may be missing
> > > 
> > > 
> > > Any idea?
> > > thanks
> > > --- [EMAIL PROTECTED] wrote:
> > > 
> > > > Melanie,
> > > > 
> > > > I have setup the plugin available here: 
> > > > http://nagios.manubulon.com/snmp_storage.html
> > and it
> > > > works perfectly for 
> > > > our needs. It uses SNMP so you would have to
> > have
> > > > SNMP setup on all of 
> > > > your Windows boxes but with that caveat it is
> > the
> > > > best solution I have 
> > > > found for monitoring windows partitions. If you
> > are
> > > > looking for something 
> > > > slightly less complicated or with fewer options
> > > > there are many plugins on 
> > > > nagiosexchange.com that will do the job. I would
> > > > recommend 
> > > > check_snmp_storage though due to the options and
> > the
> > > > robustness of the 
> > > > plugin.
> > > > 
> > > > Best Regards,
> > > > 
> > > > Mark L. Potter
> > > > Systems Engineer
> > > > Academy Sports & Outdoors
> > > > 1800 N. Mason Rd
> > > > Katy, Texas 77449
> > > > 
> > > > 
> > > > 
> > > > Melanie Pfefer <[EMAIL PROTECTED]>
> > wrote on
> > > > 02/19/2008 03:04:23 
> > > > AM:
> > > > 
> > > > > Hello Mark,
> > > > > I am interested in monitoring the disks even
> > with
> > > > no
> > > > > exclusions.
> > > > > I installed nagios2.9 and the nagios plugin on
> > a
> > > > linux
> > > > > box.
> > > > > 
> > > > > Would it be possible that you give me what
> > you?ve
> > > > done
> > > > > to monitor the partitions on windows?
> > > > > 
> > > > > I appreciate your help.
> > > > > 
> > > > > --- [EMAIL PROTECTED] wrote:
> > > > > 
> > > > > > [EMAIL PROTECTED]
> > wrote
> > > > on
> > > > > > 02/18/2008 11:22:05 
> > > > > > AM:
> > > > > > 
> > > > 

Re: [Nagios-users] Feature requests

2008-02-20 Thread mark . potter
[EMAIL PROTECTED] wrote on 02/20/2008 02:40:59 
AM:

> Hi
> 
> we are integrating nagios in our company. we have nearly 3 dozens of
> servers and several hundred clients. We are 15 admins split in 3 groups.
> Our internal organisation is split in functions, eg. somebody cares for
> the mail stuff, another one cares for the web servers and so on.
> 
> >From what I've seen while configuring contacts, contact groups, 
services
> and service groups and timeperiods there are a few limitations that from
> our point of view gave a bit of a headache. We tried to map our
> organisation to somehow fit into Nagios:
> 
> - contact groups cannot be grouped. This would be great. Because the 
boss
> of such an organisational unit could then see all tests that belong to 
his
> OU.
Contact groups cannot be grouped but contacts can be members of multiple 
groups. Bosses, in my limited experience either want to see problems when 
they happen and when they recover or when they have not been handled. I 
have yet to see a boss who really wants to see all notifications. I have 
met a couple who thought they wanted all notifications. This has never 
lasted more than a week before escalations were implemented. See the 
documentation for escalations: 
http://nagios.sourceforge.net/docs/2_0/escalations.html

> - a contact cannot be associated with more than 1 time period. Thus, I
> cannot be advertised from 0900 - 1700 by email (as I am in the office 
all
> day) and from 1700 - 0900 by sms unless I configure another contact for
> myself.

This is accomplished by using escalations. 

> - service groups cannot be associated with contact groups and vice 
versa.
> this would be very nice if serveral services are grouped then the 
service
> group could be associated with the contact group.
> - and finally theres a bug in the Nagios 2.6 (Debian Etch Stable)
> documentation regarding contacts: A contact must have an email address.
> The nagios documentation states otherwise.

I, personally, never recommend using the distro package for nagios. 2.6 is 
older, the packages are not maintained by Ethan, and the latest version is 
2.10. We are a SLES shop and SLES is pretty good about updating their 
packages but I much prefer to compile Nagios from source to have that 
little bit of control. I would check the online docs concerning your 
problem and download 2.10, compile it, and go from there. Your other 
problems are all handled by escalations pretty easily.
> 
> 
> Thanks for consideration
> Best,
> Philipp
> 
> 
> 
-
> This SF.net email is sponsored by: Microsoft
> Defy all challenges. Microsoft(R) Visual Studio 2008.
> http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when 
> reporting any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] SNMP/Windows disk usage

2008-02-19 Thread mark . potter
You will need to actually download the plugin from 
http://nagios.manubulon.com/snmp_storage.html as it is not part of the 
Nagios plugins package but rather a third party plugin.

Melanie Pfefer <[EMAIL PROTECTED]> wrote on 02/19/2008 03:37:04 
PM:

> Hello
> 
> I added this:
> 
> define command{
> command_namecheck_win_storage
> command_line $USER3$/check_snmp_storage.pl
> -H $HOSTADDRESS$ -C $ARG1$ -m $ARG2$ -w $ARG3$ -c
> $ARG4$
> }
> 
> 
> define service {
> use common-service
> hostgroup_name  nt-servers
> service_description Check Windows drives
> check_command 
> check_win_storage!public!^[CDE]:!80!90!
> }
> 
> the result was: Return code of 127 is out of bounds -
> plugin may be missing
> 
> 
> Any idea?
> thanks
> --- [EMAIL PROTECTED] wrote:
> 
> > Melanie,
> > 
> > I have setup the plugin available here: 
> > http://nagios.manubulon.com/snmp_storage.html and it
> > works perfectly for 
> > our needs. It uses SNMP so you would have to have
> > SNMP setup on all of 
> > your Windows boxes but with that caveat it is the
> > best solution I have 
> > found for monitoring windows partitions. If you are
> > looking for something 
> > slightly less complicated or with fewer options
> > there are many plugins on 
> > nagiosexchange.com that will do the job. I would
> > recommend 
> > check_snmp_storage though due to the options and the
> > robustness of the 
> > plugin.
> > 
> > Best Regards,
> > 
> > Mark L. Potter
> > Systems Engineer
> > Academy Sports & Outdoors
> > 1800 N. Mason Rd
> > Katy, Texas 77449
> > 
> > 
> > 
> > Melanie Pfefer <[EMAIL PROTECTED]> wrote on
> > 02/19/2008 03:04:23 
> > AM:
> > 
> > > Hello Mark,
> > > I am interested in monitoring the disks even with
> > no
> > > exclusions.
> > > I installed nagios2.9 and the nagios plugin on a
> > linux
> > > box.
> > > 
> > > Would it be possible that you give me what you?ve
> > done
> > > to monitor the partitions on windows?
> > > 
> > > I appreciate your help.
> > > 
> > > --- [EMAIL PROTECTED] wrote:
> > > 
> > > > [EMAIL PROTECTED] wrote
> > on
> > > > 02/18/2008 11:22:05 
> > > > AM:
> > > > 
> > > > > [EMAIL PROTECTED] schrieb:
> > > > > > We are using snmp to monitor our windows
> > servers
> > > > and I have had a 
> > > > request 
> > > > > > from the windows admins. They want to alert
> > at
> > > > 80% and 90% usage on 
> > > > C:\ 
> > > > > > but 95% and 98% on all other partitions. (I
> > > > didn't ask about the logic 
> > > > 
> > > > > > because it wouldn't matter). I have tried
> > > > multiple snmp plugins and 
> > > > all of 
> > > > > > them seem to act the same. I can monitor all
> > the
> > > > disks on a single 
> > > > check, 
> > > > > > a single disk on a single check but I cannot
> > > > monitor all the disks 
> > > > except 
> > > > > > C: on a single check. There doesn't seem to
> > be
> > > > any plugins out there 
> > > > that 
> > > > > > allow exclusion and use snmp. Before I go
> > > > writing a wrapper for an 
> > > > > > existing plugin does anyone know if there is
> > a
> > > > plugin that will allow 
> > > > me 
> > > > > > to do this? I am looking to exclude a single
> > > > drive letter but check 
> > > > all 
> > > > > > the rest of the drives via snmp on windows
> > > > servers. Thanks in advance 
> > > > for 
> > > > > > any advice.
> > > > > > 
> > > > > 
> > > > > http://nagios.manubulon.com/snmp_storage.html
> > > > > 
> > > > > 
> > > > > check_snmp_storage will do the Job
> > > > > 
> > > > > Joerg
> > > > That does exactly what we need. Is it listed on
> > > > Nagios Exchange? Thank you 
> > > > for your response; I have managed to escape
> > writing
> > > > a wrapper thanks to 
> > > > you!
> > > > >
> > > 
> >
> 
-
> > > > This SF.net email is sponsored by: Microsoft
> > > > Defy all challenges. Microsoft(R) Visual Studio
> > > > 2008.
> > > >
> > >
> >
> http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/>
> > > ___
> > > > Nagios-users mailing list
> > > > Nagios-users@lists.sourceforge.net
> > > >
> > >
> >
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> > > > ::: Please include Nagios version, plugin
> > version
> > > > (-v) and OS when reporting any issue. 
> > > > ::: Messages without supporting info will risk
> > being
> > > > sent to /dev/null
> > > 
> > > 
> > > 
> > > 
> >
> ___
> > > Yahoo! Answers - Got a question? Someone out there
> > knows the answer. Try 
> > it
> > > now.
> > > http://uk.answers.yahoo.com/ 
> > 
> 
> 
> 
>   ___
> Support the World Aids Awareness campaign this month with Yahoo! For 
Good 
> http://uk.promotions.yahoo.com/forgood/
-
This SF.net email

Re: [Nagios-users] SNMP/Windows disk usage

2008-02-19 Thread mark . potter
Melanie,

I have setup the plugin available here: 
http://nagios.manubulon.com/snmp_storage.html and it works perfectly for 
our needs. It uses SNMP so you would have to have SNMP setup on all of 
your Windows boxes but with that caveat it is the best solution I have 
found for monitoring windows partitions. If you are looking for something 
slightly less complicated or with fewer options there are many plugins on 
nagiosexchange.com that will do the job. I would recommend 
check_snmp_storage though due to the options and the robustness of the 
plugin.

Best Regards,

Mark L. Potter
Systems Engineer
Academy Sports & Outdoors
1800 N. Mason Rd
Katy, Texas 77449



Melanie Pfefer <[EMAIL PROTECTED]> wrote on 02/19/2008 03:04:23 
AM:

> Hello Mark,
> I am interested in monitoring the disks even with no
> exclusions.
> I installed nagios2.9 and the nagios plugin on a linux
> box.
> 
> Would it be possible that you give me what you?ve done
> to monitor the partitions on windows?
> 
> I appreciate your help.
> 
> --- [EMAIL PROTECTED] wrote:
> 
> > [EMAIL PROTECTED] wrote on
> > 02/18/2008 11:22:05 
> > AM:
> > 
> > > [EMAIL PROTECTED] schrieb:
> > > > We are using snmp to monitor our windows servers
> > and I have had a 
> > request 
> > > > from the windows admins. They want to alert at
> > 80% and 90% usage on 
> > C:\ 
> > > > but 95% and 98% on all other partitions. (I
> > didn't ask about the logic 
> > 
> > > > because it wouldn't matter). I have tried
> > multiple snmp plugins and 
> > all of 
> > > > them seem to act the same. I can monitor all the
> > disks on a single 
> > check, 
> > > > a single disk on a single check but I cannot
> > monitor all the disks 
> > except 
> > > > C: on a single check. There doesn't seem to be
> > any plugins out there 
> > that 
> > > > allow exclusion and use snmp. Before I go
> > writing a wrapper for an 
> > > > existing plugin does anyone know if there is a
> > plugin that will allow 
> > me 
> > > > to do this? I am looking to exclude a single
> > drive letter but check 
> > all 
> > > > the rest of the drives via snmp on windows
> > servers. Thanks in advance 
> > for 
> > > > any advice.
> > > > 
> > > 
> > > http://nagios.manubulon.com/snmp_storage.html
> > > 
> > > 
> > > check_snmp_storage will do the Job
> > > 
> > > Joerg
> > That does exactly what we need. Is it listed on
> > Nagios Exchange? Thank you 
> > for your response; I have managed to escape writing
> > a wrapper thanks to 
> > you!
> > >
> 
-
> > This SF.net email is sponsored by: Microsoft
> > Defy all challenges. Microsoft(R) Visual Studio
> > 2008.
> >
> http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/>
> ___
> > Nagios-users mailing list
> > Nagios-users@lists.sourceforge.net
> >
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> > ::: Please include Nagios version, plugin version
> > (-v) and OS when reporting any issue. 
> > ::: Messages without supporting info will risk being
> > sent to /dev/null
> 
> 
> 
>   ___
> Yahoo! Answers - Got a question? Someone out there knows the answer. Try 
it
> now.
> http://uk.answers.yahoo.com/ 
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] SNMP/Windows disk usage

2008-02-18 Thread mark . potter
[EMAIL PROTECTED] wrote on 02/18/2008 11:22:05 
AM:

> [EMAIL PROTECTED] schrieb:
> > We are using snmp to monitor our windows servers and I have had a 
request 
> > from the windows admins. They want to alert at 80% and 90% usage on 
C:\ 
> > but 95% and 98% on all other partitions. (I didn't ask about the logic 

> > because it wouldn't matter). I have tried multiple snmp plugins and 
all of 
> > them seem to act the same. I can monitor all the disks on a single 
check, 
> > a single disk on a single check but I cannot monitor all the disks 
except 
> > C: on a single check. There doesn't seem to be any plugins out there 
that 
> > allow exclusion and use snmp. Before I go writing a wrapper for an 
> > existing plugin does anyone know if there is a plugin that will allow 
me 
> > to do this? I am looking to exclude a single drive letter but check 
all 
> > the rest of the drives via snmp on windows servers. Thanks in advance 
for 
> > any advice.
> > 
> 
> http://nagios.manubulon.com/snmp_storage.html
> 
> 
> check_snmp_storage will do the Job
> 
> Joerg
That does exactly what we need. Is it listed on Nagios Exchange? Thank you 
for your response; I have managed to escape writing a wrapper thanks to 
you!
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] SNMP/Windows disk usage

2008-02-18 Thread mark . potter
We are using snmp to monitor our windows servers and I have had a request 
from the windows admins. They want to alert at 80% and 90% usage on C:\ 
but 95% and 98% on all other partitions. (I didn't ask about the logic 
because it wouldn't matter). I have tried multiple snmp plugins and all of 
them seem to act the same. I can monitor all the disks on a single check, 
a single disk on a single check but I cannot monitor all the disks except 
C: on a single check. There doesn't seem to be any plugins out there that 
allow exclusion and use snmp. Before I go writing a wrapper for an 
existing plugin does anyone know if there is a plugin that will allow me 
to do this? I am looking to exclude a single drive letter but check all 
the rest of the drives via snmp on windows servers. Thanks in advance for 
any advice.

Best Regards,

Mark L. Potter
Systems Engineer
Academy Sports & Outdoors
1800 N. Mason Rd
Katy, Texas 77449

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] How to explain active host checks to boss

2008-02-13 Thread mark . potter
[EMAIL PROTECTED] wrote on 02/13/2008 11:29:06 
AM:

> 
> 
> > -Original Message-
> > From: [EMAIL PROTECTED] [mailto:nagios-users-
> > [EMAIL PROTECTED] On Behalf Of [EMAIL PROTECTED]
> > Sent: Wednesday, February 13, 2008 10:48 AM
> > To: nagios-users@lists.sourceforge.net
> > Subject: [Nagios-users] How to explain active host checks to boss
> > 
> > 
> > Background: Due to management requirements we are using NagiosQL as a
> > configuration manager for our Nagios install. NagiosQL defaults to
> active
> > checks enabled for hosts so this is how it's been done until now. We
> have
> > the alerts coming as we want them. We are adding more hosts and
> services
> > weekly. I know that active host checks are not a good thing to have
> going
> > forward as they are unnecessary. Please advise on the best way to
> explain
> > this to the boss who is, at this moment, convinced that if we turn off
> the
> > option in the config file then the host will never be checked even if
> a
> > service is down. I can't find a good place in the documentation to
> point
> > this out and would like to get these turned off in the near future so
> we
> > don't run into issues later on down the road. Any help in pointing me
> in
> > the right direction would be appreciated. Here is a sample host cfg
> from
> > our environment:
> 
> Assuming you're using 2.x. The main issue with host checks in 2.x and
> prior is that they are performed serially, not in parallel. While a host
> check is being run, nagios stops absolutely everything else, other
> host/service checks, notifications, etc until that single host check is
> complete. To put this in perspective, assume that you have 100 hosts
> checked with 10 pings over a 15 minute check_interval with a
> max_check_attempts of 3. When every host is up, each host check will
> take approximately 10 seconds to complete, during which nagios isn't
> doing anything else except obsessing over that host --
> 
> 100 hosts X 10 seconds = 1000 seconds 
> 
> As you can see, you've already exceeded your normal check interval of
> 900 seconds. Nagios cannot complete the host checks in the time interval
> you've specified and you haven't even done any service checks yet. Now,
> nagios will attempt to interleave service checks between host checks to
> compensate but you've just introduced latency for both check types.
> 
> Now imagine that you have a simple outage. 5 hosts are down that aren't
> related via parenting. Your timing now looks like --
> 
> (95 hosts X 10 seconds) + (5 hosts X 30 seconds) = 1100 seconds,
> dedicated to host checks only.
> 
> Because the host checks aren't related, nagios is able to interleave
> some service checks between so the latency isn't as bad as it could be.
> Take the calculation above and determine the effects of a large outage.
> Factor in parenting, where nagios will only being checking hosts up the
> tree without interleaving service checks and you start seeing big
> problems at the time that your monitoring systems is most critical and
> useful. You could easily end up in a situation where hosts and services
> aren't being checked for lng intervals.
> 
> 
> Nagios is smart. You don't need to schedule regular host checks because
> nagios knows that if there is a problem with a service, it may be caused
> by an outage of the host or a parent of the host. Nagios will
> automagically run the host check_command anytime there is a non-OK
> result from a service check, assuming only that active_checks_enabled is
> on for the host and there is a valid check_command specified. It will
> also follow the parents tree if the host check returns non-OK results
> until nagios finds an OK parent or reaches the top of the tree. Even so,
> you want to have your host checks finish as quickly as possible; 1 ping
> max_check_attempts 3 times is usually sufficient to determine status.
> 
> Nagios-3 introduces parallel host check execution and there are some
> benefits to running host checks there specifically for caching results
> for possible use by the on-demand checks or if you're interested in
> using host performance data for trending for example, but they aren't
> necessary.
> 
> Some documentation to help --
> 
> http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#host
> 
> "check_interval:NOTE: Do NOT enable regularly scheduled checks
> of a host unless you absolutely need to! Host checks are already
> performed on-demand when necessary, so there are few times when
> regularly scheduled checks would be needed. Regularly scheduled host
> checks can negatively impact performance - see the performance tuning
> tips for more information. This directive is used to define the number
> of "time units" between regularly scheduled checks of the host. Unless
> you've changed the interval_length directive from the default value of
> 60, this number will mean minutes. More information on this value can be
> found in the check scheduling documentation."
> 
> 
> htt

[Nagios-users] How to explain active host checks to boss

2008-02-13 Thread mark . potter
Background: Due to management requirements we are using NagiosQL as a 
configuration manager for our Nagios install. NagiosQL defaults to active 
checks enabled for hosts so this is how it's been done until now. We have 
the alerts coming as we want them. We are adding more hosts and services 
weekly. I know that active host checks are not a good thing to have going 
forward as they are unnecessary. Please advise on the best way to explain 
this to the boss who is, at this moment, convinced that if we turn off the 
option in the config file then the host will never be checked even if a 
service is down. I can't find a good place in the documentation to point 
this out and would like to get these turned off in the near future so we 
don't run into issues later on down the road. Any help in pointing me in 
the right direction would be appreciated. Here is a sample host cfg from 
our environment:

define  host {
host_name   plapwnd01
alias   plapwnd01
address 10.1.254.253
parents 10.1.254-Parent
check_command   check-host-alive
max_check_attempts  3
check_interval  15
active_checks_enabled   1
passive_checks_enabled  1
check_period24x7
obsess_over_host0
check_freshness 0
event_handler_enabled   1
flap_detection_enabled  1
process_perf_data   1
retain_status_information   1
retain_nonstatus_information1
contact_groups  sysalertstest
notification_interval   15
notification_period 24x7
notification_optionsd,u
notifications_enabled   1
register1
}

Best Regards,

Mark L. Potter
Systems Engineer
Academy Sports & Outdoors
1800 N. Mason Rd
Katy, Texas 77449
Office: 281-646-5857
Cell: 281-734-6965
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] check_snmp issue

2008-02-12 Thread mark . potter
[EMAIL PROTECTED] wrote on 02/12/2008 10:06:16 
AM:

> 
> 
> > -Original Message-
> > From: [EMAIL PROTECTED] [mailto:nagios-users-
> > [EMAIL PROTECTED] On Behalf Of [EMAIL PROTECTED]
> > Sent: Tuesday, February 12, 2008 9:39 AM
> > To: nagios-users@lists.sourceforge.net
> > Subject: [Nagios-users] check_snmp issue
> > 
> > 
> > Thank you all for the comments on the parenting issue I posted
> yesterday.
> > We have come up with a much easier solution than had been proposed.
> 
> What did you end up with?

host -> gateway -> router

Which is actually not much of a change from how we are already doing it.

> 
> > I am now having an issue with check_snmp. I have used check_snmp to
> > monitor this exact service in the past on other Nagios hosts and am
> using
> > other snmp checks as well from the current host. Here is the problem:
> > 
> > [EMAIL PROTECTED]:/usr/local/nagios/libexec> /check_snmp -H peapgse01
> -o
> > GSA-MIB::crawlRunning.0 -C public -m GSA-MIB -P 2c -d "="
> > SNMP problem - No data received from host
> > CMD: /usr/bin/snmpget -t 1 -r 5 -m GSA-MIB -v 2c -c public
> peapgse01:161
> > GSA-MIB::crawlRunning.0
> > 
> > But if I run the command it shows, exactly:
> > 
> > [EMAIL PROTECTED]:/usr/local/nagios/libexec> /usr/bin/snmpget -t 1 -r 5
> -m
> > GSA-MIB -v 2c -c public peapgse01:161  GSA-MIB::crawlRunning.0
> > GSA-MIB::crawlRunning.0 = INTEGER: Running(1)
> 
> Not a direct answer but to troubleshoot I'd run check_snmp in verbose
> mode (-v). That will cause it to print the output it receives from
> snmpget. You'll at least know then if the problem is with the call to
> snmpget or the parsing of its output. I'd also run 'strace -Ffs512
> ./check_snmp ...' and look for the sendto and recvfrom lines which would
> be the communication with the remote snmp agent just to verify that
> there is data flowing both ways. I'd also drop the -d "=" since that's
> the default anyway. I'd try using the numeric OID instead of
> GSA-MIB::crawlRunning.0 to eliminate MIB loading/parsing issues. And
> finally, I'd use IP or FQDN for the hostname to eliminate resolution
> issues.
> 
> --
> Marc

Sometimes you can't see the forest for the trees. It was an MIB issue. 
Using the numeric OID gets the result I need. Thank you for the advice. 
Taking it step by step solved my issue:
[EMAIL PROTECTED]:/usr/local/nagios/libexec> ./check_snmp -v -H peapgse01 
-o .1.3.6.1.4.1.11129.1.1.1.0 -C public -m GSA-MIB -P 2c
/usr/bin/snmpget -t 1 -r 5 -m GSA-MIB -v 2c -c public peapgse01:161 
1.3.6.1.4.1.11129.1.1.1.0
iso.3.6.1.4.1.11129.1.1.1.0 = INTEGER: 1

SNMP OK - 1 | iso.3.6.1.4.1.11129.1.1.1.0=1

Would be nice to use the MIB but totally unnecessary and I don't feel like 
bothering to troubleshoot it.-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] check_snmp issue

2008-02-12 Thread mark . potter
Thank you all for the comments on the parenting issue I posted yesterday. 
We have come up with a much easier solution than had been proposed.

I am now having an issue with check_snmp. I have used check_snmp to 
monitor this exact service in the past on other Nagios hosts and am using 
other snmp checks as well from the current host. Here is the problem:

[EMAIL PROTECTED]:/usr/local/nagios/libexec> ./check_snmp -H peapgse01 -o 
GSA-MIB::crawlRunning.0 -C public -m GSA-MIB -P 2c -d "="
SNMP problem - No data received from host
CMD: /usr/bin/snmpget -t 1 -r 5 -m GSA-MIB -v 2c -c public peapgse01:161 
GSA-MIB::crawlRunning.0

But if I run the command it shows, exactly:

[EMAIL PROTECTED]:/usr/local/nagios/libexec> /usr/bin/snmpget -t 1 -r 5 -m 
GSA-MIB -v 2c -c public peapgse01:161  GSA-MIB::crawlRunning.0
GSA-MIB::crawlRunning.0 = INTEGER: Running(1)

Both are being run as the nagios user, the second is a cut and paste of 
the command shown by check_snmp and reported to have no output. I don't 
think I have mucked up the syntax as the command produced for snmpget 
works just fine. I am probably missing something really easy here and if 
someone could point it out I would really appreciate it.

Best Regards,

Mark L. Potter
Systems Engineer
Academy Sports & Outdoors
1800 N. Mason Rd
Katy, Texas 77449
Office: 281-646-5857
Cell: 281-734-6965
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Parenting vs Dependencies

2008-02-11 Thread mark . potter
I know that this has been discussed ad nauseam and I have read quite a bit 
of those discussions this morning and over the preceding weekend. I still 
am at a bit of a loss. I have a pretty firm understanding of parenting and 
dependencies and how they both work. I think I have a good grasp on what 
we need in our environment but I seem to be missing the point in 
explaining this to my boss. So here is an overview of what I have to 
monitor, what my boss is asking, and what I think we need and maybe 
someone can beat some sense into me.

I have a medium sized network (100-200 hosts).
Hosts are in cabinets on site (no off site monitoring yet)
Cabinets are not setup by subnet (I think this is fairly normal)
1 switch per cabinet
Multiple subnets across the farm

The boss and I agree that the switch in each cabinet needs to be part of 
our monitoring setup and be part of some sort of parenting/dependency. 
This is where we cease seeing eye to eye. For the purpose of making this 
easy lets say we are dealing with three subnets and ten cabinets only. 
Here is what the boss wants (at least as far as I can understand): Setup 
three hosts per cabinet to represent the three subnets, that actual IP 
would be the IP of the switch. Parent the hosts in the cabinet to these 
and parent these to the gateways for the subnet. This is, with the 
parameters above, thirty extra hosts, and IMO a management nightmare.

Since the switch is not part of the route my thought is that we parent in 
the normal manner as I understand it, host parents to router, and so on 
and at that point we would make the hosts have a dependency on the switch 
through which their connection passes. The current setup doesn't take into 
account the switches at all but rather is setup to where each host is 
parented to it's gateway IP and that has been working rather well but if a 
switch dies we need to be able to see that via nagios and not have 20 or 
servers and their services going off all at once.. I cannot see why adding 
tens of special hosts for the boss's parenting solution will help anything 
in the long run.

Thanks in advance to anyone who takes the time to respond to this as I 
probably haven't explained very well to begin with.

Best Regards,

Mark L. Potter
Systems Engineer
Academy Sports & Outdoors-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Email Alerts

2008-01-07 Thread mark . potter
[EMAIL PROTECTED] wrote on 01/07/2008 03:43:18 
PM:

> Hello,
> 
> Where can I add an additional email address in Nagios?  I have the 
> default one working fine.  Can someone offer to me the location and 
> exact configuration parametes that I should enter.
> 
> Share life as it happens with the new Windows Live. Start sharing!
> 
-
> Check out the new SourceForge.net Marketplace.
> It's the best place to buy or sell services for
> just about anything Open Source.
> 
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace

> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when 
> reporting any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null

http://nagios.sourceforge.net/docs/2_0/toc.html

Right around: Theory of Operation is where you would want to start.-
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services for
just about anything Open Source.
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] SNMP Question

2007-12-14 Thread mark . potter
Reformatted due to top posting.

 
> [EMAIL PROTECTED] wrote on 12/14/2007 01:15:33 
PM:
> 
> > >I am trying to view the status of port 1 on a Cisco 2970 switch. 
> > >However, Nagios is giving me an UNKNOWN error.  I tried running the 
> > >following from a command prompt: 
> > >./check_snmp -H  -C  -o ifOperStatus.1 -r 1 

> > 
> > >I get the following response: 
> > >SNMP OK - up(1) | RFC1213-MIB::ifOperStatus.1=up(1) 
> > > 
> > > If I am getting this response at the command prompt, why isn't the 
> > > monitor coming up green in Nagios? 
> > > 
> > > Thanks, 
> > > Mike 
> > > 
> >Are you running the command as the nagios user when you run it from 
> >the command line?

> I am running it as root.
> 

You will not be able to troubleshoot when running plugins as root. Root 
has different permissions than the nagios user. Try running it as the 
nagios user. Also, you will have to put the MIB somewhere nagios can use 
it otherwise use the numeric for SNMP.
-
SF.Net email is sponsored by:
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services
for just about anything Open Source.
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] SNMP Question

2007-12-14 Thread mark . potter
[EMAIL PROTECTED] wrote on 12/14/2007 01:15:33 
PM:

> I am trying to view the status of port 1 on a Cisco 2970 switch. 
> However, Nagios is giving me an UNKNOWN error.  I tried running the 
> following from a command prompt:
> ./check_snmp -H  -C  -o ifOperStatus.1 -r 1
> 
> I get the following response:
> SNMP OK - up(1) | RFC1213-MIB::ifOperStatus.1=up(1)
> 
> If I am getting this response at the command prompt, why isn't the 
> monitor coming up green in Nagios?
> 
> Thanks,
> Mike
> 
Are you running the command as the nagios user when you run it from the 
command line?-
SF.Net email is sponsored by:
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services
for just about anything Open Source.
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] check_by_ssh vs nrpe discussion (was check_by_ssh)

2007-12-13 Thread mark . potter
"mark redding" <[EMAIL PROTECTED]> wrote on 12/13/2007 10:36:16 AM:

> Hi,
> 
> On 13/12/2007, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
> >
> > Currently I know of no risk associated with running NRPE however any 
service
> > that opens a port on a system can become a risk. After much debate and
> > discussion we have decided to use check_by_ssh in our environment for 
this
> > very reason. It may not be a risk at the moment but it could become a 
risk
> > in the future. I am not a hacker so I can't say what could happen with 
the
> > existing code to cause a risk but the possibility remains. Our 
decision was
> > based in simplicity as there is less to install on each system, fewer
> > configurations to maintain, less risk as no new daemons are 
introduced.
> > Everyone else may have different mileage in their own environments.
> 
> I can see that a poorly implemented "check_by_ssh" based system (ie.
> where the remote end fails to have extremely tight control on what can
> be run) can be less secure that using nrpe (where the remote nrpe.cfg
> file specifies who can communicate with it and exactly which commands
> (inc params) it can run) given that there is a password-less key
> exchange taking place and therefore anyone gaining access to the
> account with the key is then able to gain shell level access to the
> remote machine.
> 
> For example, if the local host does not have it's private key
> protected properly, or indeed if the root password on the localhost is
> weak then :-
> 
> $ ssh -i /home/nagios/.ssh/id_rsa -l nagios remote-host
> 
> gets you straight onto "remote-host". Just because it has the letters
> 'ssh' in it does not make it un-exploitable.
> 
> Anyway, It's horses for courses I suppose.
> 

It all depends on your ssh setup. When you have tight controls for ssh in 
place, which you should to begin with, then check_by_ssh isn't an issue. 
The situation which you describe already has one box being rooted by a 
weak password and improper permissions on the private key file. SSH is a 
service already running on most Linux boxes and if the sysadmin has it 
that borked to begin with then it really isn't going to matter which is 
more secure (nrpe or check_by_ssh) because the overall system is at risk 
from the ground up. 

Nothing is un-exploitable but we all try to make sure our systems are 
secure. In our case it made sense to go with check_by_ssh for the reasons 
listed and not to mention we had just overhauled our ssh policies and 
implemented even more access controls. It also allowed us to not have to 
drop configurations onto multiple servers and in that way made managing 
the overall system easier.

That said I have worked for a webhost where we used nrpe and I never had 
any problems. In most cases it's six in one hand and a half dozen in the 
other. It all boils down to the environment into which you walk when 
implementing Nagios. If you walk into one with a well setup ssh security 
policy and a paranoid security team the check_by_ssh is likely how you 
will end up going but in a lot of other cases nrpe is not an issue. I have 
done both and will likely end up doing both again.

Horses for courses indeed!
-
SF.Net email is sponsored by:
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services
for just about anything Open Source.
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] check_by_ssh

2007-12-13 Thread mark . potter
> Further, what is the big concern about running nrpe ??? I have a
> number of mission critical systems that all run nrpe. Configured with
> the correct "allowed_hosts" list, with "nrpe_user" & "nrpe_group" set
> to the right type of account, and the "dont_blame_nrpe=0" parameter
> set (in order to prevent arguments being passed to nrpe) I really
> can't see what the risk is (someone like to enlighten me ?).
> 
> -- 
> bright blessings,
> Mark
> 

Currently I know of no risk associated with running NRPE however any 
service that opens a port on a system can become a risk. After much debate 
and discussion we have decided to use check_by_ssh in our environment for 
this very reason. It may not be a risk at the moment but it could become a 
risk in the future. I am not a hacker so I can't say what could happen 
with the existing code to cause a risk but the possibility remains. Our 
decision was based in simplicity as there is less to install on each 
system, fewer configurations to maintain, less risk as no new daemons are 
introduced. Everyone else may have different mileage in their own 
environments.

Sorry for the off topic post.-
SF.Net email is sponsored by:
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services
for just about anything Open Source.
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] specify normal_check_interval in seconds

2007-12-12 Thread mark . potter
[EMAIL PROTECTED] wrote on 12/12/2007 11:39:26 
AM:

> I am not talking about the interval_length in the main nagios.cfg 
> file, I am talking about the 'normal_check_interval' parameter when 
> you are creating service definitions?
> 
> Alex

The normal_check_interval is x number of units where the units are defined 
by interval_length in the nagios.cfg. If you set interval_length=10 in 
nagios.cfg and a normal_check_interval=36 then the check interval would be 
360 seconds. The answer is that you can in fact do what you have asked but 
more than likely not in the manner in which you wish to do so. You could, 
in theory, set the interval_length to 1 and then all check intervals 
throughout nagios would be in seconds. What you cannot do is use different 
measurements of time for different intervals. You have a definition of the 
units used and all intervals throughout are multiples of that unit.

For the record top posting after having been responded to in a proper 
manner makes it very hard to follow and harder to answer.


> On Dec 12, 2007 5:33 PM, Marc Powell <[EMAIL PROTECTED]> wrote:
> 
> 
> > -Original Message-
> > From: [EMAIL PROTECTED] [mailto:nagios-users- 

> > [EMAIL PROTECTED] On Behalf Of Alex Dehaini
> > Sent: Wednesday, December 12, 2007 11:20 AM
> > To: nagios-users@lists.sourceforge.net
> > Subject: [Nagios-users] specify normal_check_interval in seconds
> >
> > Hi guys,
> >
> > Can I specify 'normal_check_interval' in seconds?

> In nagios.cfg --
> 
> # INTERVAL LENGTH
> # This is the seconds per unit interval as used in the
> # host/contact/service configuration files.  Setting this to 60 means
> # that each interval is one minute long (60 seconds).  Other settings 
> # have not been tested much, so your mileage is likely to vary...
> 
> interval_length=60
> 
> 
> Be _very_ aware that changing this setting affects _all_ interval
> specifications.
> 
> --
> Marc
> 
> 
- 
> SF.Net email is sponsored by:
> Check out the new SourceForge.net Marketplace.
> It's the best place to buy or sell services
> for just about anything Open Source.
> 
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace

> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net 
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when 
> reporting any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null
> 
> 
> 
> -- 
> Alex Dehaini
> Developer
> Site - www.alexdehaini.com 
> Email - [EMAIL PROTECTED] 
> 
-
> SF.Net email is sponsored by:
> Check out the new SourceForge.net Marketplace.
> It's the best place to buy or sell services
> for just about anything Open Source.
> 
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace

> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when 
> reporting any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null-
SF.Net email is sponsored by:
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services
for just about anything Open Source.
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] host and service commands

2007-12-12 Thread mark . potter
Too much top posting to respond easily



> So are you saying the service commands run before the host commands?

The host command doesn't "run" on a schedule as such. Once state is 
established the host check will not run unless a service on the host fails 
in the 2.x series. This is my understanding and it could be wrong so feel 
free to correct me. Therefore the service command runs before the host 
command in essence but it is a bad description as the service checks run 
without regard for the host check until a service check fails. There is 
more in depth explanation that can be done but the short answer is that 
the question assumes a schedule of events that is not entirely accurate.

-
SF.Net email is sponsored by:
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services for
just about anything Open Source.
http://sourceforge.net/services/buy/index.php___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Escalations Question

2007-12-04 Thread mark . potter
> -BEGIN PGP SIGNED MESSAGE-
> Hash: SHA1
> 
> [EMAIL PROTECTED] wrote:
> 
> > I have looked through the docs and the list and this seems right but I
> > would like some confirmation before I implement service and host
> > escalations. The situation is this: I have been tasked with getting a
> > single email to helpdesk (they do email to ticket) per Nagios alert 
and
> > no more. They don't want recovery or further notifications. This is 
the
> > first one I have written for a group of webservers:
> > 
> > define  serviceescalation {
> > host_name 
> > plwsweb01,plwsweb02,plwsweb03,plwsweb05,plwsweb06
> > service_description Apache - Maximum Connections
> > Check,HTTP - Port 80 Check - Production Systems,Ping - Simple Ping 
Check
> > - Production
> > contact_groups  helpdesk,sysalertsinfra
> > first_notification  1
> > last_notification   1
> > notification_interval   30
> > }
> > 
> > According to my understanding, and I could be wrong, this will send 
the
> > first notification to groups (helpdesk and sysalertsinfra) and the 
rest,
> > if I define no further escalations, will go the default contact for 
that
> > group.
> > 
> 
> - From what you have specified here, the first and last notification 
will
> go on this escalation to the contact_group you have listed. Since you
> don't have the escalation options (w,u,c,r) listed, it will send the
> alerts based off of whatever you have specified in the service
> configuration definition.
> 
> You would need to make sure you have this listed if you only want your
> escalation to send the critical.
> 
> escalation_options   c
> 
> Although, if you only want one page to go out on critical, then I
> wouldn't mess with the escalation at all. I would specify the service to
> only notifiy you once, and only once on a critical back in your service
> definition.
> 
> 
> Generally, you would use escalations like I explain here:
> 
> I use RT (Request Tracker) to receive alerts from Nagios. The first
> notification defined through the service definition goes to my alarms
> group. This sends notification to RT, my pager, and my bosses pager.
> 
> I then escalate the second notification, such as listed below, to only
> go to my pager and my bosses pager every 60 minutes forever until one of
> us responds.
> 
> > 1. Are the multiple services/hosts allowed here?
> 
> Yes. You can use hostgroupescalations:
> 
> define hostescalation{
> hostgroup_name  corp-monitors
> first_notification  2
> last_notification   0
> notification_interval   60
> contact_groups  itproblems
> }
> 
> > 2. Is there a better way to do this?
> 
> That depends on what you're trying or wanting to do. Hostgroup
> escalations just saves you typing time if you want to escalate based on
> hostgroups.
> 
> Services you won't be able to do in multiples, so you'll have to setup a
> service escalation for each service on each host.
> 
> Hope this helps you a bit.
> 
Max,

That works pretty well actually. I will add everyone that needs to get the 
first notification to the proper group and then escalate as you have 
shown. The logic you provided makes more sense than trying to capture to 
the first and only the first notification. Thanks so much for your help.-
SF.Net email is sponsored by: The Future of Linux Business White Paper
from Novell.  From the desktop to the data center, Linux is going
mainstream.  Let it simplify your IT future.
http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Escalations Question

2007-12-04 Thread mark . potter
I have looked through the docs and the list and this seems right but I 
would like some confirmation before I implement service and host 
escalations. The situation is this: I have been tasked with getting a 
single email to helpdesk (they do email to ticket) per Nagios alert and no 
more. They don't want recovery or further notifications. This is the first 
one I have written for a group of webservers:

define  serviceescalation {
host_name plwsweb01,plwsweb02,plwsweb03,plwsweb05,plwsweb06
service_description Apache - Maximum Connections 
Check,HTTP - Port 80 Check - Production Systems,Ping - Simple Ping Check - 
Production
contact_groups  helpdesk,sysalertsinfra
first_notification  1
last_notification   1
notification_interval   30
}

According to my understanding, and I could be wrong, this will send the 
first notification to groups (helpdesk and sysalertsinfra) and the rest, 
if I define no further escalations, will go the default contact for that 
group.

The other questions I have are:

1. Are the multiple services/hosts allowed here?
2. Is there a better way to do this?

Thank you all, in advance, for your time.


Best Regards,

Mark L. Potter
Systems Engineer
Academy Sports & Outdoors
1800 N. Mason Rd
Katy, Texas 77449
Office: 281-646-5857
Cell: 281-734-6965
-
SF.Net email is sponsored by: The Future of Linux Business White Paper
from Novell.  From the desktop to the data center, Linux is going
mainstream.  Let it simplify your IT future.
http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Dealing with nightly high load

2007-11-29 Thread mark . potter
Cell: 281-734-6965


[EMAIL PROTECTED] wrote on 11/29/2007 02:30:20 
AM:

> All,
> 
> I'm trying to deal with a bunch of servers that generate some high load
> during the night. That is, between 00:00 and 06:00 a series of job is
> run that will generate load from 10 to 15. Outside this windows, load
> should be no more than 3.
> 
> I have read all the documentation, but I cant find a way to implement
> this in Nagios. I have found 2 options:
> - defining 2 services (load_daily and load_nightly) with different
> tresholds. This will make 2 services show up in nagios that are
> essential the same
> - using escalations to supress messages during the night. This will
> however still show errors in the webinterface and spoil you reports with
> fake errors
> 
> How can I tell Nagios that during the day the load should be max 3
> (warning) and max 5 (critical) but during the night, it's ok if they are
> 8 (warning) and 10 (critical)? I'm sure other people have ran into this
> same problem and found a solution for it? Please share!
> 
> I'm running Nagios 2.10
> 
> Thanx for the help.
> 
> Met vriendelijke groet/With kind regards, 
> 
> Jasper Filon
> 
> CAIWAY Business
> 


Jasper,

I have sort of the same issue and you have already stated the most 
expedient, and indeed only, solution that I found. Two services with 
different time periods for checking. It's doesn't look as pretty in the 
web interface but it works like a charm. The other method would be to 
write a wrapper that performs the check and alerts on the value you set 
based on time. This wouldn't be hard to do to be honest but it seems like 
re-inventing the wheel to some degree. The other question I have is if the 
servers are capable of withstanding a load that high why not just have 
your high threshold set the same all time? My situation isn't load based 
but with the standard Nagios plugin you can alert on any of the three 
points that uptime returns (here I make the assumption you are monitoring 
*nix boxen). If the situation calls for being able to be alerted to spikes 
during normal operating hours then you will either need two checks and 
time period for each or a wrapper that checks the time and returns an 
alert based on the time and load.

Best Regards,

Mark L. Potter
-
SF.Net email is sponsored by: The Future of Linux Business White Paper
from Novell.  From the desktop to the data center, Linux is going
mainstream.  Let it simplify your IT future.
http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Line breaks in notifications ignored

2007-11-21 Thread mark . potter
"Tal Cohen" <[EMAIL PROTECTED]> wrote on 11/21/2007 08:35:51 AM:

> Looks like the back slash is getting escaped out. Try double 
> backslashes: \\n instead of \n.
> 
> Tal
> 
> 

Double backslashes produce the same results unfortunately. It doesn't make 
much sense to me that they would but they do. Thanks, though, for the 
response!


> 
> 
> I am pretty sure I have seen the answer to this but I have searched 
> the archives, google, and my own archives (since I joined the list) 
> and can't seem to find where I saw this mentioned before. I have 
> notification commands such as: 
> 
> define command{ 
> command_namehost-notify-by-email 
> command_line/usr/bin/printf "%b" "** Nagios **\n
> \nNotification Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState: 
> $HOSTSTATE$\nAddress: $HOSTADDRESS$\nInfo: $HOSTOUTPUT$\n\nDate/
> Time: $LONGDATETIME$\n" | /usr/bin/mailx -s "Host $HOSTSTATE$ alert 
> for $HOSTNAME$!" $CONTACTEMAIL$ 
> } 
> 
> When I run: /usr/bin/printf "%b" "** Nagios **\n\nNotification Type"
> from the command line it appears correctly, with the \n inserting a 
> line break, however when Nagios sends an alert I get the alert with 
> no line breaks and an"nn" or "n" (depending on the break) where it 
> should have been. I am stumped at the moment. I have been over the 
> docs but only found the macros listing. I have a sneaking suspicion 
> that it may have something to do with 2.9/2.10 upgrades but that 
> didn't make much sense to me. I have tried escaping the breaks but 
> the same thing happens as if there weren't any escapes there at all. 
> 
> Nagios 2.10 
> SLES: 9 patch level 3
> 
> Any help at all, even just the right google phrase for which to 
> search, would really be appreciated! 
> 
> Best Regards,
> 
> Mark L. Potter
> Systems Engineer
> Academy Sports & Outdoors
> 1800 N. Mason Rd
> Katy, Texas 77449
> Office: 281-646-5857
> Cell: 281-734-6965-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Line breaks in notifications ignored

2007-11-21 Thread mark . potter
I am pretty sure I have seen the answer to this but I have searched the 
archives, google, and my own archives (since I joined the list) and can't 
seem to find where I saw this mentioned before. I have notification 
commands such as:

define command{
command_namehost-notify-by-email
command_line/usr/bin/printf "%b" "** Nagios **\n\nNotification 
Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState: $HOSTSTATE$\nAddress: 
$HOSTADDRESS$\nInfo: $HOSTOUTPUT$\n\nDate/Time: $LONGDATETIME$\n" | 
/usr/bin/mailx -s "Host $HOSTSTATE$ alert for $HOSTNAME$!" $CONTACTEMAIL$
}

When I run: /usr/bin/printf "%b" "** Nagios **\n\nNotification Type" from 
the command line it appears correctly, with the \n inserting a line break, 
however when Nagios sends an alert I get the alert with no line breaks and 
an"nn" or "n" (depending on the break) where it should have been. I am 
stumped at the moment. I have been over the docs but only found the macros 
listing. I have a sneaking suspicion that it may have something to do with 
2.9/2.10 upgrades but that didn't make much sense to me. I have tried 
escaping the breaks but the same thing happens as if there weren't any 
escapes there at all.

Nagios 2.10
SLES: 9 patch level 3

Any help at all, even just the right google phrase for which to search, 
would really be appreciated!

Best Regards,

Mark L. Potter
Systems Engineer
Academy Sports & Outdoors
1800 N. Mason Rd
Katy, Texas 77449
Office: 281-646-5857
Cell: 281-734-6965
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Non-support/Interface question

2007-11-19 Thread mark . potter
I have been tasked with creating an easy to view frontend for our 
helpdesk. I was using NagVis for their frontend but their latest request 
seems both a little basic and sort of complex for NagVis. I am hoping 
someone will be able to recommend a method of doing this that is easy to 
maintain. What they, essentially, want is a table with three columns. The 
first column will be the name of the hostgroup, the second would be the 
applications for the hostgroup (A single indicator for all apps that goes 
red if a single app fails), and the third would be the actual hosts. The 
second two columns are simply a red/yellow/green indicator with some 
fourth option for acknowledged problems while the first will be text large 
enough to read on their wall mount plasma screens. I could do this in 
NagVis with a background image for the columns/rows but it seems it would 
be much simpler and easier to maintain as php, html, perl etc. I could 
code such a page but do not have the in depth knowledge of Nagios to hook 
in and get the status information needed for the page. Does anyone know of 
a tool that will accomplish this without much trouble or an easy guide to 
developing such a thing? Thanks to all in advance for an replies.

Best Regards,

Mark L. Potter
Systems Engineer
Academy Sports & Outdoors
1800 N. Mason Rd
Katy, Texas 77449
Office: 281-646-5857
Cell: 281-734-6965
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] check_by_ssh failing on commands that work via remote ssh as the nagios user

2007-11-19 Thread mark . potter
> 
> Nagios: 2.10 
> Plugins: 1.4.9 
> OS: SLES 9.3 
> 
> I am having some trouble figuring out how to make check_by_ssh work 
> properly. I think I have done all of the usual troubleshooting and I
> have googled the issue to no avail: 
> 
> Command as nagios user on the remote server: 
> 
> /usr/local/nagios/libexec/check_load -w 7,7,7 -c 10,10,10 
> OK - load average: 0.01, 0.04, 0.03|load1=0.010;7.000;10.000;0; 
> load5=0.040;7.000;10.000;0; load15=0.030;7.000;10.000;0; 
> 
> Command as nagios user via ssh on Nagios server: 
> 
> ssh ulwswas01 "/usr/local/nagios/libexec/check_load -w 7,7,7 -c 
10,10,10" 
>  
> OK - load average: 0.05, 0.07, 0.04|load1=0.050;7.000;10.000;0; 
> load5=0.070;7.000;10.000;0; load15=0.040;7.000;10.000;0; 
> 
> check_by_ssh attempt as nagios user on Nagios server: 
> ./check_by_ssh -H ulwswas01 -C "/usr/local/nagios/libexec/check_load
> -w 7,7,7 -c 10,10,10" 
> Remote command execution failed: 
> 
> I get nothing in the log files concerning this and it makes little 
> sense. I have tried everything I need to have working, to the best 
> of my knowledge, as the nagios use and it works. Thanks in advance 
> for any help! 
> 

Please disregard the above request. Further research showed the -S option 
was all that was needed. Would be nice if this was documented to show it 
is needed on all systems with banners as it is currently a little harder 
to find. If it were mentioned in the -h, the banner part, I would not have 
sent the request for help initially.  I now return you to your regularly 
scheduled madness...-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] check_by_ssh failing on commands that work via remote ssh as the nagios user

2007-11-19 Thread mark . potter
Nagios: 2.10
Plugins: 1.4.9
OS: SLES 9.3

I am having some trouble figuring out how to make check_by_ssh work 
properly. I think I have done all of the usual troubleshooting and I have 
googled the issue to no avail:

Command as nagios user on the remote server:

/usr/local/nagios/libexec/check_load -w 7,7,7 -c 10,10,10
OK - load average: 0.01, 0.04, 0.03|load1=0.010;7.000;10.000;0; 
load5=0.040;7.000;10.000;0; load15=0.030;7.000;10.000;0;

Command as nagios user via ssh on Nagios server:

ssh ulwswas01 "/usr/local/nagios/libexec/check_load -w 7,7,7 -c 10,10,10"

OK - load average: 0.05, 0.07, 0.04|load1=0.050;7.000;10.000;0; 
load5=0.070;7.000;10.000;0; load15=0.040;7.000;10.000;0;

check_by_ssh attempt as nagios user on Nagios server:
/check_by_ssh -H ulwswas01 -C "/usr/local/nagios/libexec/check_load -w 
7,7,7 -c 10,10,10"
Remote command execution failed: 

I get nothing in the log files concerning this and it makes little sense. 
I have tried everything I need to have working, to the best of my 
knowledge, as the nagios use and it works. Thanks in advance for any help!

Best Regards,

Mark L. Potter
Systems Engineer
Academy Sports & Outdoors
1800 N. Mason Rd
Katy, Texas 77449
Office: 281-646-5857
Cell: 281-734-6965
-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Slow Nagios reloads with NDOUtils

2007-11-16 Thread mark . potter
I really wish I could be more descriptive in the title but I am afraid it 
would make a very long title at best and be confusing at worst. For 
starters I am using the following:

SLES 9.3
Nagios: 2.10
Nagios Plugins: 1.4.9
NDOUtils: 14b6
NagiosQL: 2.00
NagVis: 1.1

My first problem, and I am not sure it is actually a problem, is that when 
I do a reload of nagios (/etc/init.d/nagios reload) it takes, what seems 
to me to be, a long time. It is usually around 90-120 seconds for Nagios 
to start allowing use of the web interface once the reload is initiated. A 
check of the files reveals no errors (save one warning for a host with no 
services) and the nagios process shows in a ps awux list. However the web 
interface shows the "Whoops! Error: Could not read host and service status 
information!" during the 90-120 second delay I mentioned earlier. Here is 
a tail of the logfile for the shutdown and restart:

Nov 16 15:16:19 ulapnag01 nagios: ndomod: Shutdown complete.
Nov 16 15:16:19 ulapnag01 nagios: Event broker module 
'/usr/local/nagios/bin/ndomod.o' deinitialized successfully.
Nov 16 15:16:19 ulapnag01 nagios: Nagios 2.10 starting... (PID=5582)
Nov 16 15:16:19 ulapnag01 nagios: LOG VERSION: 2.0
Nov 16 15:16:19 ulapnag01 nagios: ndomod: NDOMOD 1.4b6 (09-27-2007) 
Copyright (c) 2005-2007 Ethan Galstad ([EMAIL PROTECTED])
Nov 16 15:16:19 ulapnag01 nagios: ndomod: Successfully connected to data 
sink.  0 queued items to flush.
Nov 16 15:16:19 ulapnag01 nagios: Event broker module 
'/usr/local/nagios/bin/ndomod.o' initialized successfully.
Nov 16 15:16:19 ulapnag01 ndo2db: Successfully connected to MySQL database
Nov 16 15:16:19 ulapnag01 ndo2db: Successfully disconnected from MySQL 
database
Nov 16 15:16:19 ulapnag01 nagios: Warning: Host 'qs105f7pm' has no 
services associated with it!

We are monitoring:

# Active Host / Service Checks:272 / 329 

This doesn't seem like enough to bog down a Dell PowerEdge 1855 with 2GB 
RAM onboard. Any and all help would be appreciated. I have been banging my 
head against the wall on this one. I have tried searching cannot seem to 
find these symptoms anywhere. I am being pressured to put this system into 
production but would prefer to have it restart as quickly as the current 
production system (never have had any lag reloading) so that I don't have 
to schedule daily restarts to test changes as they are being requested 
fast and furious. I apologize if I have over explained. Thanks in advance 
for any help!



Best Regards,

Mark L. Potter
Systems Engineer
Academy Sports & Outdoors

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] SNMP and Nagio newbie question.

2007-10-26 Thread mark . potter
"Matt Lozier" <[EMAIL PROTECTED]> wrote on 10/26/2007 02:10:20 PM:

> Hi Mark,
> 
> Thank you for taking the time to respond to my question ? much 
appreciated! 
> 
> One more question branching off of your input: to get *every* 
> instance of ?hrSWInstalledName.X? would I need to write a shell 
> script to serve as a wrapper program?  Or is there a way to get 
> check_snmp to do this for me?
> 
> Many thanks,
> 
> ---


What exactly are you wanting to monitor? Are you looking to make sure 
certain things are installed or trying to make sure that nothing outside 
of certain things get installed? I think that more information is needed 
in order to discern what is best for you.-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] SNMP and Nagio newbie question.

2007-10-26 Thread mark . potter
> Hello,
> 
> I?ve got Nagios installed and configured managing several Linux and 
> Windows XP boxes.  On each of the Windows machines I have the SNMP 
> utilities installed, so that if I execute:
> 
> #> snmpwalk ?v 2c ?c   
> 
> I retrieve the contents of the MIB.  The print out lists lots of 
> useful information, among which is a listing of each program that is
> installed on the Windows machine. 
> 
> My question: Is there an easy way for Nagios to retrieve this 
> information, or do I need to write a shell script that executes the 
> SNMPwalk, and parses out the needed info?
> 
> Any and all input is appreciated!
> 
> Thank you,
> 
> ---
> Matt Lozier
> IT Analyst
> 972.644.2581, ext. 248
> 972.661.2701  fax
> 
> [image removed] 
> 
> The information contained in this message or any attached document 
> is confidential and intended only for the individual(s) or entity to
> which it is addressed.   The information should be considered 
> privileged and confidential.  If you are not the intended recipient,
> you are hereby notified that any unauthorized use of the information
> contained in or transmitted with the communication, or 
> dissemination, distribution, or copying of this communication is 
> strictly prohibited by law.  If you have received this communication
> in error, please inform the sender by immediately returning this 
> communication to the sender and then deleting the original message 
> and any copy of it in your possession.
> 
-
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems?  Stop.
> Now Search log events and configuration files using AJAX and a browser.
> Download your FREE copy of Splunk now >> http://get.splunk.com/
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when 
> reporting any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null

check_snmp -h in the libexec dir will give you the information that you 
seek. You will need to define a command but check_snmp should do what you 
need it to do.-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Notification bombardment

2007-10-24 Thread mark . potter
[EMAIL PROTECTED] wrote on 10/24/2007 12:10:48 
PM:

> Hey list, 
> 
> I have an issue that just came up. Notifications are being sent
> CONSTANTLY. By this I mean, I have over 400 in my inbox from the past 15
> minutes of hosts that have gone done and recovered, gone down and
> recovered. 
> 
> The web interface through all of this has showed the hosts as up.
> however, when I go to shut off notifications until I can find the source
> of the problem, I get the wonderful "hey dork I can't read the
> nagios.cmd file. How about making sure it is actually running." error
> (In Lamens terms since I can't remember the exact text of the error.)
> 
> Now, being somewhat experienced with Nagios and from reading thousands
> of posts that I get from the list over the months, I checked to see if
> nagios truly was running. 
> 
> Low and behold it was, like 4 times. Interesting, as this morning
> everything was peachy. So I killed off the stray processes, and brought
> everything back up and did a 
> 
> ps -ef | grep nagios and as expected everything was fine and dandy 
again. 
> 
> Unfortunately the problem still exists. Right now I have the nagios box
> turned completely off, cause I can't get the notifications to shutoff in
> it's current state of disarray. 
> 
> Ubuntu 7.10 Gutsy Gibbon (which I have a feeling is the culprit), Nagios
> 3.0b1 and 1.4.7 plugins. 
> 
> Sorry for the long, drawn out email...
> 
> Stephen Valdinger
> [attachment "svalding.vcf" deleted by Mark Potter/IS/Academy] 
> 
-
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems?  Stop.
> Now Search log events and configuration files using AJAX and a browser.
> Download your FREE copy of Splunk now >> http://get.splunk.com/
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when 
> reporting any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null

I seem to remember there being notification issues in some of the 3.0bX 
versions. If this is for production it is recommended that you use the 2.X 
version (currently 2.10). The cmd file not found coult be a permissions 
issue. But as far as 3.0bX goes I wouldn't use it in production 
environment and for testing I would use the latest 3.0b5 and see if that 
changes any of the issues you might be having.-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Web Access : Can't Login

2007-10-24 Thread mark . potter
Best Regards,

Mark L. Potter
Systems Engineer
Academy Sports & Outdoors
1800 N. Mason Rd
Katy, Texas 77449
Office: 281-646-5857
Cell: 281-734-6965


[EMAIL PROTECTED] wrote on 10/24/2007 11:24:07 
AM:

> ALL --
> 
> Newbie here :
> 
> Running Nagios 3.0B5 on Fedora Core 6 : 
> 
> Everything has been working fine, even able to get a 2003 server 
monitored.
> 
> However, now, whenever I go to access the content of the nagios site (
> http://localhost/nagios or remotely on my workstation), I get a 
> username/password prompt and none of the users including root or 
> nagios can get into the site : 
> Authorization Required
> This server could not verify that you are authorized to access the 
> document requested. Either you supplied the wrong credentials (e.g.,
> bad password), or your browser doesn't understand how to supply the 
> credentials required.
> 
> Apache/2.2.6 (Fedora) Server at 192.168.0.129 Port 80
> The linux firewall is 'off' and 'selinux' is disabled.
> 
> What happened ?  What'd I do ?  What should I do ?
> 
> Any advice would be greatly appreciated ; I don't want to 
> necessarily have to start from scratch.
> 
> Best,
> 
> V.
>  __
> Do You Yahoo!?
> Tired of spam? Yahoo! Mail has the best spam protection around 
> http://mail.yahoo.com 
> 
-
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems?  Stop.
> Now Search log events and configuration files using AJAX and a browser.
> Download your FREE copy of Splunk now >> http://get.splunk.com/
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when 
> reporting any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null

It sounds like you have not properly setup your htaccess and htpasswd 
authentication. This can be found in the documentation here:

http://nagios.sourceforge.net/docs/2_0/installweb.html

and here:

http://nagios.sourceforge.net/docs/2_0/cgiauth.html

Nagios does not come with a user already setup. You will have to follow 
those docs to setup authenticated users.
-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Nagios Plugins 2.10 build problem

2007-10-24 Thread mark . potter
[EMAIL PROTECTED] wrote on 10/24/2007 10:16:49 
AM:

> Howdy, if it is a 64bit system, you might need both 32bit and 64bit 
> versions of openssl and openssl-devel installed.
> 
> I believe I had to do that on my SLES9 PPC64 systems.
> 
> 
> From: [EMAIL PROTECTED] [mailto:nagios-
> [EMAIL PROTECTED] On Behalf Of 
[EMAIL PROTECTED]
> Sent: Wednesday, October 24, 2007 08:44
> To: nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] Nagios Plugins 2.10 build problem
> 
> 
> I am having an issue with an up to date SLES 9.3 server and the 
> latest release of Nagios Plugins.  I have the proper devel packages 
> installed as shown below but the plugins still will not build: 
> 
> 
> if gcc -DLOCALEDIR=\"/usr/local/nagios/share/locale\" -
> DHAVE_CONFIG_H -I. -I. -I.. -I.. -I../lib -I../gl -I../intl   -I/
> usr/local/ssl/include  -I/usr/local/ssl/include  -g -O2 -MT 
> check_http.o -MD -MP -MF ".deps/check_http.Tpo" -c -o check_http.o 
> check_http.c; \ 
> then mv -f ".deps/check_http.Tpo" ".deps/check_http.Po"; else rm -f 
> ".deps/check_http.Tpo"; exit 1; fi 
> /bin/sh ../libtool --tag=CC --mode=link gcc  -g -O2  -L. -L/usr/
> local/ssl/lib -o check_http  check_http.o sslutils.o netutils.o 
> utils.o ../lib/libnagiosplug.a ../gl/libgnu.a  -lnsl -lresolv -lssl 
-lcrypto 
> gcc -g -O2 -o check_http check_http.o sslutils.o netutils.o utils.o 
> -L/root/scratch/nagios-plugins-1.4.10/plugins -L/usr/local/ssl/lib 
> ../lib/libnagiosplug.a ../gl/libgnu.a -lnsl -lresolv -lssl -lcrypto 
> /usr/local/ssl/lib/libcrypto.a(dso_dlfcn.o)(.text+0x247): In 
> function `dlfcn_bind_func': 
> : undefined reference to `dlsym' 
> /usr/local/ssl/lib/libcrypto.a(dso_dlfcn.o)(.text+0x293): In 
> function `dlfcn_bind_func': 
> : undefined reference to `dlerror' 
> /usr/local/ssl/lib/libcrypto.a(dso_dlfcn.o)(.text+0x323): In 
> function `dlfcn_bind_var': 
> : undefined reference to `dlsym' 
> /usr/local/ssl/lib/libcrypto.a(dso_dlfcn.o)(.text+0x391): In 
> function `dlfcn_bind_var': 
> : undefined reference to `dlerror' 
> /usr/local/ssl/lib/libcrypto.a(dso_dlfcn.o)(.text+0x3eb): In 
> function `dlfcn_unload': 
> : undefined reference to `dlclose' 
> /usr/local/ssl/lib/libcrypto.a(dso_dlfcn.o)(.text+0x485): In 
> function `dlfcn_load': 
> : undefined reference to `dlopen' 
> /usr/local/ssl/lib/libcrypto.a(dso_dlfcn.o)(.text+0x4e5): In 
> function `dlfcn_load': 
> : undefined reference to `dlclose' 
> /usr/local/ssl/lib/libcrypto.a(dso_dlfcn.o)(.text+0x50e): In 
> function `dlfcn_load': 
> : undefined reference to `dlerror' 
> 
> ~/scratch/nagios-plugins-1.4.10 # rpm -qa | grep ssl 
> openssl-0.9.7d-15.32 
> openssl-devel-0.9.7d-15.32 
> 
> I cannot see a requirement in ./configure for a newer version. I 
> tried to use gnutls and met with the same sort of issue. I feel it 
> is something obvious I am missing. Any help would be appreciated. 
> 
> Best Regards,
> 
> Mark L. Potter
> Systems Engineer
> Academy Sports & Outdoors
> 
-
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems?  Stop.
> Now Search log events and configuration files using AJAX and a browser.
> Download your FREE copy of Splunk now >> http://get.splunk.com/
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when 
> reporting any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null


Thank you for the response. It is, however not a 64 bit system.

# cat /etc/SuSE-release
SUSE LINUX Enterprise Server 9 (i586)
VERSION = 9
PATCHLEVEL = 3

However I have realized that I should be posting to the plugins list and 
not here. My apologies to everyone. I am off to post to the correct list.-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Nagios Plugins 2.10 build problem

2007-10-24 Thread mark . potter
I am having an issue with an up to date SLES 9.3 server and the latest 
release of Nagios Plugins.  I have the proper devel packages installed as 
shown below but the plugins still will not build:


if gcc -DLOCALEDIR=\"/usr/local/nagios/share/locale\" -DHAVE_CONFIG_H -I. 
-I. -I.. -I.. -I../lib -I../gl -I../intl   -I/usr/local/ssl/include 
-I/usr/local/ssl/include  -g -O2 -MT check_http.o -MD -MP -MF 
".deps/check_http.Tpo" -c -o check_http.o check_http.c; \
then mv -f ".deps/check_http.Tpo" ".deps/check_http.Po"; else rm -f 
".deps/check_http.Tpo"; exit 1; fi
/bin/sh ../libtool --tag=CC --mode=link gcc  -g -O2  -L. 
-L/usr/local/ssl/lib -o check_http  check_http.o sslutils.o netutils.o 
utils.o ../lib/libnagiosplug.a ../gl/libgnu.a  -lnsl -lresolv -lssl 
-lcrypto
gcc -g -O2 -o check_http check_http.o sslutils.o netutils.o utils.o 
-L/root/scratch/nagios-plugins-1.4.10/plugins -L/usr/local/ssl/lib 
./lib/libnagiosplug.a ../gl/libgnu.a -lnsl -lresolv -lssl -lcrypto
/usr/local/ssl/lib/libcrypto.a(dso_dlfcn.o)(.text+0x247): In function 
`dlfcn_bind_func':
: undefined reference to `dlsym'
/usr/local/ssl/lib/libcrypto.a(dso_dlfcn.o)(.text+0x293): In function 
`dlfcn_bind_func':
: undefined reference to `dlerror'
/usr/local/ssl/lib/libcrypto.a(dso_dlfcn.o)(.text+0x323): In function 
`dlfcn_bind_var':
: undefined reference to `dlsym'
/usr/local/ssl/lib/libcrypto.a(dso_dlfcn.o)(.text+0x391): In function 
`dlfcn_bind_var':
: undefined reference to `dlerror'
/usr/local/ssl/lib/libcrypto.a(dso_dlfcn.o)(.text+0x3eb): In function 
`dlfcn_unload':
: undefined reference to `dlclose'
/usr/local/ssl/lib/libcrypto.a(dso_dlfcn.o)(.text+0x485): In function 
`dlfcn_load':
: undefined reference to `dlopen'
/usr/local/ssl/lib/libcrypto.a(dso_dlfcn.o)(.text+0x4e5): In function 
`dlfcn_load':
: undefined reference to `dlclose'
/usr/local/ssl/lib/libcrypto.a(dso_dlfcn.o)(.text+0x50e): In function 
`dlfcn_load':
: undefined reference to `dlerror'

~/scratch/nagios-plugins-1.4.10 # rpm -qa | grep ssl
openssl-0.9.7d-15.32
openssl-devel-0.9.7d-15.32

I cannot see a requirement in ./configure for a newer version. I tried to 
use gnutls and met with the same sort of issue. I feel it is something 
obvious I am missing. Any help would be appreciated.

Best Regards,

Mark L. Potter
Systems Engineer
Academy Sports & Outdoors

-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Customizing statusmap.cgi

2007-10-12 Thread mark . potter
I am interested in customizing statusmap.cgi to use hostgroups instead of 
single hosts. We are already using a patch that displays the hosts on a 
google map and management wants a map of the hostgroups at location, shown 
on the map, for the NOC to watch. I won't venture off into my lack of 
support for this idea but suffice it to say I have been tasked with 
accomplishing that. The easiest method, so it seems to me at the moment, 
is to use the googlemap patch combined with patching to use hostgroups 
instead of hosts. I am not much a c hacker so I am at a disadvantage as to 
how to get this working. Any and all help would be appreciated.

Best Regards,

Mark L. Potter
Systems Engineer
Academy Sports & Outdoors
1800 N. Mason Rd
Katy, Texas 77449
Office: 281-646-5857
Cell: 281-734-6965
-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Nagios slow restarting/reloading possible NDOUtils issue?

2007-10-09 Thread mark . potter
I have gone over the documents again this morning and don't think I missed 
anything. Here is all the relevant information I can think of as well:

OS: SLES 9.3
Nagios: 2.9
Plugins: 1.4.10
NRPE: 2.9
NDOUtils: 1.4.9
Addons: NagiosQL, NagVis

This system is in the testing phase at this time so I can do anything that 
needs to be done.

When I reload nagios (/etc/init.d/nagios reload) it takes upwards of a 
full minute to bring everything back up for viewing. In the meantime a 
message is displayed concerning the status of the nagios daemon:

Error: Could not read host and service status information! 

This goes away in between 90 second and two minutes and everything begins 
to function normally and the system goes on that way until the next time I 
have to reload the configuration.

The suggestion on this page are:

1. Check the Nagios log file for messages relating to startup or status 
data errors. 

[1191938804] Caught SIGHUP, restarting...
[1191938804] ndomod: Shutdown complete.
[1191938804] Event broker module '/usr/local/nagios/bin/ndomod.o' 
deinitialized successfully.
[1191938804] Nagios 2.9 starting... (PID=22953)
[1191938804] LOG VERSION: 2.0
[1191938804] ndomod: NDOMOD 1.4b6 (09-27-2007) Copyright (c) 2005-2007 
Ethan Galstad ([EMAIL PROTECTED])
[1191938804] ndomod: Successfully connected to data sink.  0 queued items 
to flush.
[1191938804] Event broker module '/usr/local/nagios/bin/ndomod.o' 
initialized successfully.
[1191938804] Warning: Contact group 'infrasyalerts' is not used in any 
host/service definitions or host/service esc
alations!
[1191938804] Warning: Contact group 'notesadmins' is not used in any 
host/service definitions or host/service escal
ations!
[1191938804] Warning: Host performance file processing command 
'process-host-perfdata-file' was not found - host pe
rformance data file will not be processed!
[1191938804] Warning: Service performance file processing command 
'process-service-perfdata-file' was not found - s
ervice performance data file will not be processed!

2. Always verify configuration options using the -v command-line option 
before starting or restarting Nagios!

Done and nagios is actually running

3. Make sure you've compiled the main program and the CGIs to use the same 
status data storage options (i.e. text file or database). If the main 
program is storing status data in a text file and the CGIs are trying to 
read status data from a database, you'll have problems.

This is where I suspect the issue might be hiding. I am using NDOUtils and 
did not compile nagios is any special manner. I suspect, if this is 
actually an error, that it is somewhere in the text files vs. DB issue. I 
could be insane and this just could be how long it takes nagios to HUP 
when using NDOUtils. No matter the issue I suspect this list is where I 
will find my answer. Thanks in advance for everyone's time.

Best Regards,

Mark L. Potter
Systems Engineer
Academy Sports & Outdoors

-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] FW: Problem with NDOUtils 1.4b6 and MySQL

2007-10-03 Thread mark . potter
I have found the issue. The RPMs from mysql do not contain the proper 
shared objects. Installing MySQL from source solved the issue. I now have 
NDOUtils, NagVis, and various other customizations running flawlessly. 
Thank you all for the suggestions.


[EMAIL PROTECTED] wrote on 10/02/2007 08:27:02 
AM:

> -Original Message-
> From: [EMAIL PROTECTED] On Behalf Of
> [EMAIL PROTECTED]
> Sent: 02 October 2007 14:17
> 
> > I have been trying this for a day now and it is time to ask for some
> help. I have included the full 
> > output of the configure, as well as RPM output and directory listings.
> Any help would be greatly 
> > appreciated. It seems that NDO cannot find what it is looking for in
> regards to mysql yet AFAIK
> > everything is there. Please advise if something is missing, or if I
> should compile mysql from source, or
> > any other fix. Here is all the relevant information I can think of
> this morning: 
> >
> > ~/ndoutils-1.4b6 # ./configure --with-mysql-lib=/usr/lib/mysql 
> 
> Use the following make command:
> 
> ./configure --with-mysql-lib=/usr/lib/mysql
> --with-mysql-inc=/usr/include/mysql --disable-pgsql
> 
> Jonathan Wheeler
> e-Science Centre
> Rutherford Appleton Laboratory
> 
-
> This SF.net email is sponsored by: Microsoft
> Defy all challenges. Microsoft(R) Visual Studio 2005.
> http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/[attachment 
> "ATT4488962.txt" deleted by Mark Potter/IS/Academy] 
> 
-
> This SF.net email is sponsored by: Microsoft
> Defy all challenges. Microsoft(R) Visual Studio 2005.
> http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when 
> reporting any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] FW: Problem with NDOUtils 1.4b6 and MySQL

2007-10-02 Thread mark . potter
> 
> Use the following make command:
> 
> ./configure --with-mysql-lib=/usr/lib/mysql
> --with-mysql-inc=/usr/include/mysql --disable-pgsql
> 
> Jonathan Wheeler
> e-Science Centre
> Rutherford Appleton Laboratory
> 
-
> This SF.net email is sponsored by: Microsoft
> Defy all challenges. Microsoft(R) Visual Studio 2005.
> http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/[attachment 
> "ATT4488962.txt" deleted by Mark Potter/IS/Academy] 
> 
-
> This SF.net email is sponsored by: Microsoft
> Defy all challenges. Microsoft(R) Visual Studio 2005.
> http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when 
> reporting any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null

Thank you Mr Wheeler and Hugo for your advice. I have snipped the output 
of the suggested command from Mr. Wheeler.

checking for mysql_store_result in -lmysqlclient... no


*** MySQL library could not be located... **

You chose to compile NDBXT with MySQL support, but I was unable to
locate the MySQL library on your system.  If the library is
installed,  use the --with-mysql-lib argument to specify the
location of the MySQL library.

NOTE: After you install the necessary libraries on your system:
  1. Make sure /etc/ld.so.conf has an entry for the directory in
 which the MySQL libraries are installed.
  2. Run 'ldconfig' to update the run-time linker options.
  3. Run 'make devclean' in the NDBXT distribution to clean out
 any old references to your previous compile.
  4. Rerun the configure script.

TIP: Try the following

 ./configure --with-mysql-lib=/usr/lib/mysql




checking mysql/mysql.h usability... yes
checking mysql/mysql.h presence... yes
checking for mysql/mysql.h... yes
configure: creating ./config.status
config.status: creating Makefile
config.status: creating src/Makefile
config.status: creating include/config.h
config.status: include/config.h is unchanged


*** ndoutils 1.4b6 09-27-2007 ***


Type 'make' to compile the NDO utilities.


That is running : ndoutils-1.4b6 # ./configure 
--with-mysql-lib=/usr/lib/mysql --with-mysql-inc=/usr/include/mysql 
--disable-pgsql 

I should have been a little more clear. I spent yesterday going over the 
archives in depth and trying solutions for issues that seemed to be the 
same or similar. Thanks to everyone for such quick responses!


I noticed I had not specified my actual distro, although it could have 
been derived from the RPM output, I am running SLES9 with full updates, 
upgraded MySQL (as shown in my first email), and while it doesn't matter 
just yet Nagios 2.9.-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Problem with NDOUtils 1.4b6 and MySQL

2007-10-02 Thread mark . potter
I have been trying this for a day now and it is time to ask for some help. 
I have included the full output of the configure, as well as RPM output 
and directory listings. Any help would be greatly appreciated. It seems 
that NDO cannot find what it is looking for in regards to mysql yet AFAIK 
everything is there. Please advise if something is missing, or if I should 
compile mysql from source, or any other fix. Here is all the relevant 
information I can think of this morning:

~/ndoutils-1.4b6 # ./configure --with-mysql-lib=/usr/lib/mysql
checking for a BSD-compatible install... /usr/bin/install -c
checking build system type... i686-pc-linux-gnu
checking host system type... i686-pc-linux-gnu
checking for gcc... gcc
checking for C compiler default output file name... a.out
checking whether the C compiler works... yes
checking whether we are cross compiling... no
checking for suffix of executables...
checking for suffix of object files... o
checking whether we are using the GNU C compiler... yes
checking whether gcc accepts -g... yes
checking for gcc option to accept ANSI C... none needed
checking whether make sets $(MAKE)... yes
checking how to run the C preprocessor... gcc -E
checking for egrep... grep -E
checking for ANSI C header files... yes
checking whether time.h and sys/time.h may both be included... yes
checking for sys/wait.h that is POSIX.1 compatible... yes
checking for sys/types.h... yes
checking for sys/stat.h... yes
checking for stdlib.h... yes
checking for string.h... yes
checking for memory.h... yes
checking for strings.h... yes
checking for inttypes.h... yes
checking for stdint.h... yes
checking for unistd.h... yes
checking arpa/inet.h usability... yes
checking arpa/inet.h presence... yes
checking for arpa/inet.h... yes
checking ctype.h usability... yes
checking ctype.h presence... yes
checking for ctype.h... yes
checking dirent.h usability... yes
checking dirent.h presence... yes
checking for dirent.h... yes
checking dlfcn.h usability... yes
checking dlfcn.h presence... yes
checking for dlfcn.h... yes
checking errno.h usability... yes
checking errno.h presence... yes
checking for errno.h... yes
checking fcntl.h usability... yes
checking fcntl.h presence... yes
checking for fcntl.h... yes
checking float.h usability... yes
checking float.h presence... yes
checking for float.h... yes
checking getopt.h usability... yes
checking getopt.h presence... yes
checking for getopt.h... yes
checking grp.h usability... yes
checking grp.h presence... yes
checking for grp.h... yes
checking for inttypes.h... (cached) yes
checking limits.h usability... yes
checking limits.h presence... yes
checking for limits.h... yes
checking ltdl.h usability... yes
checking ltdl.h presence... yes
checking for ltdl.h... yes
checking math.h usability... yes
checking math.h presence... yes
checking for math.h... yes
checking netdb.h usability... yes
checking netdb.h presence... yes
checking for netdb.h... yes
checking netinet/in.h usability... yes
checking netinet/in.h presence... yes
checking for netinet/in.h... yes
checking pthread.h usability... yes
checking pthread.h presence... yes
checking for pthread.h... yes
checking pwd.h usability... yes
checking pwd.h presence... yes
checking for pwd.h... yes
checking regex.h usability... yes
checking regex.h presence... yes
checking for regex.h... yes
checking signal.h usability... yes
checking signal.h presence... yes
checking for signal.h... yes
checking socket.h usability... no
checking socket.h presence... no
checking for socket.h... no
checking for stdint.h... (cached) yes
checking for string.h... (cached) yes
checking for strings.h... (cached) yes
checking sys/ipc.h usability... yes
checking sys/ipc.h presence... yes
checking for sys/ipc.h... yes
checking sys/mman.h usability... yes
checking sys/mman.h presence... yes
checking for sys/mman.h... yes
checking sys/msg.h usability... yes
checking sys/msg.h presence... yes
checking for sys/msg.h... yes
checking sys/poll.h usability... yes
checking sys/poll.h presence... yes
checking for sys/poll.h... yes
checking sys/resource.h usability... yes
checking sys/resource.h presence... yes
checking for sys/resource.h... yes
checking sys/sendfile.h usability... yes
checking sys/sendfile.h presence... yes
checking for sys/sendfile.h... yes
checking sys/socket.h usability... yes
checking sys/socket.h presence... yes
checking for sys/socket.h... yes
checking for sys/stat.h... (cached) yes
checking sys/time.h usability... yes
checking sys/time.h presence... yes
checking for sys/time.h... yes
checking sys/timeb.h usability... yes
checking sys/timeb.h presence... yes
checking for sys/timeb.h... yes
checking for sys/types.h... (cached) yes
checking sys/un.h usability... yes
checking sys/un.h presence... yes
checking for sys/un.h... yes
checking for sys/wait.h... (cached) yes
checking syslog.h usability... yes
checking syslog.h presence... yes
checking for syslog.h... yes
checking tcpd.h usability... no
checking tcpd.h presence... no
checki