Re: [Nagios-users] Plugin for dmesg/kernel messages

2013-07-16 Thread Tom Throckmorton
On Tue, Jul 16, 2013 at 12:13 AM, Alex mysqlstud...@gmail.com wrote:

  Those messages didn't show up in /var/log/messages ?  That's normally
 where I catch that stuff with a log monitor.

 No, they didn't show up in syslog, but perhaps they should have.

 What application are you using to monitor syslog log messages such as
 these?


SEC[1] works really well.  You set it up as an in-line destination on your
central loghost, with actions that send events back to nagios via NSCA.

-tt

[1] http://simple-evcorr.sourceforge.net/




 Thanks,
 Alex


 --
 See everything from the browser to the database with AppDynamics
 Get end-to-end visibility with application monitoring from AppDynamics
 Isolate bottlenecks and diagnose root cause in seconds.
 Start your free trial of AppDynamics Pro today!
 http://pubads.g.doubleclick.net/gampad/clk?id=48808831iu=/4140/ostg.clktrk
 ___
 Nagios-users mailing list
 Nagios-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/nagios-users
 ::: Please include Nagios version, plugin version (-v) and OS when
 reporting any issue.
 ::: Messages without supporting info will risk being sent to /dev/null

--
See everything from the browser to the database with AppDynamics
Get end-to-end visibility with application monitoring from AppDynamics
Isolate bottlenecks and diagnose root cause in seconds.
Start your free trial of AppDynamics Pro today!
http://pubads.g.doubleclick.net/gampad/clk?id=48808831iu=/4140/ostg.clktrk___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Simulating downtime in nagios

2008-10-08 Thread Tom Throckmorton
On Oct 07 20:30, Kelly Jones wrote:
 On 10/6/08, Tom Throckmorton [EMAIL PROTECTED] wrote:
  On Oct 06 18:57, Kelly Jones wrote:
  Thanks, Tom.
 
  Yes, I'm trying to simulate a host/service outage, not scheduled downtime.
 
  The problem w/ submitting a passive check is that the next ACTIVE check
  will
  invalidate it. Example: you tell nagios that machine foo is down. That's
  soft
  alert 1, not enough to generate any emails. Nagios then active checks foo
  and
  sees that it's up. Of course, you can submit another passive check, but
  you'll ping-pong (flap) between up and down states.
 
  OK, so it sounds like you want to be able to have Nagios temporarily stop
  managing the service check scheduling for this service, long enough for you
  to
  inject some bogus results.  Seems like rescheduling the next active check
  (SCHEDULE_FORCED_SVC_CHECK) would do the right thing as far as pushing the
  next
  scheduled check into the future.  Or maybe you want to disable active checks
  for the service (DISABLE_SVC_CHECK), run your simulation, and then re-enable
  them...?
 
 I may've done it wrong, but SCHEDULE_FORCED_SVC_CHECK means that
 nagios won't send any alerts at all. 

This command manipulates the check scheduling queue for _active_ checks; it has
no direct impact on alerts:

http://www.nagios.org/developerinfo/externalcommands/commandinfo.php?command_id=129

...so forcing a service check to some time in the future will delay the active
checks, but you can still submit passive checks (and generate alerts, assuming
the result you're submitting is different than the current state of the
service)

 Basically, messing with nagios' check schedule also screws up its
 notification schedule.
 
 And, since I'm testing notifications, that's not useful.

I must be missing something here.  If, for example, I do the following for a
given service which is currently OK, and for which active checks are normally
accepted:

- delay the next check via SCHEDULE_FORCED_SVC_CHECK now + 1 hour
- submit a passive result with a state of CRITICAL
  (PROCESS_SERVICE_CHECK_RESULT) x $max_check_attempts

As expected, I see:

- an alert for each result I've submitted
- the status changes to SOFT/CRITICAL after the first result, and HARD/CRITICAL
  after $max_check_attempts has been reached
- a notification about the problem

The next scheduled check remains at the time + 1 hour.  If I submit an OK
result, the status changes from CRITICAL to OK, and I get a recovery
notification.  And I can repeat this as often as I like within the time before
the next scheduled active check.

How is this different than what you're trying to achieve?


-tt

 I've written several nagios tests myself, and they're all in one Perl
 program (each subroutine = one test). For these, simulating downtime
 is easy. The script reads downtime from a file and automatically
 exits w/ 1 or 2 during downtime instead of running the subroutine.
 
 I'm tempted to run ALL nagios tests in a wrapper, but that seems so
 ugly for such a simple? problem.
 
 -- 
 We're just a Bunch Of Regular Guys, a collective group that's trying
 to understand and assimilate technology. We feel that resistance to
 new ideas and technology is unwise and ultimately futile.


-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK  win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100url=/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Simulating downtime in nagios

2008-10-06 Thread Tom Throckmorton
On Oct 06 18:57, Kelly Jones wrote:
 Thanks, Tom.
 
 Yes, I'm trying to simulate a host/service outage, not scheduled downtime.
 
 The problem w/ submitting a passive check is that the next ACTIVE check will
 invalidate it. Example: you tell nagios that machine foo is down. That's soft
 alert 1, not enough to generate any emails. Nagios then active checks foo and
 sees that it's up. Of course, you can submit another passive check, but
 you'll ping-pong (flap) between up and down states.

OK, so it sounds like you want to be able to have Nagios temporarily stop
managing the service check scheduling for this service, long enough for you to
inject some bogus results.  Seems like rescheduling the next active check
(SCHEDULE_FORCED_SVC_CHECK) would do the right thing as far as pushing the next
scheduled check into the future.  Or maybe you want to disable active checks
for the service (DISABLE_SVC_CHECK), run your simulation, and then re-enable
them...?


-tt


-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK  win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100url=/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Simulating downtime in nagios

2008-10-06 Thread Tom Throckmorton
On Oct 06 12:29, Kelly Jones wrote:
 What's the best way to simulate (not schedule) downtime in nagios?
 
 I want to pretend a service is down for a certain amount of time to
 see what alerts nagios sends, etc.

Just to clarify, are you trying to simulate a service outage (as opposed to
simulating a scheduled downtime) so you can test alerts, and perhaps
notifications, in order to validate your configuration?

 I've come up w/ two bad ways to do this:
 
  % Edit the config file to change the test to check_dummy. I want to
  run these fire drills via cron, and editing a file and restarting
  nagios seems a little ugly.
 
  % Submit a passive check saying the service is down, and reschedule
  the next check 4 hours later, so the service is 'down' for 4
  hours. This can be done using the nagios named pipe, so it's easy to
  cron.  Problem: doing things this way suppresses the alerts (when you
  don't test a service, it doesn't send an alert).
 
 Thoughts?

I use something similar to the second method to do ad hoc validation of
alerts/notifications, by submitting passive results via an external command,
though without diddling the service check scheduling.  I'm a little confused by
your last statement though...

If you're only submitting a single passive check and then rescheduling the next
check, of course there will be no alerts (and you'll likely never reach
$max_check_attempts) - is there some reason you can't submit multiple passive
check results?

-tt

-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK  win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100url=/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Monitoring 64-bit and 32-bit servers - plugin path problem

2008-09-24 Thread Tom Throckmorton
On Sep 23 08:18, Kenneth Holter wrote:
 Hello all.
 
 
 I've set up Nagios to monitor a few 64-bit machines, and this seems to be
 working correctly. For example, my command for checking a remote disk is
 defined like this:
 
 
 define command{
 command_namecheck_remote_disk
 command_line/somepath/check_by_ssh args -C
 /usr/lib64/nagios/plugins/check_disk remote args
 }
 
 This command executes /usr/*lib64*/nagios/plugins/check_disk on the remote
 system. On 32-bit systems, on the other hand, the path to the plugin is
 /usr/*lib*/nagios/plugins/check_disk. It thus looks like I have to
 differentiate between 32-bit and 64-bit commands, and in effect devide the
 configuration itself in a 32-bit section and 64-bit section.
 
 I'm sure there is a neat way of solving this, and could use some advice. I
 guess I could symlink all over the place to get things going, but there are
 probably a better way of doing this.

Kenneth,

Here are two more ideas for you:

1) You could add some test logic into your command def, like so:

$USER1$/check_by_ssh args -C if [ -e /usr/lib64/nagios/plugins/ ] ; then 
/usr/lib64/nagios/plugins/check_disk remote args ; else 
/usr/lib/nagios/plugins/check_disk remote args ; fi

Rather ugly, but I think that would do what you want.

2) configure the remote ssh client to know it's local path to the plugins; you
could do this either in the public key, by using something like
'environment=NAGIOSPATH=/usr/lib64/plugins' after your command and host
restrictions (you _are_ using command and host restrictions in your public
keys, right? ;-), or in ~/.ssh/environment on the remote host.  Note that to
use either of those, you'd need to set PermitUserEnvironment=yes in your
sshd_config - see sshd(8) and sshd_config(5).  Once that's set, you should be
able to run:

$USER1$/check_by_ssh args -C \$NAGIOSPATH\/check_disk remote_args

...which would also get you there, I think.

Couple of disclaimers...
- allowing user environments to be set via sshd can be a security risk
- I've only tried this from the command line; you'll likely need to do some
  interesting escaping to get it to work correctly.

Also,

Gavin Carr wrote:
 We've found it easier to just leave the plugin invocations unqualified and
 add the relevant nagios plugins directory to the $PATH for the remote nagios
 user.  Works nicely here.

+1 on that one, too - simple is good.

Cheers,

-tt

-- 
Tom Throckmorton

-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK  win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100url=/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] MySQL Alive

2008-07-16 Thread Tom Throckmorton
On Jul 16 10:40, Jeff Koch wrote:
 
 Hi:
 
 We're new with nagios but learning.

Welcome :)

 We need to be able to check whether mysql is alive on a server - we don't 
 want to log in.

Is this a policy or a preference?  In other words, is there some specific
reason you don't want to / can't log in, is it OK to log in locally but not
remotely, etc?  For reference, check_mysql only requires PROCESS privs to
perform a basic check - check_mysql_query only requires SELECT.

 I've looked at the check_mysql and check_mysql_query 
 plugins and they seem to require username and password.

Indeed, because they are actually performing SQL queries, which as Stephen V.
points out will provide a higher degree of assurance that everything is working
as expected.  
 
 Can anyone advise on how to verify that MySQL is alive?

Aside from the above, here are a couple of alternatives:

- check_tcp -p 3306, which is moderatly reliable, but can produce false 
positives

- check_procs (as Stephen suggested), though that presumes you're using 
nrpe/nsca

- check_log, to watch your error log, esp. (which may again presume nrpe/nsca)

- if the db is the backend for a web application, check the content from some
  url that you _know_ generates a fetch against the db

- some combination of the above + check_cluster :)

You'll find other alternatives on nagiosexchange.org as well, I'm sure.

Hope this helps get you started,

-tt

-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK  win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100url=/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Flapping notification

2008-06-12 Thread Tom Throckmorton
On 6/6/08 7:00 AM, Cyril Jaquier wrote:
 Hi nagios users,
 
 We are currently using nagios 3.0.2 on Debian/testing. Everything works 
 fine. However, something is really annoying with flapping detection and 
 notification. Let's take an example.
 
 We have SMS notification using gnokii for CRITICAL and RECOVERED state. 
 One service goes into the CRITICAL state and admins receive an SMS. Now, 
 the host start flapping between CRITICAL and WARNING state so that 
 flapping detection inhibits the notification. Then the service recovers 
 but flapping detection is still enabled. After a while the flapping 
 detection algorithm enables the notification again.
 
 At this moment, I expect nagios to send a notification saying that the 
 service recovered. However, nothing happens and the admins do not know 
 if the service is still CRITICAL or not.
 
 Maybe we are doing something wrong but I think this makes the flapping 
 detection feature a bit unreliable as you will miss important events.
 
 Any suggestions?

Hi,

Nagios is behaving as expected - I'd suggest you re-read the docs on 
notifications, particularly the section on service and host filters:

(from http://nagios.sourceforge.net/docs/3_0/notifications.html)

 The second filter for host or service notification is a check to see
 if the host or service is flapping (if you enabled flap detection).
 If the service or host is currently flapping, no one gets notified.
 Otherwise it gets passed to the next filter. 

You might want to adjust your notification_options to include flapping 
start/stop, that is, in your host definition/template:

  notification_options  d,r,f

...or for your service:

  notification_options  c,r,f


Cheers,

-tt

-
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services for
just about anything Open Source.
http://sourceforge.net/services/buy/index.php
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] OT: snmpwalk output format query

2008-06-04 Thread Tom Throckmorton
On Jun 04 16:34, Jay R. Ashworth wrote:
 I'm trying to snmpwalk some of my ProCurve switches to find out why
 they won't report CPU-TEMP as I was led to believe they should (a
 couple do), and in the process, I figured I'd see if there was anything
 else I could monitor, Just For Fun.
 
 To make that process more expeditious, I was hoping that there was a
 way to get snmpwalk to give with *both* the numeric OIDs, and the MIB
 translations, since I need to know what the parameters *are* to decide
 if I'm interested in them... but I don't want to burden Nagios
 (remember Nagios?  This is a mailing-list about Nagios?... :-) with
 having to look them up ever 5 minutes.
 
 There doesn't seem to *be* a way to do that, though.
 
 Anyone ever try this, and figure out the magic mojo?

Hi Jay,

I find myself MIB-spelunking occaisionally as well :-)

I don't think snmpwalk can show OID's both numerically and textually in one run
- it's an either/or operation.  snmptranslate, OTOH, can show both in one
command, though only for a single OID at a time.  I find using them together to
be useful - snmpwalk to see what's there, and then snmptranslate the OID's of
interest, and since snmptranslate can give the full description of the OID, I
get the numeric, full OID, _and_ the description, so I have a better chance of
understanding what the object might mean ;-) e.g.:

 [EMAIL PROTECTED] snmptranslate -Ib -On -Td sysDescr
 .1.3.6.1.2.1.1.1
 sysDescr OBJECT-TYPE
   -- FROM   SNMPv2-MIB, RFC1213-MIB
   -- TEXTUAL CONVENTION DisplayString
   SYNTAXOCTET STRING (0..255) 
   DISPLAY-HINT255a
   MAX-ACCESSread-only
   STATUS  current
   DESCRIPTION   A textual description of the entity.  This value should
 include the full name and version identification of
 the system's hardware type, software operating-system,
 and networking software.
 ::= { iso(1) org(3) dod(6) internet(1) mgmt(2) mib-2(1) system(1) 1 }

Cheers,

-tt

-
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services for
just about anything Open Source.
http://sourceforge.net/services/buy/index.php
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Easier way to make nagios send mail as different user?

2008-05-28 Thread Tom Throckmorton
On 5/28/08 7:50 AM, Drew Weaver wrote:
 The two ways I've come up with to make nagios send notifications as a 
 different user are:

Drew, are you trying to customize the outgoing mail so that it _appears_ 
the mail is coming from a different user, or so that the mail is 
actually being emitted by a user other than the one running your nagios 
daemon?  I'm guessing it's the former...

 Write a wrapper which allows mail to be sent with normal headers

No need to write anything - just use a different/better MUA.  For 
example, I use mutt (in mailx mode) in a notification command, something 
like:

 /usr/bin/printf $MACROS_YOU_WANT_IN_THE_MESSAGE_BODY |/usr/bin/mutt -x -s 
 $MACROS_YOU_WANT_IN_THE_SUBJECT $CONTACTEMAIL

...and then in a .muttrc for the nagios user, add a 'set from=' to match 
the pretty name of the user from whom I want the mail to appear to have 
originated.  The bonus with this arrangement is that it gives me the 
opportunity to tweak additional settings on mail notifications, for 
example to set custom headers, inside .muttrc:

 set hdrs
 my_hdr X-Nagios: alert
 my_hdr X-Priority: 1

...etc.  Depending on what you're actually shooting for, you may also 
need to make some adjustments to the local MTA configuration, but if 
your goal is to change the superficial appearance of the mail, the above 
might be sufficient.

 run nagios as a different user

...how would that solve your problem?

 are there any different options?

Yes :)

Cheers,

-tt

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Problems With proc statament at snmpd.conf

2008-05-16 Thread Tom Throckmorton
On 5/16/08 9:16 AM, Saulo Augusto Silva wrote:

  ...
  proc db2sysc
  proc snmpd
 
 the snmpwalk : 
 snmpwalk -v 2c -c public 127.0.0.1 .1.3.6.1.4.1.2021.2.1
 UCD-SNMP-MIB::prIndex.1 = INTEGER: 1
 UCD-SNMP-MIB::prIndex.2 = INTEGER: 2
 UCD-SNMP-MIB::prNames.1 = STRING: db2sysc
 UCD-SNMP-MIB::prNames.2 = STRING: snmpd
 UCD-SNMP-MIB::prMin.1 = INTEGER: 0
 UCD-SNMP-MIB::prMin.2 = INTEGER: 0
 UCD-SNMP-MIB::prMax.1 = INTEGER: 0
 UCD-SNMP-MIB::prMax.2 = INTEGER: 0
 UCD-SNMP-MIB::prCount.1 = INTEGER: 0
 UCD-SNMP-MIB::prCount.2 = INTEGER: 0
 UCD-SNMP-MIB::prErrorFlag.1 = INTEGER: 1
 UCD-SNMP-MIB::prErrorFlag.2 = INTEGER: 1
 UCD-SNMP-MIB::prErrMessage.1 = STRING: No db2sysc process running.
 UCD-SNMP-MIB::prErrMessage.2 = STRING: No snmpd process running.
 UCD-SNMP-MIB::prErrFix.1 = INTEGER: 0
 UCD-SNMP-MIB::prErrFix.2 = INTEGER: 0
 UCD-SNMP-MIB::prErrFixCmd.1 = STRING: 
 UCD-SNMP-MIB::prErrFixCmd.2 = STRING:
 
 The ps -e 
 
 ps -e |grep db2sysc 
  9812 pts/000:00:00 db2syscr
  9814 pts/000:00:00 db2sysc
  9815 pts/000:00:00 db2syscr
  9816 pts/000:00:00 db2syscr
  9817 pts/000:00:00 db2syscr

Greetings Saulo,

You didn't mention which OS or net-snmp version you're using

Regardless, the default for recent vintages of net-snmp is to find 
process names by looking at the 'Name' line in /proc/$pid/status, so 
you'll need to make sure the names in there match the entry in your 
config.  From what I'm seeing, though, you should at least be getting a 
match on the snmpd process - are you by chance running with SELinux enabled?

-tt

-
This SF.net email is sponsored by: Microsoft 
Defy all challenges. Microsoft(R) Visual Studio 2008. 
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] change management or source control for Nagios configs?

2008-04-15 Thread Tom Throckmorton
On Apr 10 23:05, Max wrote:
 Hi,
 
 SVN or CVS work very well for this; I personally use SVN.  Version
 control admin directories will not interfere with Nagios parsing
 configs from directories as it looks for files that end in .cfg (as
 you pointed out in your post).
 
 Terrific way to keep a hot backup of your configs off site and to
 allow multiple configuration editors to work on files concurrently;
 you could even tie in a nice project management web interface like
 Trac for SVN and have a central place for people to easily see what
 changed when via the web as well as managing requests for changes to
 Nagios (new service / host monitoring requests etc) using the trouble
 ticketing features of a system like Trac ... and then their monitoring
 requests can be tied back to change sets by using the Milestone
 features of Trac.

+1

I'm doing basically what you've described using SVN+Jira.  On the Subversion
side, I use a pre-commit hook that does the preflight test, to keep bad configs
out of the repo (and away from Nagios).  Adding an issue number to the commit
message associates the commit with an issue; Jira shows the commit summary in
the issue detail, with links to the full changes.  On the Nagios side, I run a
script out of cron (or at will) that checks the repo for updates, cleans out
any config files that have been removed, gets the latest rev and reloads.  And
logs everything :)

One thing to keep in mind - this solution works well, but only as long as
you're still hand-editing your configs.  AFAIK, none of the front-ends support
using any kind of SCM tool for the backend, though that would be a welcome 
feature.

Cheers,

-tt


-- 
Tom Throckmorton

-
This SF.net email is sponsored by the 2008 JavaOne(SM) Conference 
Don't miss this year's exciting event. There's still time to save $100. 
Use priority code J8TL2D2. 
http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] ANNOUNCE: Blosxom4Nagios

2008-03-28 Thread Tom Throckmorton
On Mar 28 09:27, Aaron M. Segura wrote:
 On Wed, 2008-03-26 at 16:03 -0400, Tom Throckmorton wrote:
  On Mar 26 09:00, Gavin Carr wrote:
   Hi Nagios Users,
   
   I've packaged up a special-purpose instance of the 'blosxom' blogging
   engine as another way of getting nagios notifications via RSS or atom.
   
   Announcement post is here, with mentions of existing solutions, some
   examples of it's slice and dice functionality, and a few screenshots:
   
 http://www.openfusion.net/blosxom/blosxom4nagios
  
  Well done!  I like this one a little better than the others - and fairly
  painless to get it running.
  
  My first question ishow are you getting those spiffy yum/kernel
  checks? :-D
  
  -tt
  
 
 My question is: how do you get that nice nagios-esque red/yellow/green
 display?  After initial install, mine is just black text on white
 background.

I had the same issue at first :)

In my case, i'd moved the paths around a bit, and separated the code from the
state files (code in /usr/lib/nagios/blosxom, state files in
/var/log/nagios/blosxom).  I found that I hadn't caught all of the paths in
blosxom/config/*, which I discovered in my apache error_log.   Once that was
fixed, the pretty colors appeared :-D

hth,

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services for
just about anything Open Source.
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] solaris snmp memory check question - plugin

2008-03-26 Thread Tom Throckmorton
On Mar 25 09:44, August Simonelli wrote:
 hi all,
 
 not sure if there is a plugins list i could send this to so apologies if
 it's misdirected.
 
 i've been using check_snmp_mem.pl from //www.manubulon.com/nagios/ for
 remote checking of memory.

There is a list, fwiw, for users of this and related plugins, but it appears to
be mostly dead...

http://sourceforge.net/mailarchive/forum.php?forum_name=nagios-snmp-users

...so you're probably better off asking here.

 i recently implemented it against solaris 9 and 10 running net-snmp 5.0.9:
 
 bash-3.00# /usr/sfw/sbin/snmpd -v
 
 NET-SNMP version:  5.0.9
 Web:   http://www.net-snmp.org/
 Email: [EMAIL PROTECTED]
 
 unfortunately one of the OIDs doesn't answer:
 
 [EMAIL PROTECTED] libexec]# snmpget -v 1 -c mycommunity 1.2.3.4 
 1.3.6.1.4.1.2021.4.15.0
 Error in packet
 Reason: (noSuchName) There is no such variable name in this MIB.
 Failed object: UCD-SNMP-MIB::memCached.0
 ...
 I've snmpwalked but don't really understand why this oid fails (something
 about solaris not using this concept of cached memory?).

You're on the right track.  Solaris and Linux have different memory management
models - Solaris doesn't have the same concept of cached memory as does Linux,
at least in a way that net-snmp knows about.  See the description for this
object from a more recent release of net-snmp:

(from http://www.net-snmp.org/docs/mibs/ucdavis.html)

| This object will not be implemented on hosts where the
| underlying operating system does not explicitly identify
| memory as specifically reserved for this purpose.

 Does anyone know another oid i could query to get this value? Or perhaps
 share a workaround/solution you may have found?

Short of hacking that plugin a bit, you've got a couple of options - you could
just check the corresponding objects (or just the ones of concern) directly
using check_snmp.  Considering what check_snmp_mem actually checks,  you could
also use check_snmp_storage, from the same author, and only look at Real memory
and Swap, e.g.:

[EMAIL PROTECTED] $plugins/check_snmp_storage.pl -H 1.2.3.4 -C mycommunity -m 
'^Swap|^Real' -w 75 -c 90 -S 0,1 -f

Note that this is using a different mib (host-resources-mib) than
check_snmp_mem (ucd-snmp-mib) and therefore _should_ be available and yield
similar results across platforms.  There are some caveats to using this mib,
but for checking memory/swap usage, it should be fine.

Cheers,

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services for
just about anything Open Source.
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] ANNOUNCE: Blosxom4Nagios

2008-03-26 Thread Tom Throckmorton
On Mar 26 09:00, Gavin Carr wrote:
 Hi Nagios Users,
 
 I've packaged up a special-purpose instance of the 'blosxom' blogging
 engine as another way of getting nagios notifications via RSS or atom.
 
 Announcement post is here, with mentions of existing solutions, some
 examples of it's slice and dice functionality, and a few screenshots:
 
   http://www.openfusion.net/blosxom/blosxom4nagios

Well done!  I like this one a little better than the others - and fairly
painless to get it running.

My first question ishow are you getting those spiffy yum/kernel
checks? :-D

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services for
just about anything Open Source.
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Nagios

2008-03-11 Thread Tom Throckmorton
On Mar 11 18:58, Max wrote:
 Whoops,
 
 Actually, looks like you need to install the Sybase client software
 for Linux on your Nagios host unless the sybase libraries are
 available as a standalone package,

DBD::Sybase can use either the Sybase libraries (which are provided with Sybase
ASE),  or FreeTDS (www.freetds.org).  Personally, I find the FreeTDS package
easier, and much, *much* smaller (596k vs. 200+MB).

I've used both, and they seem to be functionally the same, especially for the
purposes of this plugin.

Cheers,

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Monitoring Connection attempts

2008-03-05 Thread Tom Throckmorton
On Mar 05 09:34, Matthew Macdonald-Wallace wrote:
 Hi All,
 
 Before I start coding my own plugin to do this, does anyone know of a
 plugin that monitors the number of external connection attempts over a
 given period of time for a given service and sends alerts accordingly?
 
 I've noticed on a number of servers that we maintain recently that
 there are unauthorised attempts to connect via SSH/FTP.  These appear
 in the log files about 2 seconds apart and are obviously automated.  
 
 We've got Logcheck in place which alerts us to this kind of thing
 already, however I like the idea of a nice visual/audible alert (we all
 use the nagios-plugin for firefox here).

Since you already have an investment in Logcheck, you could feed those events
directly to Nagios using NSCA.  There's an example configuration outlined in
this document:

https://www2.sans.org/reading_room/whitepapers/logging/198.php

If you're looking for something more real-time, you might consider dropping
Logcheck in favor of swatch/SEC.  

Cheers,

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Monitoring Connection attempts

2008-03-05 Thread Tom Throckmorton
On Mar 05 14:22, Michael Schwartzkopff wrote:
 1) Better use the logmatch option in the net-snmp configuration. It is quite 
 undocumented but works like a charm. nagios can read these values with 
 check_snmp.
 
 Syntax:
 logmatch name logile interval regex

That's a good method, also, if you're just looking for counts of a very general
regex.   I've found this approach useful when I need to watch a log that isn't
already getting fed to the central loghost, or I'm just looking for counters
and not content.

One caveat of logmatch is that some values are reset upon log rotation
(logMatchCurrentCounter) or reading (logMatchCounter) - it's a feature :-D

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] verify CIFS/SMB protocol negotiation?

2008-02-25 Thread Tom Throckmorton
On Feb 25 17:51, Steve Huff wrote:
 Hello folks!
 
 I'm monitoring several Samba servers (SuSE OES) that suffer from a  
 debilitating problem: periodically their authentication cache stops  
 working, and as a result none of the services that depend on that  
 authentication cache, including Samba, respond to requests.
 
 I'm currently using the check_disk_smb plugin to verify that a test  
 user can establish a CIFS connection (and also to monitor disk usage,  
 but that's secondary).  Unfortunately, when the abovementioned  
 problem occurs, check_disk_smb fails not CRITICAL, but UNKNOWN; this  
 is because the request times out instead of being rejected, since the  
 server is still listening on the port, but the SMB negotiation never  
 completes.

Hi Steve,

You could modify check_disk_smb so that it outputs CRITICAL instead of UNKNOWN
on a timeout - looks pretty straightforward to do so.  Alternately, you could
configure those services to notify on UNKNOWN.

 I thought I would try to add an additional layer of testing by using  
 check_tcp's ability to send a string and expect a string in return;  
 unfortunately, it looks like SMB doesn't talk using human-readable  
 strings (or at least it looked that way to me).  check_tcp always  
 succeeds, even when check_disk_smb will time out and fail.
 
 Anybody else run into a similar problem?  Any suggestions for another  
 way to implement the extra later of testing, or any other ideas?

Anything in your smb.log when the service dies?  Are the processes zombied?

hth,

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] using fping to find missing A records

2008-02-13 Thread Tom Throckmorton
On Feb 13 18:05, Roger wrote:
 I have a specific question on how to do something, and a larger question
 addressing other better ways to do the spirit of what I'm asking.
 
 I'm setting up Nagios, and am trying to make sure that all of the subnets
 have A records, as IP addresses will be changing very, very rapidly, and
 when that day comes, I will not be here (but on another Nagios project).
 
 So, I'd like to create A records for each host.

...so, are you trying to make sure that for every name you have an A record, or
for every A you have a PTR (i.e. a name that corresponds to the numeric)?  It
sounds like the latter, since you're thinking of doing this by looking at
numbers...

 How do I fping -a a range
 of IP addresses, strip out the names of the name that the host replies with,
 and then ping that list of names?  (You don't have to tell me how exactly, a
 general idea of which command line tools will do)
 
 I'm thinking something like: fping -g 192.168.1.0 192.168.1.255 -A | grep
 whatever  file, and then shove that file in another fping statement.

If I understand correctly what you're shooting for, it can be done in one step:

[EMAIL PROTECTED] fping -ang 192.168.1.0/24

...would return

- all hosts that are alive
- by name (if one exists)
- for a generated range (192.168.1.0..192.168.1.255)

If you want to only show hosts for which there is no PTR record, just 'grep -v'
your target domain(s) from the output.

-tt

p.s.  Hopefully, none of those hosts in that range are blocking your pings :)

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] ndo2db database query

2008-01-31 Thread Tom Throckmorton
On Jan 31 18:51, Tim Kay wrote:
 Hello list,
 I am trying to query the nagios database entries generated by ndo2db 
 (Nagios 3.0rc1/ndoutils 1.47b) to pull out only those entries in the 
 servicestatus table that are in current_state != 0 and whose 
 contactgroup is testing.

 I have a contact group testing and that contact group is set up in the
 service template for (at the moment) all services. I can see the contact
 group in the SQL contactgroups table and I can see the members of the
 contactgroup in the SQL contactgroup_members table but there are no entries
 in the service_contacts table. Is this a config error on my part?

I don't think so - that table as well as host_contacts is empty for me also, on
2.10/ndoutils 1.4b6 - I'm wondering if those tables just aren't getting updated
as they should...

 I have:
 
 define contactgroup{
 deletia
 }
 
 
 the testcontact is notified when the load on server1 exceeds the warn / 
 critical limits but I have no way of associating the contact with the 
 service just through querying the database
 
Just to be clear - you're just trying to validate a contact for a given service
in a given state, right?  I'm using something similar to show effective contact
groups for hosts; you'll have to do a few JOINS to get what you want.  Have a
look at the sample queries that came w/ ndoutils for some good examples.

-tt


-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] flap_detection_enabled persistence acrossreload/restart?

2008-01-24 Thread Tom Throckmorton
On Jan 23 17:35, Marc Powell wrote:
  Before I dig further, can anyone verify that the state of host or
 service
  flap detection (enabled/disabled) should or shouldn't be persistent across
 a
  reload/restart?  I've combed the docs and archives, and am coming up
 dry.
 
 The retention file should take precedence. 

Thanks, that's what I thought - it appears to for all but the flapping options.
I should add that this applies to the global flap detection option as well.

Would you (or anyone) mind testing this on your own installation:

- find a host/service for which flapping is enabled
- disable flap detection for that host/service; wait until that state is
  reflected in the extended info
- reload the configs
- check the state of flap detection - has it reverted to enabled?

 Not all directives can be overridden by the retention file but
 flap_detection_enabled is one of them
 (http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#service).

I'm not sure what you mean here - which directives _can't_ be overridden by the
retention file?

 Your configuration snippets look accurate, assuming they're applied as
 you indicate.

They were cut and pasted from live configs, so yeah.

Cheers,

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] flap_detection_enabled persistenceacrossreload/restart?

2008-01-24 Thread Tom Throckmorton
On Jan 24 09:42, Marc Powell wrote:
 
 
  -Original Message-
  From: [EMAIL PROTECTED] [mailto:nagios-users-
  [EMAIL PROTECTED] On Behalf Of Tom Throckmorton
  Sent: Thursday, January 24, 2008 8:12 AM
  To: Marc Powell
  Cc: nagios-users@lists.sourceforge.net
  Subject: Re: [Nagios-users] flap_detection_enabled
  persistenceacrossreload/restart?
  
 
 
  Would you (or anyone) mind testing this on your own installation:
  
  - find a host/service for which flapping is enabled
  - disable flap detection for that host/service; wait until that state
 is
reflected in the extended info
  - reload the configs
  - check the state of flap detection - has it reverted to enabled?
 
 I can confirm that it ignores the retention file setting with
 nagios-2.7. I don't have a more recent version up to test with. 

Thanks.  Bummer - I'm still trying to find something that points to this being
intentional.  I'll ask on nagios-devel...

   Not all directives can be overridden by the retention file but
   flap_detection_enabled is one of them
   (http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#service).
  
  I'm not sure what you mean here - which directives _can't_ be
 overridden
  by the
  retention file?
 
 Any that aren't marked with a *. Click the * or read the Retention Notes
 at the top of the page.

Oh, duh.  *kicks self in head*

http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#retention_notes

   Your configuration snippets look accurate, assuming they're applied
 as
   you indicate.
  
  They were cut and pasted from live configs, so yeah.
 
 I meant that the template really was applied to the hosts/services.

Well, assuming any status data that gets fed to the state files and via the NEB
is accurate, then my configs match my live settings.  I'm verifying by seeing
what ends up in the status_file and state_retention_file, as well as the
NDO-fed database.

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] NDO Troubles

2008-01-23 Thread Tom Throckmorton
On Jan 23 11:00, Giles Coochey wrote:
  
 Have you tried the advice of someone else to give InnoDB loads of
 memory? I would like to try that, but have no idea as to how to do it!
 Perhaps someone can advise?

Have a look at the sample mysql configs that come w/ the mysql-server package
(/usr/share/doc/mysql-server-version, on Red Hat and friends, at least).  You
could use/adapt one of the configs intended for larger environments, depending
on your configuration.  If nothing else, that's a good starting point.

I'd also recommend checking out http://www.mysqlperformanceblog.com/,
particularly the section on performance presentations.  Good stuff.

-tt


-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] flap_detection_enabled persistence across reload/restart?

2008-01-23 Thread Tom Throckmorton
Nagios 2.10, CentOS 5 x86_64 / i386, NDOUtils 1.4b6

Hi all,

I've noticed something odd wrt flapping, and am wondering if I'm overlooking
something simple, or just misunderstanding the way flapping and state retention
are supposed to work.

First, just to verify, my main nagios config has:

 enable_flap_detection=1
 retain_state_information=1
 state_retention_file=/var/log/nagios/retention.dat
 retention_update_interval=1,5,60 or even 0, doesn't matter
 use_retained_program_state=1
 use_retained_scheduling_info=1

In my global host and service templates that all hosts use, I have:

 flap_detection_enabled  1
 retain_status_information   1
 retain_nonstatus_information1

I've also disabled flap detection explicitly for a few hosts, and it all works
as advertised.

However, for any host/service, if I manually disable/enable flap detection via
an external command / cgi, and then reload or restart, the option reverts to
whatever is in the config for that given host/service.  This happens regardless
of whether 1) the option is being set in retention.dat, which it is, 2) the
NDO-fed database thinks flapping for this host/service is disabled, or 3) w/out
the broker_module enabled.  Other state options are being preserved (such as
active_checks_enabled) both at retention_update_interval and on a reload.

So, what I think I'm seeing then is that during a reload, the
flap_detection_enabled option isn't getting read from the retention file at
program start, though I'd expect it to be persistent just as the other options
are.  FWIW, same behavior on earlier Nagios 2.x releases, but 1.x seems to do
the right thing.

Before I dig further, can anyone verify that the state of host or service flap
detection (enabled/disabled) should or shouldn't be persistent across a
reload/restart?  I've combed the docs and archives, and am coming up dry.

Thanks,

-tt


-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] check_snmp_storage.pl returns Put snmp login info!

2008-01-19 Thread Tom Throckmorton
On Jan 18 18:44, Tom Throckmorton wrote:
 ...you have an unnecessary trailing '!' on the check_command, which I think
 will be silently ignored, based on your command_line def), but I can't recall
 whether or not spaces in the ARGn macros are legal (I think not, but the docs
 don't specify and I haven't tested).

Just to follow up, I did test, and they're legal, so you can ignore my previous
post about adjusting the format of your command defs.

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] ndoutil question

2008-01-18 Thread Tom Throckmorton
On 01/18/2008 10:26 AM, Alex Dehaini wrote:
 Guys,
 Any other ideas on this. This is the output I get when I try to start NDO
 
 snmp:/# /usr/local/nagios/bin/ndo2db -c /usr/local/nagios/etc/ndo2db.cfg
 Support for the specified database server is either not yet supported, or
 was not found on your system.

Silly question:

What is db_servertype set to in your ndo2db.cfg?  Does it match the name 
of the mysql package?

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] check_snmp_storage.pl returns Put snmp login info!

2008-01-18 Thread Tom Throckmorton
On 01/18/2008 04:24 PM, Bret Goodfellow wrote:
 I have the snmpd running on a client box.  I have nagois monitoring this
 box without any issues.  I decided to implement a new plugin I found,
 check_snmp_storage.pl.  From the nagios server I can issue this
 command to the client box, and I get a valid response, e.g.
  
  ...
 My command is defined as:
  
 define command{
 command_namecheck_snmp_storage
 command_line$USER$/check_snmp_storage.pl -H $HOSTADDRESS$ $ARG1$

$USER$ won't get you much, I don't think.  Maybe you meant $USER1$ ?

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] check_snmp_storage.pl returns Put snmp login info!

2008-01-18 Thread Tom Throckmorton
On 01/18/2008 05:39 PM, Bret Goodfellow wrote:
 Here's what I get:
 ./check_snmp_storage.pl -H somehost -C community -m /tmp -w 95 -c 98 -f
 -v
 Alarm at 15
 SNMP v1 login
 Filter : /tmp
  .
  .
  .
 /tmp: 6%used(221MB/3937MB) (95%) : OK

Reading the check_snmp_storage code again, I see that you'll get that error if
you haven't specified a community.  Looking at your check_command in the
service definition:

 check_commandcheck_snmp_storage!-C public -m /temp -w 95 -c 98 
 -f!

...you have an unnecessary trailing '!' on the check_command, which I think
will be silently ignored, based on your command_line def), but I can't recall
whether or not spaces in the ARGn macros are legal (I think not, but the docs
don't specify and I haven't tested).

The usual form is something like:

In your checkcommand def:

..
command_name  check_foo
command_line  $USERn$/check_foo.pl -H $HOSTADDRESS$ $ARG1$ $ARG2$ $ARG3$
...

In your service def:

...
host_name   somehost
check_command  check_foo!arg1value!arg2value!arg3value
...

Can you try formatting your commands as such?

-tt


-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] check_snmp_storage.pl returns Put snmp login info!

2008-01-18 Thread Tom Throckmorton
On 01/18/2008 05:07 PM, Bret Goodfellow wrote:
 It actually is $USER1$.  I just typed it incorrectly in this email.
 Thanks 

OK.  That error looks vaguely like an SNMPv3 login failure.

What do you see when you run the plugin from the command-line with a 
'-v' at the end?  It should show you the SNMP version:

 ./check_snmp_storage.pl -H somehost -C community -m /foo -w 95 -c 98 -f -v
 Alarm at 15
 SNMP v1 login
 Filter : /foo
 OID : 1.3.6.1.2.1.25.2.3.1.3.3, Desc : Swap Space
 OID : 1.3.6.1.2.1.25.2.3.1.3.8, Desc : /proc/sys/fs/binfmt_misc
 OID : 1.3.6.1.2.1.25.2.3.1.3.2, Desc : Real Memory
 ... etc.

-tt


-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] new plugin

2008-01-17 Thread Tom Throckmorton
On 01/14/2008 11:22 AM, [EMAIL PROTECTED] wrote:
 I've just written a script that checks to see if the ndo2db daemon is
 running and if it is not, deletes the sock file and restarts it. This is
 how it runs in our current setup. It's just a simple shell script, but
 it gets the job done.

Thanks for sharing and posting your script.

You might also consider using check_procs + an eventhandler, which would 
give you the advantage of having some additional flexibility around how 
and when the process is restarted.

Out of curiosity, are you actually seeing ndo2db _die_, or just stop 
working?

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] NDOUtils

2008-01-17 Thread Tom Throckmorton
On 01/17/2008 12:12 PM, Matthias Kloth wrote:
 Tom Throckmorton schrieb:
 | On Jan 11 08:51, [EMAIL PROTECTED] wrote:
 | Has anyone got a plugin or script they use to check the status of the
 | ndo2db daemon? I've been looking on nagiosexchange and through google
 | and am yet to find one. I'm not much of a programmer, so I really can't
 | right one up myself or I would. Any help on this issue would be
 wonderful.
 |
 | I'm using check_mysql_query to check that db updates aren't stale:
 |
 |   check_mysql_query -H naghost -d nagios_db -u readonlyuser -p
 somepass \
 |   -q SELECT NOW() - status_update_time FROM nagios_programstatus -w
 120 -c 300
 |
 | If ndo2db is dead/failed to start, or there are other issues (corrupt
 tables),
 | I believe this will catch it.  You could also do something like watch
 the nagios.log
 | for the dreaded unable to connect to data sink message, or check
 that the
 | ndo2db process is running, though I'm finding that checking the db for
 | freshness is a good indicator.
 |
 | -tt
 |
 |
 There is a plugin that checks for the ndo2db daemon called . You can
 find it on
 http://www.nagiosexchange.org/Check_Plugins.21.0.html?tx_netnagext_pi1[p_view]=1203.

Thanks, Matthias.  I saw svalding's later post with the shell script.

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] NDOUtils

2008-01-11 Thread Tom Throckmorton
On Jan 11 08:51, [EMAIL PROTECTED] wrote:
 Has anyone got a plugin or script they use to check the status of the
 ndo2db daemon? I've been looking on nagiosexchange and through google
 and am yet to find one. I'm not much of a programmer, so I really can't
 right one up myself or I would. Any help on this issue would be wonderful. 

I'm using check_mysql_query to check that db updates aren't stale:

  check_mysql_query -H naghost -d nagios_db -u readonlyuser -p somepass \
  -q SELECT NOW() - status_update_time FROM nagios_programstatus -w 120 -c 300

If ndo2db is dead/failed to start, or there are other issues (corrupt tables),
I believe this will catch it.  You could also do something like watch the 
nagios.log
for the dreaded unable to connect to data sink message, or check that the
ndo2db process is running, though I'm finding that checking the db for
freshness is a good indicator.

-tt


-- 
Tom Throckmorton
OIT - CSI
Duke University

-
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services for
just about anything Open Source.
http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] best solution for configuration changes

2008-01-03 Thread Tom Throckmorton
On 01/03/2008 12:44 PM, Max wrote:
 On the Nagios server side you could have a cron job that updated the
 configs every N minutes or even a sub-version commit trigger that
 would run a nagios -v /path/nagios.cfg check on your configuration
 and either 1) restart Nagios if the configuration is good or 2) alert
 you if someone has broken the configs.

...or, do the pre-flight check using a pre-commit hook, so that the 
broken configs never make it to the nagios server in the first place ;-)

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Slow Nagios reloads with NDOUtils

2007-11-27 Thread Tom Throckmorton
On 11/16/2007 04:23 PM, [EMAIL PROTECTED] wrote:

 My first problem, and I am not sure it is actually a problem, is that when 
 I do a reload of nagios (/etc/init.d/nagios reload) it takes, what seems 
 to me to be, a long time. It is usually around 90-120 seconds for Nagios 
 to start allowing use of the web interface once the reload is initiated.

Mark,

I encountered the same issue, spent a while poking at NDO and found that 
tuning down the table trimming options in ndo2db.cfg helped reduce the 
time of this symptom by a third or more, depending on how short of an 
age I chose.  This also cut the user cpu time (taken up by mysqld) by 
about half, as well as the storage and I/O overhead.  YMMV, of course, 
but I found that at this point I don't need to keep the data in those 
very busy tables longer than an hour or so - it'll be *very* useful of 
course when I have an interface that can use that data, but for now, 
keeping it for only a short period is fine.

IANA db expert, but from what I can tell and as Ton suggests, the table 
indexing doesn't appear to be optimal - I see a high 
handler_read_rnd_next count, which usually indicates unnecessary table 
scans).  As far as this particular symptom goes, though, I think it 
might be the indexing in conjunction with the re-sending of retained 
status data that's the culprit here (see the blurb on 'Do not resend 
retained status to NDO' on 
http://altinity.blogs.com/dotorg/2007/09/nagios-patch-da.html)

 We are monitoring:
 
 # Active Host / Service Checks:272 / 329 
 This doesn't seem like enough to bog down a Dell PowerEdge 1855 with 2GB 
 RAM onboard.

For comparison, I'm running one of our hosts (Nagios 2.10, ndoutils 
1.4b6, MySQL 5.0.22 on CentOS 5, active hosts/services 504/2771), on a 
PowerEdge 1850 w/ 2GB :)  Without any ndo2db tuning, I was seeing around 
a 30-second wait while the status.dat was being recreated (presumably 
waiting on ndomod).  After tuning, I was able to drop that time from ~30 
seconds to between 6-8.

One caveat - if you do choose to trim those ages, you'll note a longer 
pause after the next Nagios restart, as NDO goes about dropping older 
rows.  Once that's complete, be sure to optimize the relevant tables to 
reclaim the unused space.

HTH,

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] putting limits on check_by_ssh

2007-11-16 Thread Tom Throckmorton
On 11/16/2007 01:06 PM, Dave wrote:
 I've been RTFMing SSH. For background authentication like nagios uses,
 the book I'm reading recommends using the user's ssh config file to
 limit using passwordless keys to just do one task each. So if you want
 nagios to be able to do 3 kinds of checks without a password, you put
 3 keys in nagios .ssh/authorized_keys file with command= stuff for
 each. Then no matter what nagios thinks it is asking for (parameter of
 check_by_ssh) it gets whatever is configured for that key.
 
 I'm just wondering if anyone has taken this approach. It seems a bit
 complicated, spreading some of the nagios config info around to each
 monitored system, but it sort of appeals to me. Then you know that
 even if someone manages to get your key, all they can do is check_disk
 or something else boring.

I do something similar, though also add a 'from' restriction, in the 
event the private key is compromised - here are a few examples:

For remote nagios instance monitoring:

 from=host-01.full.domain,command=/usr/lib/nagios/plugins/check_nagios -F 
 /dev/shm/status.log -e 5 -C nagios ssh-dss ...
 from=host-02.full.domain,command=/usr/lib/nagios/plugins/check_nagios -F 
 /dev/shm/status.log -e 5 -C nagios ssh-dss ...

*dons protective aluminum foil headgear*

In the the keys on _some_ systems, I also add:

  no-port-forwarding
  no-X11-forwarding
  no-agent-forwarding

I usually only allow a single command per host - on hosts which I want 
to execute multiple commands, rather than having a keypair-per-command, 
I make the command a script which sanitizes the input and checks the 
command against a list of predetermined allowed commands.  Probably not 
as safe as keypairs-per-command because it's not as explicit, but more 
flexible/manageable in certain situations.

 Am I missing something? In addition to limiting key authentication to
 doing specific tasks, I also put an '*' in the nagios user's password
 field in /etc/passwd, which prevents them from logging in by password.

For the nagios user (and other system-ish accounts), setting the shell 
to /sbin/nologin should do the trick.

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Nagios - SNMP v3

2007-08-14 Thread Tom Throckmorton
On Aug 14 12:18, Lopez, Denise wrote:
 I am in the process of rebuilding our Nagios server from RHEL AS 4 to
 RHEL 5.  The version of net-snmp on the older server is net-snmp 5.1.2
 and the build for RHEL 5 is net-snmp 5.3.1.  
 
 I can get SNMP version 2 working on the new server but I keep getting
 'No response from Host' when query using the same credentials as I am
 using for the old server that is getting a response back.
 
 We have verified that all ACL's are allowing the traffic and there is no
 ACL on the device I am querying that would be blocking the response. We
 also observed that the traffic pattern from the older server is
 different from the newer server. I can see a one for one packet response
 from the older server to the device but from the newer server there is
 about 6 requests and finally we see a packet response from the device.
 
 I was just wondering if net-snmp version 3 works differently in the
 newer version and how I can resolve this issue?

Are you having a problem with check_snmp, or SNMP in general?

It's likely this bug in net-snmp:

https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=251332
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=228066

The fc6 patch works on rhel5, so you could wait for the upstream patch, or
rebuild with the patch from fc6.

-tt

-- 
Tom Throckmorton
OIT - CSI
Duke University

-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now   http://get.splunk.com/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null