Re: [Nagios-users] Plugin for dmesg/kernel messages
On Tue, Jul 16, 2013 at 12:13 AM, Alex mysqlstud...@gmail.com wrote: Those messages didn't show up in /var/log/messages ? That's normally where I catch that stuff with a log monitor. No, they didn't show up in syslog, but perhaps they should have. What application are you using to monitor syslog log messages such as these? SEC[1] works really well. You set it up as an in-line destination on your central loghost, with actions that send events back to nagios via NSCA. -tt [1] http://simple-evcorr.sourceforge.net/ Thanks, Alex -- See everything from the browser to the database with AppDynamics Get end-to-end visibility with application monitoring from AppDynamics Isolate bottlenecks and diagnose root cause in seconds. Start your free trial of AppDynamics Pro today! http://pubads.g.doubleclick.net/gampad/clk?id=48808831iu=/4140/ostg.clktrk ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- See everything from the browser to the database with AppDynamics Get end-to-end visibility with application monitoring from AppDynamics Isolate bottlenecks and diagnose root cause in seconds. Start your free trial of AppDynamics Pro today! http://pubads.g.doubleclick.net/gampad/clk?id=48808831iu=/4140/ostg.clktrk___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Simulating downtime in nagios
On Oct 07 20:30, Kelly Jones wrote: On 10/6/08, Tom Throckmorton [EMAIL PROTECTED] wrote: On Oct 06 18:57, Kelly Jones wrote: Thanks, Tom. Yes, I'm trying to simulate a host/service outage, not scheduled downtime. The problem w/ submitting a passive check is that the next ACTIVE check will invalidate it. Example: you tell nagios that machine foo is down. That's soft alert 1, not enough to generate any emails. Nagios then active checks foo and sees that it's up. Of course, you can submit another passive check, but you'll ping-pong (flap) between up and down states. OK, so it sounds like you want to be able to have Nagios temporarily stop managing the service check scheduling for this service, long enough for you to inject some bogus results. Seems like rescheduling the next active check (SCHEDULE_FORCED_SVC_CHECK) would do the right thing as far as pushing the next scheduled check into the future. Or maybe you want to disable active checks for the service (DISABLE_SVC_CHECK), run your simulation, and then re-enable them...? I may've done it wrong, but SCHEDULE_FORCED_SVC_CHECK means that nagios won't send any alerts at all. This command manipulates the check scheduling queue for _active_ checks; it has no direct impact on alerts: http://www.nagios.org/developerinfo/externalcommands/commandinfo.php?command_id=129 ...so forcing a service check to some time in the future will delay the active checks, but you can still submit passive checks (and generate alerts, assuming the result you're submitting is different than the current state of the service) Basically, messing with nagios' check schedule also screws up its notification schedule. And, since I'm testing notifications, that's not useful. I must be missing something here. If, for example, I do the following for a given service which is currently OK, and for which active checks are normally accepted: - delay the next check via SCHEDULE_FORCED_SVC_CHECK now + 1 hour - submit a passive result with a state of CRITICAL (PROCESS_SERVICE_CHECK_RESULT) x $max_check_attempts As expected, I see: - an alert for each result I've submitted - the status changes to SOFT/CRITICAL after the first result, and HARD/CRITICAL after $max_check_attempts has been reached - a notification about the problem The next scheduled check remains at the time + 1 hour. If I submit an OK result, the status changes from CRITICAL to OK, and I get a recovery notification. And I can repeat this as often as I like within the time before the next scheduled active check. How is this different than what you're trying to achieve? -tt I've written several nagios tests myself, and they're all in one Perl program (each subroutine = one test). For these, simulating downtime is easy. The script reads downtime from a file and automatically exits w/ 1 or 2 during downtime instead of running the subroutine. I'm tempted to run ALL nagios tests in a wrapper, but that seems so ugly for such a simple? problem. -- We're just a Bunch Of Regular Guys, a collective group that's trying to understand and assimilate technology. We feel that resistance to new ideas and technology is unwise and ultimately futile. - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Simulating downtime in nagios
On Oct 06 18:57, Kelly Jones wrote: Thanks, Tom. Yes, I'm trying to simulate a host/service outage, not scheduled downtime. The problem w/ submitting a passive check is that the next ACTIVE check will invalidate it. Example: you tell nagios that machine foo is down. That's soft alert 1, not enough to generate any emails. Nagios then active checks foo and sees that it's up. Of course, you can submit another passive check, but you'll ping-pong (flap) between up and down states. OK, so it sounds like you want to be able to have Nagios temporarily stop managing the service check scheduling for this service, long enough for you to inject some bogus results. Seems like rescheduling the next active check (SCHEDULE_FORCED_SVC_CHECK) would do the right thing as far as pushing the next scheduled check into the future. Or maybe you want to disable active checks for the service (DISABLE_SVC_CHECK), run your simulation, and then re-enable them...? -tt - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Simulating downtime in nagios
On Oct 06 12:29, Kelly Jones wrote: What's the best way to simulate (not schedule) downtime in nagios? I want to pretend a service is down for a certain amount of time to see what alerts nagios sends, etc. Just to clarify, are you trying to simulate a service outage (as opposed to simulating a scheduled downtime) so you can test alerts, and perhaps notifications, in order to validate your configuration? I've come up w/ two bad ways to do this: % Edit the config file to change the test to check_dummy. I want to run these fire drills via cron, and editing a file and restarting nagios seems a little ugly. % Submit a passive check saying the service is down, and reschedule the next check 4 hours later, so the service is 'down' for 4 hours. This can be done using the nagios named pipe, so it's easy to cron. Problem: doing things this way suppresses the alerts (when you don't test a service, it doesn't send an alert). Thoughts? I use something similar to the second method to do ad hoc validation of alerts/notifications, by submitting passive results via an external command, though without diddling the service check scheduling. I'm a little confused by your last statement though... If you're only submitting a single passive check and then rescheduling the next check, of course there will be no alerts (and you'll likely never reach $max_check_attempts) - is there some reason you can't submit multiple passive check results? -tt - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring 64-bit and 32-bit servers - plugin path problem
On Sep 23 08:18, Kenneth Holter wrote: Hello all. I've set up Nagios to monitor a few 64-bit machines, and this seems to be working correctly. For example, my command for checking a remote disk is defined like this: define command{ command_namecheck_remote_disk command_line/somepath/check_by_ssh args -C /usr/lib64/nagios/plugins/check_disk remote args } This command executes /usr/*lib64*/nagios/plugins/check_disk on the remote system. On 32-bit systems, on the other hand, the path to the plugin is /usr/*lib*/nagios/plugins/check_disk. It thus looks like I have to differentiate between 32-bit and 64-bit commands, and in effect devide the configuration itself in a 32-bit section and 64-bit section. I'm sure there is a neat way of solving this, and could use some advice. I guess I could symlink all over the place to get things going, but there are probably a better way of doing this. Kenneth, Here are two more ideas for you: 1) You could add some test logic into your command def, like so: $USER1$/check_by_ssh args -C if [ -e /usr/lib64/nagios/plugins/ ] ; then /usr/lib64/nagios/plugins/check_disk remote args ; else /usr/lib/nagios/plugins/check_disk remote args ; fi Rather ugly, but I think that would do what you want. 2) configure the remote ssh client to know it's local path to the plugins; you could do this either in the public key, by using something like 'environment=NAGIOSPATH=/usr/lib64/plugins' after your command and host restrictions (you _are_ using command and host restrictions in your public keys, right? ;-), or in ~/.ssh/environment on the remote host. Note that to use either of those, you'd need to set PermitUserEnvironment=yes in your sshd_config - see sshd(8) and sshd_config(5). Once that's set, you should be able to run: $USER1$/check_by_ssh args -C \$NAGIOSPATH\/check_disk remote_args ...which would also get you there, I think. Couple of disclaimers... - allowing user environments to be set via sshd can be a security risk - I've only tried this from the command line; you'll likely need to do some interesting escaping to get it to work correctly. Also, Gavin Carr wrote: We've found it easier to just leave the plugin invocations unqualified and add the relevant nagios plugins directory to the $PATH for the remote nagios user. Works nicely here. +1 on that one, too - simple is good. Cheers, -tt -- Tom Throckmorton - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] MySQL Alive
On Jul 16 10:40, Jeff Koch wrote: Hi: We're new with nagios but learning. Welcome :) We need to be able to check whether mysql is alive on a server - we don't want to log in. Is this a policy or a preference? In other words, is there some specific reason you don't want to / can't log in, is it OK to log in locally but not remotely, etc? For reference, check_mysql only requires PROCESS privs to perform a basic check - check_mysql_query only requires SELECT. I've looked at the check_mysql and check_mysql_query plugins and they seem to require username and password. Indeed, because they are actually performing SQL queries, which as Stephen V. points out will provide a higher degree of assurance that everything is working as expected. Can anyone advise on how to verify that MySQL is alive? Aside from the above, here are a couple of alternatives: - check_tcp -p 3306, which is moderatly reliable, but can produce false positives - check_procs (as Stephen suggested), though that presumes you're using nrpe/nsca - check_log, to watch your error log, esp. (which may again presume nrpe/nsca) - if the db is the backend for a web application, check the content from some url that you _know_ generates a fetch against the db - some combination of the above + check_cluster :) You'll find other alternatives on nagiosexchange.org as well, I'm sure. Hope this helps get you started, -tt - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Flapping notification
On 6/6/08 7:00 AM, Cyril Jaquier wrote: Hi nagios users, We are currently using nagios 3.0.2 on Debian/testing. Everything works fine. However, something is really annoying with flapping detection and notification. Let's take an example. We have SMS notification using gnokii for CRITICAL and RECOVERED state. One service goes into the CRITICAL state and admins receive an SMS. Now, the host start flapping between CRITICAL and WARNING state so that flapping detection inhibits the notification. Then the service recovers but flapping detection is still enabled. After a while the flapping detection algorithm enables the notification again. At this moment, I expect nagios to send a notification saying that the service recovered. However, nothing happens and the admins do not know if the service is still CRITICAL or not. Maybe we are doing something wrong but I think this makes the flapping detection feature a bit unreliable as you will miss important events. Any suggestions? Hi, Nagios is behaving as expected - I'd suggest you re-read the docs on notifications, particularly the section on service and host filters: (from http://nagios.sourceforge.net/docs/3_0/notifications.html) The second filter for host or service notification is a check to see if the host or service is flapping (if you enabled flap detection). If the service or host is currently flapping, no one gets notified. Otherwise it gets passed to the next filter. You might want to adjust your notification_options to include flapping start/stop, that is, in your host definition/template: notification_options d,r,f ...or for your service: notification_options c,r,f Cheers, -tt - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] OT: snmpwalk output format query
On Jun 04 16:34, Jay R. Ashworth wrote: I'm trying to snmpwalk some of my ProCurve switches to find out why they won't report CPU-TEMP as I was led to believe they should (a couple do), and in the process, I figured I'd see if there was anything else I could monitor, Just For Fun. To make that process more expeditious, I was hoping that there was a way to get snmpwalk to give with *both* the numeric OIDs, and the MIB translations, since I need to know what the parameters *are* to decide if I'm interested in them... but I don't want to burden Nagios (remember Nagios? This is a mailing-list about Nagios?... :-) with having to look them up ever 5 minutes. There doesn't seem to *be* a way to do that, though. Anyone ever try this, and figure out the magic mojo? Hi Jay, I find myself MIB-spelunking occaisionally as well :-) I don't think snmpwalk can show OID's both numerically and textually in one run - it's an either/or operation. snmptranslate, OTOH, can show both in one command, though only for a single OID at a time. I find using them together to be useful - snmpwalk to see what's there, and then snmptranslate the OID's of interest, and since snmptranslate can give the full description of the OID, I get the numeric, full OID, _and_ the description, so I have a better chance of understanding what the object might mean ;-) e.g.: [EMAIL PROTECTED] snmptranslate -Ib -On -Td sysDescr .1.3.6.1.2.1.1.1 sysDescr OBJECT-TYPE -- FROM SNMPv2-MIB, RFC1213-MIB -- TEXTUAL CONVENTION DisplayString SYNTAXOCTET STRING (0..255) DISPLAY-HINT255a MAX-ACCESSread-only STATUS current DESCRIPTION A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating-system, and networking software. ::= { iso(1) org(3) dod(6) internet(1) mgmt(2) mib-2(1) system(1) 1 } Cheers, -tt - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Easier way to make nagios send mail as different user?
On 5/28/08 7:50 AM, Drew Weaver wrote: The two ways I've come up with to make nagios send notifications as a different user are: Drew, are you trying to customize the outgoing mail so that it _appears_ the mail is coming from a different user, or so that the mail is actually being emitted by a user other than the one running your nagios daemon? I'm guessing it's the former... Write a wrapper which allows mail to be sent with normal headers No need to write anything - just use a different/better MUA. For example, I use mutt (in mailx mode) in a notification command, something like: /usr/bin/printf $MACROS_YOU_WANT_IN_THE_MESSAGE_BODY |/usr/bin/mutt -x -s $MACROS_YOU_WANT_IN_THE_SUBJECT $CONTACTEMAIL ...and then in a .muttrc for the nagios user, add a 'set from=' to match the pretty name of the user from whom I want the mail to appear to have originated. The bonus with this arrangement is that it gives me the opportunity to tweak additional settings on mail notifications, for example to set custom headers, inside .muttrc: set hdrs my_hdr X-Nagios: alert my_hdr X-Priority: 1 ...etc. Depending on what you're actually shooting for, you may also need to make some adjustments to the local MTA configuration, but if your goal is to change the superficial appearance of the mail, the above might be sufficient. run nagios as a different user ...how would that solve your problem? are there any different options? Yes :) Cheers, -tt - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Problems With proc statament at snmpd.conf
On 5/16/08 9:16 AM, Saulo Augusto Silva wrote: ... proc db2sysc proc snmpd the snmpwalk : snmpwalk -v 2c -c public 127.0.0.1 .1.3.6.1.4.1.2021.2.1 UCD-SNMP-MIB::prIndex.1 = INTEGER: 1 UCD-SNMP-MIB::prIndex.2 = INTEGER: 2 UCD-SNMP-MIB::prNames.1 = STRING: db2sysc UCD-SNMP-MIB::prNames.2 = STRING: snmpd UCD-SNMP-MIB::prMin.1 = INTEGER: 0 UCD-SNMP-MIB::prMin.2 = INTEGER: 0 UCD-SNMP-MIB::prMax.1 = INTEGER: 0 UCD-SNMP-MIB::prMax.2 = INTEGER: 0 UCD-SNMP-MIB::prCount.1 = INTEGER: 0 UCD-SNMP-MIB::prCount.2 = INTEGER: 0 UCD-SNMP-MIB::prErrorFlag.1 = INTEGER: 1 UCD-SNMP-MIB::prErrorFlag.2 = INTEGER: 1 UCD-SNMP-MIB::prErrMessage.1 = STRING: No db2sysc process running. UCD-SNMP-MIB::prErrMessage.2 = STRING: No snmpd process running. UCD-SNMP-MIB::prErrFix.1 = INTEGER: 0 UCD-SNMP-MIB::prErrFix.2 = INTEGER: 0 UCD-SNMP-MIB::prErrFixCmd.1 = STRING: UCD-SNMP-MIB::prErrFixCmd.2 = STRING: The ps -e ps -e |grep db2sysc 9812 pts/000:00:00 db2syscr 9814 pts/000:00:00 db2sysc 9815 pts/000:00:00 db2syscr 9816 pts/000:00:00 db2syscr 9817 pts/000:00:00 db2syscr Greetings Saulo, You didn't mention which OS or net-snmp version you're using Regardless, the default for recent vintages of net-snmp is to find process names by looking at the 'Name' line in /proc/$pid/status, so you'll need to make sure the names in there match the entry in your config. From what I'm seeing, though, you should at least be getting a match on the snmpd process - are you by chance running with SELinux enabled? -tt - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] change management or source control for Nagios configs?
On Apr 10 23:05, Max wrote: Hi, SVN or CVS work very well for this; I personally use SVN. Version control admin directories will not interfere with Nagios parsing configs from directories as it looks for files that end in .cfg (as you pointed out in your post). Terrific way to keep a hot backup of your configs off site and to allow multiple configuration editors to work on files concurrently; you could even tie in a nice project management web interface like Trac for SVN and have a central place for people to easily see what changed when via the web as well as managing requests for changes to Nagios (new service / host monitoring requests etc) using the trouble ticketing features of a system like Trac ... and then their monitoring requests can be tied back to change sets by using the Milestone features of Trac. +1 I'm doing basically what you've described using SVN+Jira. On the Subversion side, I use a pre-commit hook that does the preflight test, to keep bad configs out of the repo (and away from Nagios). Adding an issue number to the commit message associates the commit with an issue; Jira shows the commit summary in the issue detail, with links to the full changes. On the Nagios side, I run a script out of cron (or at will) that checks the repo for updates, cleans out any config files that have been removed, gets the latest rev and reloads. And logs everything :) One thing to keep in mind - this solution works well, but only as long as you're still hand-editing your configs. AFAIK, none of the front-ends support using any kind of SCM tool for the backend, though that would be a welcome feature. Cheers, -tt -- Tom Throckmorton - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] ANNOUNCE: Blosxom4Nagios
On Mar 28 09:27, Aaron M. Segura wrote: On Wed, 2008-03-26 at 16:03 -0400, Tom Throckmorton wrote: On Mar 26 09:00, Gavin Carr wrote: Hi Nagios Users, I've packaged up a special-purpose instance of the 'blosxom' blogging engine as another way of getting nagios notifications via RSS or atom. Announcement post is here, with mentions of existing solutions, some examples of it's slice and dice functionality, and a few screenshots: http://www.openfusion.net/blosxom/blosxom4nagios Well done! I like this one a little better than the others - and fairly painless to get it running. My first question ishow are you getting those spiffy yum/kernel checks? :-D -tt My question is: how do you get that nice nagios-esque red/yellow/green display? After initial install, mine is just black text on white background. I had the same issue at first :) In my case, i'd moved the paths around a bit, and separated the code from the state files (code in /usr/lib/nagios/blosxom, state files in /var/log/nagios/blosxom). I found that I hadn't caught all of the paths in blosxom/config/*, which I discovered in my apache error_log. Once that was fixed, the pretty colors appeared :-D hth, -tt -- Tom Throckmorton OIT - CSI Duke University - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] solaris snmp memory check question - plugin
On Mar 25 09:44, August Simonelli wrote: hi all, not sure if there is a plugins list i could send this to so apologies if it's misdirected. i've been using check_snmp_mem.pl from //www.manubulon.com/nagios/ for remote checking of memory. There is a list, fwiw, for users of this and related plugins, but it appears to be mostly dead... http://sourceforge.net/mailarchive/forum.php?forum_name=nagios-snmp-users ...so you're probably better off asking here. i recently implemented it against solaris 9 and 10 running net-snmp 5.0.9: bash-3.00# /usr/sfw/sbin/snmpd -v NET-SNMP version: 5.0.9 Web: http://www.net-snmp.org/ Email: [EMAIL PROTECTED] unfortunately one of the OIDs doesn't answer: [EMAIL PROTECTED] libexec]# snmpget -v 1 -c mycommunity 1.2.3.4 1.3.6.1.4.1.2021.4.15.0 Error in packet Reason: (noSuchName) There is no such variable name in this MIB. Failed object: UCD-SNMP-MIB::memCached.0 ... I've snmpwalked but don't really understand why this oid fails (something about solaris not using this concept of cached memory?). You're on the right track. Solaris and Linux have different memory management models - Solaris doesn't have the same concept of cached memory as does Linux, at least in a way that net-snmp knows about. See the description for this object from a more recent release of net-snmp: (from http://www.net-snmp.org/docs/mibs/ucdavis.html) | This object will not be implemented on hosts where the | underlying operating system does not explicitly identify | memory as specifically reserved for this purpose. Does anyone know another oid i could query to get this value? Or perhaps share a workaround/solution you may have found? Short of hacking that plugin a bit, you've got a couple of options - you could just check the corresponding objects (or just the ones of concern) directly using check_snmp. Considering what check_snmp_mem actually checks, you could also use check_snmp_storage, from the same author, and only look at Real memory and Swap, e.g.: [EMAIL PROTECTED] $plugins/check_snmp_storage.pl -H 1.2.3.4 -C mycommunity -m '^Swap|^Real' -w 75 -c 90 -S 0,1 -f Note that this is using a different mib (host-resources-mib) than check_snmp_mem (ucd-snmp-mib) and therefore _should_ be available and yield similar results across platforms. There are some caveats to using this mib, but for checking memory/swap usage, it should be fine. Cheers, -tt -- Tom Throckmorton OIT - CSI Duke University - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] ANNOUNCE: Blosxom4Nagios
On Mar 26 09:00, Gavin Carr wrote: Hi Nagios Users, I've packaged up a special-purpose instance of the 'blosxom' blogging engine as another way of getting nagios notifications via RSS or atom. Announcement post is here, with mentions of existing solutions, some examples of it's slice and dice functionality, and a few screenshots: http://www.openfusion.net/blosxom/blosxom4nagios Well done! I like this one a little better than the others - and fairly painless to get it running. My first question ishow are you getting those spiffy yum/kernel checks? :-D -tt -- Tom Throckmorton OIT - CSI Duke University - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios
On Mar 11 18:58, Max wrote: Whoops, Actually, looks like you need to install the Sybase client software for Linux on your Nagios host unless the sybase libraries are available as a standalone package, DBD::Sybase can use either the Sybase libraries (which are provided with Sybase ASE), or FreeTDS (www.freetds.org). Personally, I find the FreeTDS package easier, and much, *much* smaller (596k vs. 200+MB). I've used both, and they seem to be functionally the same, especially for the purposes of this plugin. Cheers, -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring Connection attempts
On Mar 05 09:34, Matthew Macdonald-Wallace wrote: Hi All, Before I start coding my own plugin to do this, does anyone know of a plugin that monitors the number of external connection attempts over a given period of time for a given service and sends alerts accordingly? I've noticed on a number of servers that we maintain recently that there are unauthorised attempts to connect via SSH/FTP. These appear in the log files about 2 seconds apart and are obviously automated. We've got Logcheck in place which alerts us to this kind of thing already, however I like the idea of a nice visual/audible alert (we all use the nagios-plugin for firefox here). Since you already have an investment in Logcheck, you could feed those events directly to Nagios using NSCA. There's an example configuration outlined in this document: https://www2.sans.org/reading_room/whitepapers/logging/198.php If you're looking for something more real-time, you might consider dropping Logcheck in favor of swatch/SEC. Cheers, -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring Connection attempts
On Mar 05 14:22, Michael Schwartzkopff wrote: 1) Better use the logmatch option in the net-snmp configuration. It is quite undocumented but works like a charm. nagios can read these values with check_snmp. Syntax: logmatch name logile interval regex That's a good method, also, if you're just looking for counts of a very general regex. I've found this approach useful when I need to watch a log that isn't already getting fed to the central loghost, or I'm just looking for counters and not content. One caveat of logmatch is that some values are reset upon log rotation (logMatchCurrentCounter) or reading (logMatchCounter) - it's a feature :-D -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] verify CIFS/SMB protocol negotiation?
On Feb 25 17:51, Steve Huff wrote: Hello folks! I'm monitoring several Samba servers (SuSE OES) that suffer from a debilitating problem: periodically their authentication cache stops working, and as a result none of the services that depend on that authentication cache, including Samba, respond to requests. I'm currently using the check_disk_smb plugin to verify that a test user can establish a CIFS connection (and also to monitor disk usage, but that's secondary). Unfortunately, when the abovementioned problem occurs, check_disk_smb fails not CRITICAL, but UNKNOWN; this is because the request times out instead of being rejected, since the server is still listening on the port, but the SMB negotiation never completes. Hi Steve, You could modify check_disk_smb so that it outputs CRITICAL instead of UNKNOWN on a timeout - looks pretty straightforward to do so. Alternately, you could configure those services to notify on UNKNOWN. I thought I would try to add an additional layer of testing by using check_tcp's ability to send a string and expect a string in return; unfortunately, it looks like SMB doesn't talk using human-readable strings (or at least it looked that way to me). check_tcp always succeeds, even when check_disk_smb will time out and fail. Anybody else run into a similar problem? Any suggestions for another way to implement the extra later of testing, or any other ideas? Anything in your smb.log when the service dies? Are the processes zombied? hth, -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] using fping to find missing A records
On Feb 13 18:05, Roger wrote: I have a specific question on how to do something, and a larger question addressing other better ways to do the spirit of what I'm asking. I'm setting up Nagios, and am trying to make sure that all of the subnets have A records, as IP addresses will be changing very, very rapidly, and when that day comes, I will not be here (but on another Nagios project). So, I'd like to create A records for each host. ...so, are you trying to make sure that for every name you have an A record, or for every A you have a PTR (i.e. a name that corresponds to the numeric)? It sounds like the latter, since you're thinking of doing this by looking at numbers... How do I fping -a a range of IP addresses, strip out the names of the name that the host replies with, and then ping that list of names? (You don't have to tell me how exactly, a general idea of which command line tools will do) I'm thinking something like: fping -g 192.168.1.0 192.168.1.255 -A | grep whatever file, and then shove that file in another fping statement. If I understand correctly what you're shooting for, it can be done in one step: [EMAIL PROTECTED] fping -ang 192.168.1.0/24 ...would return - all hosts that are alive - by name (if one exists) - for a generated range (192.168.1.0..192.168.1.255) If you want to only show hosts for which there is no PTR record, just 'grep -v' your target domain(s) from the output. -tt p.s. Hopefully, none of those hosts in that range are blocking your pings :) -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] ndo2db database query
On Jan 31 18:51, Tim Kay wrote: Hello list, I am trying to query the nagios database entries generated by ndo2db (Nagios 3.0rc1/ndoutils 1.47b) to pull out only those entries in the servicestatus table that are in current_state != 0 and whose contactgroup is testing. I have a contact group testing and that contact group is set up in the service template for (at the moment) all services. I can see the contact group in the SQL contactgroups table and I can see the members of the contactgroup in the SQL contactgroup_members table but there are no entries in the service_contacts table. Is this a config error on my part? I don't think so - that table as well as host_contacts is empty for me also, on 2.10/ndoutils 1.4b6 - I'm wondering if those tables just aren't getting updated as they should... I have: define contactgroup{ deletia } the testcontact is notified when the load on server1 exceeds the warn / critical limits but I have no way of associating the contact with the service just through querying the database Just to be clear - you're just trying to validate a contact for a given service in a given state, right? I'm using something similar to show effective contact groups for hosts; you'll have to do a few JOINS to get what you want. Have a look at the sample queries that came w/ ndoutils for some good examples. -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] flap_detection_enabled persistence acrossreload/restart?
On Jan 23 17:35, Marc Powell wrote: Before I dig further, can anyone verify that the state of host or service flap detection (enabled/disabled) should or shouldn't be persistent across a reload/restart? I've combed the docs and archives, and am coming up dry. The retention file should take precedence. Thanks, that's what I thought - it appears to for all but the flapping options. I should add that this applies to the global flap detection option as well. Would you (or anyone) mind testing this on your own installation: - find a host/service for which flapping is enabled - disable flap detection for that host/service; wait until that state is reflected in the extended info - reload the configs - check the state of flap detection - has it reverted to enabled? Not all directives can be overridden by the retention file but flap_detection_enabled is one of them (http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#service). I'm not sure what you mean here - which directives _can't_ be overridden by the retention file? Your configuration snippets look accurate, assuming they're applied as you indicate. They were cut and pasted from live configs, so yeah. Cheers, -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] flap_detection_enabled persistenceacrossreload/restart?
On Jan 24 09:42, Marc Powell wrote: -Original Message- From: [EMAIL PROTECTED] [mailto:nagios-users- [EMAIL PROTECTED] On Behalf Of Tom Throckmorton Sent: Thursday, January 24, 2008 8:12 AM To: Marc Powell Cc: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] flap_detection_enabled persistenceacrossreload/restart? Would you (or anyone) mind testing this on your own installation: - find a host/service for which flapping is enabled - disable flap detection for that host/service; wait until that state is reflected in the extended info - reload the configs - check the state of flap detection - has it reverted to enabled? I can confirm that it ignores the retention file setting with nagios-2.7. I don't have a more recent version up to test with. Thanks. Bummer - I'm still trying to find something that points to this being intentional. I'll ask on nagios-devel... Not all directives can be overridden by the retention file but flap_detection_enabled is one of them (http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#service). I'm not sure what you mean here - which directives _can't_ be overridden by the retention file? Any that aren't marked with a *. Click the * or read the Retention Notes at the top of the page. Oh, duh. *kicks self in head* http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#retention_notes Your configuration snippets look accurate, assuming they're applied as you indicate. They were cut and pasted from live configs, so yeah. I meant that the template really was applied to the hosts/services. Well, assuming any status data that gets fed to the state files and via the NEB is accurate, then my configs match my live settings. I'm verifying by seeing what ends up in the status_file and state_retention_file, as well as the NDO-fed database. -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NDO Troubles
On Jan 23 11:00, Giles Coochey wrote: Have you tried the advice of someone else to give InnoDB loads of memory? I would like to try that, but have no idea as to how to do it! Perhaps someone can advise? Have a look at the sample mysql configs that come w/ the mysql-server package (/usr/share/doc/mysql-server-version, on Red Hat and friends, at least). You could use/adapt one of the configs intended for larger environments, depending on your configuration. If nothing else, that's a good starting point. I'd also recommend checking out http://www.mysqlperformanceblog.com/, particularly the section on performance presentations. Good stuff. -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] flap_detection_enabled persistence across reload/restart?
Nagios 2.10, CentOS 5 x86_64 / i386, NDOUtils 1.4b6 Hi all, I've noticed something odd wrt flapping, and am wondering if I'm overlooking something simple, or just misunderstanding the way flapping and state retention are supposed to work. First, just to verify, my main nagios config has: enable_flap_detection=1 retain_state_information=1 state_retention_file=/var/log/nagios/retention.dat retention_update_interval=1,5,60 or even 0, doesn't matter use_retained_program_state=1 use_retained_scheduling_info=1 In my global host and service templates that all hosts use, I have: flap_detection_enabled 1 retain_status_information 1 retain_nonstatus_information1 I've also disabled flap detection explicitly for a few hosts, and it all works as advertised. However, for any host/service, if I manually disable/enable flap detection via an external command / cgi, and then reload or restart, the option reverts to whatever is in the config for that given host/service. This happens regardless of whether 1) the option is being set in retention.dat, which it is, 2) the NDO-fed database thinks flapping for this host/service is disabled, or 3) w/out the broker_module enabled. Other state options are being preserved (such as active_checks_enabled) both at retention_update_interval and on a reload. So, what I think I'm seeing then is that during a reload, the flap_detection_enabled option isn't getting read from the retention file at program start, though I'd expect it to be persistent just as the other options are. FWIW, same behavior on earlier Nagios 2.x releases, but 1.x seems to do the right thing. Before I dig further, can anyone verify that the state of host or service flap detection (enabled/disabled) should or shouldn't be persistent across a reload/restart? I've combed the docs and archives, and am coming up dry. Thanks, -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_snmp_storage.pl returns Put snmp login info!
On Jan 18 18:44, Tom Throckmorton wrote: ...you have an unnecessary trailing '!' on the check_command, which I think will be silently ignored, based on your command_line def), but I can't recall whether or not spaces in the ARGn macros are legal (I think not, but the docs don't specify and I haven't tested). Just to follow up, I did test, and they're legal, so you can ignore my previous post about adjusting the format of your command defs. -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] ndoutil question
On 01/18/2008 10:26 AM, Alex Dehaini wrote: Guys, Any other ideas on this. This is the output I get when I try to start NDO snmp:/# /usr/local/nagios/bin/ndo2db -c /usr/local/nagios/etc/ndo2db.cfg Support for the specified database server is either not yet supported, or was not found on your system. Silly question: What is db_servertype set to in your ndo2db.cfg? Does it match the name of the mysql package? -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_snmp_storage.pl returns Put snmp login info!
On 01/18/2008 04:24 PM, Bret Goodfellow wrote: I have the snmpd running on a client box. I have nagois monitoring this box without any issues. I decided to implement a new plugin I found, check_snmp_storage.pl. From the nagios server I can issue this command to the client box, and I get a valid response, e.g. ... My command is defined as: define command{ command_namecheck_snmp_storage command_line$USER$/check_snmp_storage.pl -H $HOSTADDRESS$ $ARG1$ $USER$ won't get you much, I don't think. Maybe you meant $USER1$ ? -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_snmp_storage.pl returns Put snmp login info!
On 01/18/2008 05:39 PM, Bret Goodfellow wrote: Here's what I get: ./check_snmp_storage.pl -H somehost -C community -m /tmp -w 95 -c 98 -f -v Alarm at 15 SNMP v1 login Filter : /tmp . . . /tmp: 6%used(221MB/3937MB) (95%) : OK Reading the check_snmp_storage code again, I see that you'll get that error if you haven't specified a community. Looking at your check_command in the service definition: check_commandcheck_snmp_storage!-C public -m /temp -w 95 -c 98 -f! ...you have an unnecessary trailing '!' on the check_command, which I think will be silently ignored, based on your command_line def), but I can't recall whether or not spaces in the ARGn macros are legal (I think not, but the docs don't specify and I haven't tested). The usual form is something like: In your checkcommand def: .. command_name check_foo command_line $USERn$/check_foo.pl -H $HOSTADDRESS$ $ARG1$ $ARG2$ $ARG3$ ... In your service def: ... host_name somehost check_command check_foo!arg1value!arg2value!arg3value ... Can you try formatting your commands as such? -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_snmp_storage.pl returns Put snmp login info!
On 01/18/2008 05:07 PM, Bret Goodfellow wrote: It actually is $USER1$. I just typed it incorrectly in this email. Thanks OK. That error looks vaguely like an SNMPv3 login failure. What do you see when you run the plugin from the command-line with a '-v' at the end? It should show you the SNMP version: ./check_snmp_storage.pl -H somehost -C community -m /foo -w 95 -c 98 -f -v Alarm at 15 SNMP v1 login Filter : /foo OID : 1.3.6.1.2.1.25.2.3.1.3.3, Desc : Swap Space OID : 1.3.6.1.2.1.25.2.3.1.3.8, Desc : /proc/sys/fs/binfmt_misc OID : 1.3.6.1.2.1.25.2.3.1.3.2, Desc : Real Memory ... etc. -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] new plugin
On 01/14/2008 11:22 AM, [EMAIL PROTECTED] wrote: I've just written a script that checks to see if the ndo2db daemon is running and if it is not, deletes the sock file and restarts it. This is how it runs in our current setup. It's just a simple shell script, but it gets the job done. Thanks for sharing and posting your script. You might also consider using check_procs + an eventhandler, which would give you the advantage of having some additional flexibility around how and when the process is restarted. Out of curiosity, are you actually seeing ndo2db _die_, or just stop working? -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NDOUtils
On 01/17/2008 12:12 PM, Matthias Kloth wrote: Tom Throckmorton schrieb: | On Jan 11 08:51, [EMAIL PROTECTED] wrote: | Has anyone got a plugin or script they use to check the status of the | ndo2db daemon? I've been looking on nagiosexchange and through google | and am yet to find one. I'm not much of a programmer, so I really can't | right one up myself or I would. Any help on this issue would be wonderful. | | I'm using check_mysql_query to check that db updates aren't stale: | | check_mysql_query -H naghost -d nagios_db -u readonlyuser -p somepass \ | -q SELECT NOW() - status_update_time FROM nagios_programstatus -w 120 -c 300 | | If ndo2db is dead/failed to start, or there are other issues (corrupt tables), | I believe this will catch it. You could also do something like watch the nagios.log | for the dreaded unable to connect to data sink message, or check that the | ndo2db process is running, though I'm finding that checking the db for | freshness is a good indicator. | | -tt | | There is a plugin that checks for the ndo2db daemon called . You can find it on http://www.nagiosexchange.org/Check_Plugins.21.0.html?tx_netnagext_pi1[p_view]=1203. Thanks, Matthias. I saw svalding's later post with the shell script. -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NDOUtils
On Jan 11 08:51, [EMAIL PROTECTED] wrote: Has anyone got a plugin or script they use to check the status of the ndo2db daemon? I've been looking on nagiosexchange and through google and am yet to find one. I'm not much of a programmer, so I really can't right one up myself or I would. Any help on this issue would be wonderful. I'm using check_mysql_query to check that db updates aren't stale: check_mysql_query -H naghost -d nagios_db -u readonlyuser -p somepass \ -q SELECT NOW() - status_update_time FROM nagios_programstatus -w 120 -c 300 If ndo2db is dead/failed to start, or there are other issues (corrupt tables), I believe this will catch it. You could also do something like watch the nagios.log for the dreaded unable to connect to data sink message, or check that the ndo2db process is running, though I'm finding that checking the db for freshness is a good indicator. -tt -- Tom Throckmorton OIT - CSI Duke University - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] best solution for configuration changes
On 01/03/2008 12:44 PM, Max wrote: On the Nagios server side you could have a cron job that updated the configs every N minutes or even a sub-version commit trigger that would run a nagios -v /path/nagios.cfg check on your configuration and either 1) restart Nagios if the configuration is good or 2) alert you if someone has broken the configs. ...or, do the pre-flight check using a pre-commit hook, so that the broken configs never make it to the nagios server in the first place ;-) -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Slow Nagios reloads with NDOUtils
On 11/16/2007 04:23 PM, [EMAIL PROTECTED] wrote: My first problem, and I am not sure it is actually a problem, is that when I do a reload of nagios (/etc/init.d/nagios reload) it takes, what seems to me to be, a long time. It is usually around 90-120 seconds for Nagios to start allowing use of the web interface once the reload is initiated. Mark, I encountered the same issue, spent a while poking at NDO and found that tuning down the table trimming options in ndo2db.cfg helped reduce the time of this symptom by a third or more, depending on how short of an age I chose. This also cut the user cpu time (taken up by mysqld) by about half, as well as the storage and I/O overhead. YMMV, of course, but I found that at this point I don't need to keep the data in those very busy tables longer than an hour or so - it'll be *very* useful of course when I have an interface that can use that data, but for now, keeping it for only a short period is fine. IANA db expert, but from what I can tell and as Ton suggests, the table indexing doesn't appear to be optimal - I see a high handler_read_rnd_next count, which usually indicates unnecessary table scans). As far as this particular symptom goes, though, I think it might be the indexing in conjunction with the re-sending of retained status data that's the culprit here (see the blurb on 'Do not resend retained status to NDO' on http://altinity.blogs.com/dotorg/2007/09/nagios-patch-da.html) We are monitoring: # Active Host / Service Checks:272 / 329 This doesn't seem like enough to bog down a Dell PowerEdge 1855 with 2GB RAM onboard. For comparison, I'm running one of our hosts (Nagios 2.10, ndoutils 1.4b6, MySQL 5.0.22 on CentOS 5, active hosts/services 504/2771), on a PowerEdge 1850 w/ 2GB :) Without any ndo2db tuning, I was seeing around a 30-second wait while the status.dat was being recreated (presumably waiting on ndomod). After tuning, I was able to drop that time from ~30 seconds to between 6-8. One caveat - if you do choose to trim those ages, you'll note a longer pause after the next Nagios restart, as NDO goes about dropping older rows. Once that's complete, be sure to optimize the relevant tables to reclaim the unused space. HTH, -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] putting limits on check_by_ssh
On 11/16/2007 01:06 PM, Dave wrote: I've been RTFMing SSH. For background authentication like nagios uses, the book I'm reading recommends using the user's ssh config file to limit using passwordless keys to just do one task each. So if you want nagios to be able to do 3 kinds of checks without a password, you put 3 keys in nagios .ssh/authorized_keys file with command= stuff for each. Then no matter what nagios thinks it is asking for (parameter of check_by_ssh) it gets whatever is configured for that key. I'm just wondering if anyone has taken this approach. It seems a bit complicated, spreading some of the nagios config info around to each monitored system, but it sort of appeals to me. Then you know that even if someone manages to get your key, all they can do is check_disk or something else boring. I do something similar, though also add a 'from' restriction, in the event the private key is compromised - here are a few examples: For remote nagios instance monitoring: from=host-01.full.domain,command=/usr/lib/nagios/plugins/check_nagios -F /dev/shm/status.log -e 5 -C nagios ssh-dss ... from=host-02.full.domain,command=/usr/lib/nagios/plugins/check_nagios -F /dev/shm/status.log -e 5 -C nagios ssh-dss ... *dons protective aluminum foil headgear* In the the keys on _some_ systems, I also add: no-port-forwarding no-X11-forwarding no-agent-forwarding I usually only allow a single command per host - on hosts which I want to execute multiple commands, rather than having a keypair-per-command, I make the command a script which sanitizes the input and checks the command against a list of predetermined allowed commands. Probably not as safe as keypairs-per-command because it's not as explicit, but more flexible/manageable in certain situations. Am I missing something? In addition to limiting key authentication to doing specific tasks, I also put an '*' in the nagios user's password field in /etc/passwd, which prevents them from logging in by password. For the nagios user (and other system-ish accounts), setting the shell to /sbin/nologin should do the trick. -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios - SNMP v3
On Aug 14 12:18, Lopez, Denise wrote: I am in the process of rebuilding our Nagios server from RHEL AS 4 to RHEL 5. The version of net-snmp on the older server is net-snmp 5.1.2 and the build for RHEL 5 is net-snmp 5.3.1. I can get SNMP version 2 working on the new server but I keep getting 'No response from Host' when query using the same credentials as I am using for the old server that is getting a response back. We have verified that all ACL's are allowing the traffic and there is no ACL on the device I am querying that would be blocking the response. We also observed that the traffic pattern from the older server is different from the newer server. I can see a one for one packet response from the older server to the device but from the newer server there is about 6 requests and finally we see a packet response from the device. I was just wondering if net-snmp version 3 works differently in the newer version and how I can resolve this issue? Are you having a problem with check_snmp, or SNMP in general? It's likely this bug in net-snmp: https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=251332 https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=228066 The fc6 patch works on rhel5, so you could wait for the upstream patch, or rebuild with the patch from fc6. -tt -- Tom Throckmorton OIT - CSI Duke University - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null