Re: [Nagios-users] service checks could not be rescheduled properly.
Julian Grunnell | Unix Analyst, Infrastructure | TD Direct Investing T: +44 (0) 113 346 2824 | M: +44 (0) 7889 352527 Andreas Ericsson a...@op5.se 16/03/2012 10:14 Please respond to Nagios Users List nagios-users@lists.sourceforge.net To Nagios Users List nagios-users@lists.sourceforge.net cc julian_grunn...@tdwh.co.uk Subject Re: [Nagios-users] service checks could not be rescheduled properly. On 03/15/2012 11:31 AM, julian_grunn...@tdwh.co.uk wrote: Anyone ... thought I'd go back to this. Does anyone have any ideas why I would get the following in the Nagios logs: nagios-03-10-2012-00.log:[1331336760] Warning: Check of service 'DEAL SERVER SERVICE TCP 4099' on host 'TDUKUBS01' could not be rescheduled properly. Scheduling check for next week... nagios-03-10-2012-00.log:[1331336760] Warning: Check of service 'DEAL SERVER SERVICE TCP 4099' on host 'TDUKUBS02' could not be rescheduled properly. Scheduling check for next week... As mentioed below, I've checked the config multiple times, I run 1100+ service checks across 100+ hosts and just these two are causing a problem - they differ in that they have specific time periods defined - all detailed below. I've checked the various reports that NTP is at fault but it made no difference at all. define timeperiod{ timeperiod_name ubs4099hours alias UBS 4099 Dealserver Monitoring Hours monday 23:46-20:30 tuesday 23:46-20:30 wednesday 23:46-20:30 thursday 23:46-20:30 friday 23:46-20:30 } This timeperiod isn't valid. FROM 23:46 TO 20:30 on the same date, there is no time for any checks to be executed in. 00:00-20:30,23:46-00:00 should work better, unless you meant 20:30-23:46, but I guess you wouldn't have screwed it up if that's what you intended. hth -- Thanks Andreas - well your right, changed my times as above and the scheduling now works. tbh no idea why I thought it would work how I had it just didn't occur to me. So thanks again - happy now. J. --- Consider the environment: Please don't print this e-mail unless you really need to. Confidentiality: This email and its attachments are intended for the above named only and may be confidential. If they have come to you in error you must take no action based on them, nor must you copy or show them to anyone; please reply to this email and highlight the error. Viruses: Although we have taken steps to ensure that this email and attachments are free from any virus, we advise that in keeping with good computing practice the recipient should ensure that they are actually virus-free. Brokerage services provided by TD Direct Investing (Europe) Limited (a subsidiary of The Toronto-Dominion Bank). Authorised and regulated by the Financial Services Authority (FSA registered number 141282), member of the London Stock Exchange and the PLUS market. Incorporated in England and Wales under registration number 2101863. Registered office: Exchange Court, Duncombe Street, Leeds LS1 4AX. Banking services provided by TD Bank N.V. authorised and regulated by De Nederlandsche Bank and the Financial Services Authority for UK Business (FSA registered number 216791). Incorporated in the Netherlands and registered as a branch in England and Wales under branch registration number BR006780. -- This SF email is sponsosred by: Try Windows Azure free for 90 days Click Here http://p.sf.net/sfu/sfd2d-msazure___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] notes field missing in hostgroups table in ndoutils
I am trying to build a custom component for nagios and I need to access the notes field from the hostgroup definitions, but when I make a backend call to the function get_xml_hostgroup_objects($args=null) found in /usr/local/nagiosxi/html/includes/utils-objects.inc.php, the notes field is not returned. I tried to modify the code to request the notes field, but the result was always blank. Upon further inspection I found that the notes field does not appear to even make it into the ndoutils database at all (see copy of mysql output below). Is there a way I can reconfigure nagios to put that information in the database for later retrieval? mysql use nagios; Reading table information for completion of table and column names You can turn off this feature to get a quicker startup with -A Database changed mysql desc nagios_hostgroups; +-+--+--+-+-++ | Field | Type | Null | Key | Default | Extra | +-+--+--+-+-++ | hostgroup_id| int(11) | NO | PRI | NULL| auto_increment | | instance_id | smallint(6) | NO | MUL | 0 || | config_type | smallint(6) | NO | | 0 || | hostgroup_object_id | int(11) | NO | | 0 || | alias | varchar(255) | NO | | || +-+--+--+-+-++ 5 rows in set (0.00 sec) mysql -- Keith S. Morgan Remote Monitoring Unit Safeguards, IAEA Vienna, Austria Office: +43 1 2600 26672 Handy: +43 699 165 26672 k.mor...@iaea.orgmailto:k.mor...@iaea.org This email message is intended only for the use of the named recipient. Information contained in this email message and its attachments may be privileged, confidential and protected from disclosure. If you are not the intended recipient, please do not read, copy, use or disclose this communication to others. Also please notify the sender by replying to this message and then delete it from your system. -- This SF email is sponsosred by: Try Windows Azure free for 90 days Click Here http://p.sf.net/sfu/sfd2d-msazure___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Root_partition check not reading correctly
I am not sure how I am launching the service, which I admit is bad, but I inherited the setup. I am not sure if it ever worked properly, because it took a catastrophic rampant application running amok, spewing 9GB files to clue me in. I am running nagios 3.3.1 and nagios-plugins 1.4.15 the application is check_local_disk but I think that must have been a check from the previous nagios the one in libexec is check_disk so I tried [code] define service{ use local-service host_name LTS-MASTERKEY-000 service_description Root Partition check_command check_disk -w 20% -c 10% --path=/ notifications_enabled 1 max_check_attempts 3 check_interval 5 retry_interval 3 check_period24x7 notification_interval 15 notification_period 24x7 notification_optionsw,c,r contact_groups admins } [/code] and nagios will not start - says there is a configuration error On Fri, Mar 9, 2012 at 7:49 AM, Claudio Kuenzler c...@claudiokuenzler.com wrote: Please show the service definition. How do you launch the check? By ssh, by nrpe? Seems you're using the same IP address or dns name as the hostname value. Can you verify this? On Fri, Mar 9, 2012 at 1:19 PM, Wolf Halton wolf.hal...@gmail.com wrote: All my machines show a similar output, regardless of how much is available on their root partitions. Root Partition OK 03-09-2012 07:11:08 28d 22h 18m 15s 1/3 DISK OK - free space: / 15903 MB (86% inode=93%): Up to and including ones that are 100% full. No alarms - ever. Is a client app needed on the monitored clients that has not been mentioned? -Wolf -- This Apt Has Super Cow Powers - http://sourcefreedom.com Advancing Libraries Together - http://LYRASIS.org -- Virtualization Cloud Management Using Capacity Planning Cloud computing makes use of virtualization - but cloud computing also focuses on allowing computing to be delivered as a service. http://www.accelacomm.com/jaw/sfnl/114/51521223/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Virtualization Cloud Management Using Capacity Planning Cloud computing makes use of virtualization - but cloud computing also focuses on allowing computing to be delivered as a service. http://www.accelacomm.com/jaw/sfnl/114/51521223/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- This Apt Has Super Cow Powers - http://sourcefreedom.com Advancing Libraries Together - http://LYRASIS.org -- This SF email is sponsosred by: Try Windows Azure free for 90 days Click Here http://p.sf.net/sfu/sfd2d-msazure ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Root_partition check not reading correctly
I found it check_command check_disk! -w 20% -c 10% --path=/ It was missing the ! Thanks for helping me sort it out. Wolf On Mon, Mar 19, 2012 at 10:37 AM, Wolf Halton wolf.hal...@gmail.com wrote: I am not sure how I am launching the service, which I admit is bad, but I inherited the setup. I am not sure if it ever worked properly, because it took a catastrophic rampant application running amok, spewing 9GB files to clue me in. I am running nagios 3.3.1 and nagios-plugins 1.4.15 the application is check_local_disk but I think that must have been a check from the previous nagios the one in libexec is check_disk so I tried [code] define service{ use local-service host_name LTS-MASTERKEY-000 service_description Root Partition check_command check_disk -w 20% -c 10% --path=/ notifications_enabled 1 max_check_attempts 3 check_interval 5 retry_interval 3 check_period 24x7 notification_interval 15 notification_period 24x7 notification_options w,c,r contact_groups admins } [/code] and nagios will not start - says there is a configuration error On Fri, Mar 9, 2012 at 7:49 AM, Claudio Kuenzler c...@claudiokuenzler.com wrote: Please show the service definition. How do you launch the check? By ssh, by nrpe? Seems you're using the same IP address or dns name as the hostname value. Can you verify this? On Fri, Mar 9, 2012 at 1:19 PM, Wolf Halton wolf.hal...@gmail.com wrote: All my machines show a similar output, regardless of how much is available on their root partitions. Root Partition OK 03-09-2012 07:11:08 28d 22h 18m 15s 1/3 DISK OK - free space: / 15903 MB (86% inode=93%): Up to and including ones that are 100% full. No alarms - ever. Is a client app needed on the monitored clients that has not been mentioned? -Wolf -- This Apt Has Super Cow Powers - http://sourcefreedom.com Advancing Libraries Together - http://LYRASIS.org -- Virtualization Cloud Management Using Capacity Planning Cloud computing makes use of virtualization - but cloud computing also focuses on allowing computing to be delivered as a service. http://www.accelacomm.com/jaw/sfnl/114/51521223/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Virtualization Cloud Management Using Capacity Planning Cloud computing makes use of virtualization - but cloud computing also focuses on allowing computing to be delivered as a service. http://www.accelacomm.com/jaw/sfnl/114/51521223/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- This Apt Has Super Cow Powers - http://sourcefreedom.com Advancing Libraries Together - http://LYRASIS.org -- This Apt Has Super Cow Powers - http://sourcefreedom.com Advancing Libraries Together - http://LYRASIS.org -- This SF email is sponsosred by: Try Windows Azure free for 90 days Click Here http://p.sf.net/sfu/sfd2d-msazure ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Root_partition check not reading correctly
More stuff I am in commands.cfg and added check_disk as a command to check discks on the remote server as well as check_local_disk, which I understand to be about checking the nagios server disk. [code] define command{ command_namecheck_local_disk command_line$USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$ } define command{ command_namecheck_disk command_line$USER1$/check_disc -w $ARGS$ -c $ARGS$ -p $ARGS$ #command_line $USER1$/check_disc } The front-end error is now (Return code of 127 is out of bounds - plugin may be missing) On Mon, Mar 19, 2012 at 11:27 AM, Wolf Halton wolf.hal...@gmail.com wrote: I found it check_command check_disk! -w 20% -c 10% --path=/ It was missing the ! Thanks for helping me sort it out. Wolf On Mon, Mar 19, 2012 at 10:37 AM, Wolf Halton wolf.hal...@gmail.com wrote: I am not sure how I am launching the service, which I admit is bad, but I inherited the setup. I am not sure if it ever worked properly, because it took a catastrophic rampant application running amok, spewing 9GB files to clue me in. I am running nagios 3.3.1 and nagios-plugins 1.4.15 the application is check_local_disk but I think that must have been a check from the previous nagios the one in libexec is check_disk so I tried [code] define service{ use local-service host_name LTS-MASTERKEY-000 service_description Root Partition check_command check_disk -w 20% -c 10% --path=/ notifications_enabled 1 max_check_attempts 3 check_interval 5 retry_interval 3 check_period 24x7 notification_interval 15 notification_period 24x7 notification_options w,c,r contact_groups admins } [/code] and nagios will not start - says there is a configuration error On Fri, Mar 9, 2012 at 7:49 AM, Claudio Kuenzler c...@claudiokuenzler.com wrote: Please show the service definition. How do you launch the check? By ssh, by nrpe? Seems you're using the same IP address or dns name as the hostname value. Can you verify this? On Fri, Mar 9, 2012 at 1:19 PM, Wolf Halton wolf.hal...@gmail.com wrote: All my machines show a similar output, regardless of how much is available on their root partitions. Root Partition OK 03-09-2012 07:11:08 28d 22h 18m 15s 1/3 DISK OK - free space: / 15903 MB (86% inode=93%): Up to and including ones that are 100% full. No alarms - ever. Is a client app needed on the monitored clients that has not been mentioned? -Wolf -- This Apt Has Super Cow Powers - http://sourcefreedom.com Advancing Libraries Together - http://LYRASIS.org -- Virtualization Cloud Management Using Capacity Planning Cloud computing makes use of virtualization - but cloud computing also focuses on allowing computing to be delivered as a service. http://www.accelacomm.com/jaw/sfnl/114/51521223/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Virtualization Cloud Management Using Capacity Planning Cloud computing makes use of virtualization - but cloud computing also focuses on allowing computing to be delivered as a service. http://www.accelacomm.com/jaw/sfnl/114/51521223/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- This Apt Has Super Cow Powers - http://sourcefreedom.com Advancing Libraries Together - http://LYRASIS.org -- This Apt Has Super Cow Powers - http://sourcefreedom.com Advancing Libraries Together - http://LYRASIS.org -- This Apt Has Super Cow Powers - http://sourcefreedom.com Advancing Libraries Together - http://LYRASIS.org -- This SF email is sponsosred by: Try Windows Azure free for 90 days Click Here http://p.sf.net/sfu/sfd2d-msazure ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include
Re: [Nagios-users] Root_partition check not reading correctly
Check out the command lines and the way you are spelling disk/disc as you have 2 different spellings. command_line$USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$ command_line$USER1$/check_disc -w $ARGS$ -c $ARGS$ -p $ARGS$ Check the actual name of the file in the libexec folder. Also what is $ARGS$ shouldn’t it be a $ARGnumber$ as you correctly have in the top example? -Original Message- From: Wolf Halton [mailto:wolf.hal...@gmail.com] Sent: 19 March 2012 16:50 To: Nagios Users List Subject: Re: [Nagios-users] Root_partition check not reading correctly More stuff I am in commands.cfg and added check_disk as a command to check discks on the remote server as well as check_local_disk, which I understand to be about checking the nagios server disk. [code] define command{ command_namecheck_local_disk command_line$USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$ } define command{ command_namecheck_disk command_line$USER1$/check_disc -w $ARGS$ -c $ARGS$ -p $ARGS$ #command_line $USER1$/check_disc } The front-end error is now (Return code of 127 is out of bounds - plugin may be missing) On Mon, Mar 19, 2012 at 11:27 AM, Wolf Halton wolf.hal...@gmail.com wrote: I found it check_command check_disk! -w 20% -c 10% --path=/ It was missing the ! Thanks for helping me sort it out. Wolf On Mon, Mar 19, 2012 at 10:37 AM, Wolf Halton wolf.hal...@gmail.com wrote: I am not sure how I am launching the service, which I admit is bad, but I inherited the setup. I am not sure if it ever worked properly, because it took a catastrophic rampant application running amok, spewing 9GB files to clue me in. I am running nagios 3.3.1 and nagios-plugins 1.4.15 the application is check_local_disk but I think that must have been a check from the previous nagios the one in libexec is check_disk so I tried [code] define service{ use local-service host_name LTS-MASTERKEY-000 service_description Root Partition check_command check_disk -w 20% -c 10% --path=/ notifications_enabled 1 max_check_attempts 3 check_interval 5 retry_interval 3 check_period 24x7 notification_interval 15 notification_period 24x7 notification_options w,c,r contact_groups admins } [/code] and nagios will not start - says there is a configuration error On Fri, Mar 9, 2012 at 7:49 AM, Claudio Kuenzler c...@claudiokuenzler.com wrote: Please show the service definition. How do you launch the check? By ssh, by nrpe? Seems you're using the same IP address or dns name as the hostname value. Can you verify this? On Fri, Mar 9, 2012 at 1:19 PM, Wolf Halton wolf.hal...@gmail.com wrote: All my machines show a similar output, regardless of how much is available on their root partitions. Root Partition OK 03-09-2012 07:11:08 28d 22h 18m 15s 1/3 DISK OK - free space: / 15903 MB (86% inode=93%): Up to and including ones that are 100% full. No alarms - ever. Is a client app needed on the monitored clients that has not been mentioned? -Wolf -- This Apt Has Super Cow Powers - http://sourcefreedom.com Advancing Libraries Together - http://LYRASIS.org --- --- Virtualization Cloud Management Using Capacity Planning Cloud computing makes use of virtualization - but cloud computing also focuses on allowing computing to be delivered as a service. http://www.accelacomm.com/jaw/sfnl/114/51521223/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Virtualization Cloud Management Using Capacity Planning Cloud computing makes use of virtualization - but cloud computing also focuses on allowing computing to be delivered as a service. http://www.accelacomm.com/jaw/sfnl/114/51521223/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- This Apt Has Super Cow Powers - http://sourcefreedom.com Advancing Libraries Together - http://LYRASIS.org
Re: [Nagios-users] Root_partition check not reading correctly
I checked that, and I had made that error, however when I fixed that error, I still have a reading only from the nagios server rather than the masterkey server --- In my spare time, I found check_by_ssh and added that to the commands.cfg and its's counterpart lines in my service definitions for that server. The error there is that the -H $HOSTNAME$ that is suggested in check_by_ssh is wanting a FQDN or an IP address. It says the hostname is invalid (and it is) Remote Root PartitionUNKNOWN03-19-2012 14:58:39 0d 0h 6m 51s 3/3 check_by_ssh: Invalid hostname/address - LTS-MASTERKEY-000 I changed $HOSTNAME$ to $HOSTADDRESS$ which gives me an access denied error, so at least the system is believing in the HOSTADDRESS Remote Root Partition UNKNOWN 03-19-2012 15:16:56 0d 0h 27m 11s 3/3 Remote command execution failed: Permission denied, please try again. Running direct gets me a real answer: # /usr/local/nagios/libexec/check_by_ssh -H 192.168.10.160 /nagios/check_diskfree.sh sda2 70 90 OK. Free Space: 24GB, 95% What user is the thing expecting? On Mon, Mar 19, 2012 at 1:31 PM, Andrew Thompson and...@fulgent.co.uk wrote: Check out the command lines and the way you are spelling disk/disc as you have 2 different spellings. command_line $USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$ command_line $USER1$/check_disc -w $ARGS$ -c $ARGS$ -p $ARGS$ Check the actual name of the file in the libexec folder. Also what is $ARGS$ shouldn’t it be a $ARGnumber$ as you correctly have in the top example? -Original Message- From: Wolf Halton [mailto:wolf.hal...@gmail.com] Sent: 19 March 2012 16:50 To: Nagios Users List Subject: Re: [Nagios-users] Root_partition check not reading correctly More stuff I am in commands.cfg and added check_disk as a command to check discks on the remote server as well as check_local_disk, which I understand to be about checking the nagios server disk. [code] define command{ command_name check_local_disk command_line $USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$ } define command{ command_name check_disk command_line $USER1$/check_disc -w $ARGS$ -c $ARGS$ -p $ARGS$ #command_line $USER1$/check_disc } The front-end error is now (Return code of 127 is out of bounds - plugin may be missing) On Mon, Mar 19, 2012 at 11:27 AM, Wolf Halton wolf.hal...@gmail.com wrote: I found it check_command check_disk! -w 20% -c 10% --path=/ It was missing the ! Thanks for helping me sort it out. Wolf On Mon, Mar 19, 2012 at 10:37 AM, Wolf Halton wolf.hal...@gmail.com wrote: I am not sure how I am launching the service, which I admit is bad, but I inherited the setup. I am not sure if it ever worked properly, because it took a catastrophic rampant application running amok, spewing 9GB files to clue me in. I am running nagios 3.3.1 and nagios-plugins 1.4.15 the application is check_local_disk but I think that must have been a check from the previous nagios the one in libexec is check_disk so I tried [code] define service{ use local-service host_name LTS-MASTERKEY-000 service_description Root Partition check_command check_disk -w 20% -c 10% --path=/ notifications_enabled 1 max_check_attempts 3 check_interval 5 retry_interval 3 check_period 24x7 notification_interval 15 notification_period 24x7 notification_options w,c,r contact_groups admins } [/code] and nagios will not start - says there is a configuration error On Fri, Mar 9, 2012 at 7:49 AM, Claudio Kuenzler c...@claudiokuenzler.com wrote: Please show the service definition. How do you launch the check? By ssh, by nrpe? Seems you're using the same IP address or dns name as the hostname value. Can you verify this? On Fri, Mar 9, 2012 at 1:19 PM, Wolf Halton wolf.hal...@gmail.com wrote: All my machines show a similar output, regardless of how much is available on their root partitions. Root Partition OK 03-09-2012 07:11:08 28d 22h 18m 15s 1/3 DISK OK - free space: / 15903 MB (86% inode=93%): Up to and including ones that are 100% full. No alarms - ever. Is a client app needed on the monitored clients that has not been mentioned? -Wolf -- This Apt Has Super Cow Powers - http://sourcefreedom.com Advancing Libraries Together - http://LYRASIS.org --- --- Virtualization Cloud Management Using Capacity Planning Cloud computing makes use of virtualization - but cloud computing also