The ! is to remove servers that have been included already in the line above in the hostgroup_name, that way I do not need to spell them all out individually or create different groups if you have one server that doesn't fit with everything. Basically I do not want to monitor Check CPU Load on every server in the Nmw_Servers group, so I exclude them on the next line. I thought this was OK to do this?
I will remove the host_name in case the ! is making things weird and my service dependencies as well and let you know what happens. Thanks for the help! Rob Yeo On 1/12/09 8:26 AM, "jmose...@corp.xanadoo.com" <jmose...@corp.xanadoo.com> wrote: > First, what's with the '!' preceding many of your hosts names? Second, in > your service and service dependency configs, why are you defining both > hosts and hostgroups? For example: > > define service{ > hostgroup_name Nmw_Servers,Wi_Servers > host_name > !Nmw_104_010_Finearts_Dataserver,!Nmw_104_020_Nagios,Dgh_000_012_FairUse,Art > > _104_014_FilmServer,!Nmw_104_092_Nagios3 > > service_description Check CPU Load > use medium-high-priority-service > check_command check_nrpe!check_load > } > > I'd start with removing the host_name portion of the config and including > 'Dgh_Servers' in the hostgroup_name stanza. Otherwise, you've got a lot of > overkill... > > Lastly, get rid of the service dependencies until you have this sorted out > and working. > > > > James Moseley > > > > > > Robert Yeo > <r....@utah.edu> > To > 01/12/2009 08:57 <Nagios-users@lists.sourceforge.net > AM > > cc > > Subject > Re: [Nagios-users] check_ping stuck > on 'PENDING' > > > > > > > > > > > The operation I am employing is Check CPU Load but the symptoms were the > same as Michiel's check ping. > > Hopefully all the information is there now. > > Thanks! > > Rob Yeo > > # Finearts Nagios 2 Server > define host{ > > host_name Dgh_000_012_FairUse > alias Finearts Nagios 2 Server > address 155.xxx.xxx.xxx > > use server-high-priority > hostgroups Dgh_Servers > parents Dgh_Switch_02 > icon_image mac40.png > statusmap_image mac40.gd2 > } > > # '003 DGH Servers' server group definition > define hostgroup{ > hostgroup_name Dgh_Servers > alias DGH_SERVERS > } > > # 'check_nrpe' command definition > define command { > command_name check_nrpe > command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c $ARG1$ -u > }' > > define service{ > hostgroup_name Nmw_Servers,Wi_Servers > host_name > !Nmw_104_010_Finearts_Dataserver,!Nmw_104_020_Nagios,Dgh_000_012_FairUse,Art > > > _104_014_FilmServer,!Nmw_104_092_Nagios3 > > service_description Check CPU Load > use medium-high-priority-service > check_command check_nrpe!check_load > } > > define servicedependency{ > hostgroup_name Nmw_Servers,Wi_Servers > host_name > !Nmw_104_010_Finearts_Dataserver,!Nmw_104_020_Nagios,!Nmw_104_011_Finearts_F > > ileserver,Dgh_000_012_FairUse,Art_104_014_FilmServer,!Nmw_104_092_Nagios3 > > service_description Check NRPE > dependent_hostgroup_name Nmw_Servers,Wi_Servers > dependent_host_name > !Nmw_104_010_Finearts_Dataserver,!Nmw_104_020_Nagios,!Nmw_104_011_Finearts_F > > ileserver,Dgh_000_012_FairUse,Art_104_014_FilmServer,!Nmw_104_092_Nagios3 > dependent_service_description Check CPU Load,Check Free > Space,Current > Users,Total Processes,Zombie Processes > > execution_failure_criteria w,u,c,p > notification_failure_criteria w,u,c,p > } > > define host{ > name generic-host > notifications_enabled 1 > event_handler_enabled 1 > flap_detection_enabled 1 > failure_prediction_enabled 1 > process_perf_data 1 > retain_status_information 1 > retain_nonstatus_information 1 > register 0 > } > > define host{ > name server-high-priority > use generic-host > check_period 24x7 > check_interval 2 > max_check_attempts 2 > check_command check-host-alive > notification_period 24x7 > notification_interval 120 > notification_options d,r,f > contact_groups nagiosadmin > register 0 > } > > > define service{ > name medium-high-priority-service > use generic-service > check_period 24x7 > max_check_attempts 5 > normal_check_interval 5 > retry_check_interval 2 > contact_groups nagiosadmin > notification_options c,r,f > notification_interval 180 > notification_period extendedworkhours > register 0 > } > > define service{ > name generic-service > active_checks_enabled 1 > passive_checks_enabled 1 > parallelize_check 1 > obsess_over_service 0 > check_freshness 0 > notifications_enabled 1 > event_handler_enabled 1 > flap_detection_enabled 1 > failure_prediction_enabled 1 > process_perf_data 1 > retain_status_information 1 > retain_nonstatus_information 1 > is_volatile 0 > register 0 > } > > # 'extendedworkhours' timeperiod definition > define timeperiod{ > timeperiod_name extendedworkhours > alias Extended Work Hours > sunday 10:00-22:00 > monday 07:00-22:00 > tuesday 07:00-22:00 > wednesday 07:00-22:00 > thursday 07:00-22:00 > friday 07:00-22:00 > saturday 10:00-22:00 > } > > # '24x7' timeperiod definition > define timeperiod{ > timeperiod_name 24x7 > alias 24 Hours A Day, 7 Days A Week > sunday 00:00-24:00 > monday 00:00-24:00 > tuesday 00:00-24:00 > wednesday 00:00-24:00 > thursday 00:00-24:00 > friday 00:00-24:00 > saturday 00:00-24:00 > } > > Nagios 3.0.6 on 10.4.11 OSX. > > The service check is stuck in pending for the DGH server, and is returning > fine for 2 of the NMW servers, but not all of them. > > Thanks, > > Rob Yeo > > > On 1/9/09 4:34 PM, "jmose...@corp.xanadoo.com" <jmose...@corp.xanadoo.com> > wrote: > >> I don't see a service definition for 'server-high-priority', nor do I see >> where you are employing a check_ping operation, nor have you included the >> check_period config for 24x7. >> >> >> James Moseley >> >> >> >> >> >> Robert Yeo >> <r....@utah.edu> >> > To >> 01/09/2009 05:13 nagios-users Mailinglist >> PM > <Nagios-users@lists.sourceforge.net >>> >> > cc >> >> > Subject >> Re: [Nagios-users] check_ping > stuck >> on 'PENDING' >> >> >> >> >> >> >> >> >> >> >> >> This is an example config for one of the services. Please let me know if > I >> am missing any info. >> >> # Finearts Nagios 2 Server >> define host{ >> >> host_name Dgh_000_012_FairUse >> alias Finearts Nagios 2 Server (fairuse.music.utah.edu) >> address 155.97.55.12 >> >> use server-high-priority >> hostgroups Dgh_Servers >> parents Dgh_Switch_02 >> icon_image mac40.png >> statusmap_image mac40.gd2 >> } >> >> # '003 DGH Servers' server group definition >> define hostgroup{ >> hostgroup_name Dgh_Servers >> alias DGH_SERVERS >> } >> >> # 'check_nrpe' command definition >> define command { >> command_name check_nrpe >> command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c $ARG1$ -u >> }' >> >> define service{ >> hostgroup_name Nmw_Servers,Wi_Servers >> host_name >> > !Nmw_104_010_Finearts_Dataserver,!Nmw_104_020_Nagios,Dgh_000_012_FairUse,Art > >> >> _104_014_FilmServer,!Nmw_104_092_Nagios3 >> >> service_description Check CPU Load >> use medium-high-priority-service >> check_command check_nrpe!check_load >> } >> >> define service{ >> name medium-high-priority-service >> use generic-service >> check_period 24x7 >> max_check_attempts 5 >> normal_check_interval 5 >> retry_check_interval 2 >> contact_groups nagiosadmin >> notification_options c,r,f >> notification_interval 180 >> notification_period extendedworkhours >> register 0 >> } >> >> define service{ >> name generic-service >> active_checks_enabled 1 >> passive_checks_enabled 1 >> parallelize_check 1 >> obsess_over_service 0 >> check_freshness 0 >> notifications_enabled 1 >> event_handler_enabled 1 >> flap_detection_enabled 1 >> failure_prediction_enabled 1 >> process_perf_data 1 >> retain_status_information 1 >> retain_nonstatus_information 1 >> is_volatile 0 >> register 0 >> } >> >> # 'extendedworkhours' timeperiod definition >> define timeperiod{ >> timeperiod_name extendedworkhours >> alias Extended Work Hours >> sunday 10:00-22:00 >> monday 07:00-22:00 >> tuesday 07:00-22:00 >> wednesday 07:00-22:00 >> thursday 07:00-22:00 >> friday 07:00-22:00 >> saturday 10:00-22:00 >> } >> >> Nagios 3.0.6 on 10.4.11 OSX. >> >> The service check is stuck in pending for the DGH server, and is > returning >> fine for 2 of the NMW servers, but not all of them. >> >> Thanks, >> >> Rob Yeo >> >> On 1/9/09 4:04 PM, "Marc Powell" <m...@ena.com> wrote: >> >>> >>> On Jan 9, 2009, at 4:48 PM, <ibloodyhates...@gmail.com> >>> <ibloodyhates...@gmail.com >>>> wrote: >>> >>>> As per the subject... Nagios 3.0.6 and it happens both to lan and >>>> external servers. >>> >>>> As ping¹s a very basic check and as far as I can see everything >>>> should work normally, right now I¹m stumped. Any idea what's going >>>> on here? >>> >>> Actual config information is necessary otherwise we're just guessing >>> in the dark. Nagios doesn't think it's supposed to perform a check. Is >>> the timeperiod right? >>> >>> host definition, service definition (and templates for both if used) >>> and timeperiod definition would be a good start. >>> >>> -- >>> Marc >>> >> > ------------------------------------------------------------------------------ > >> >>> Check out the new SourceForge.net Marketplace. >>> It is the best place to buy or sell services for >>> just about anything Open Source. >>> http://p.sf.net/sfu/Xq1LFB >>> _______________________________________________ >>> Nagios-users mailing list >>> Nagios-users@lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting >>> any issue. >>> ::: Messages without supporting info will risk being sent to /dev/null >> >> >> > ------------------------------------------------------------------------------ > >> >> Check out the new SourceForge.net Marketplace. >> It is the best place to buy or sell services for >> just about anything Open Source. >> http://p.sf.net/sfu/Xq1LFB >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> >> > > > ------------------------------------------------------------------------------ > > Check out the new SourceForge.net Marketplace. > It is the best place to buy or sell services for > just about anything Open Source. > http://p.sf.net/sfu/Xq1LFB > _______________________________________________ > Nagios-users mailing list > Nagios-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ Check out the new SourceForge.net Marketplace. It is the best place to buy or sell services for just about anything Open Source. http://p.sf.net/sfu/Xq1LFB _______________________________________________ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null