Re: [Nagios-users] nagios scheduling and Hard/soft state question

2009-09-03 Thread Marc Powell

On Sep 3, 2009, at 2:29 PM, shadih rahman wrote:

 All,
 According to the definition, a hard state is reached upon completing
 max_check_attempts. This particular service's status information
 states otherwise. This service check has max_check_attempts set to 3.
 However, it looks like the soft state changed to hard without the
 check being attempted 3 times. Is this a bug, or am I missing
 something here? Please advise. Thanks

 [09-02-2009 13:58:30] SERVICE ALERT: Host  
 B;batteryliebert;WARNING;HARD;1;Status is a WARNING level - SNMP OID  
 does not exist
 [09-02-2009 13:56:30] SERVICE ALERT:  
 HOSTA;batteryliebert;WARNING;SOFT;1;Status is a WARNING level - SNMP  
 agent not responding

Do you have 'is_volatile' enabled? Please post the entire service  
definition from objects.cache if you do not as well as a few prior log  
entries for this service.

--
Marc


--
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with 
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] nagios scheduling and Hard/soft state question

2009-09-03 Thread shadih rahman
I don't have is_volatile enabled. My log entries and service
definition are pasted below. Thanks


*log entries*

[1251777600] CURRENT HOST STATE: HOST A;UP;HARD;1;FPING OK - HOST A
(loss=0%, rta=0.85 ms)
[1251777600] CURRENT SERVICE STATE: HOST A;batteryliebert;OK;HARD;1;Status
is OK - GXT2-2000RT120 - STATUS NORMAL -
[1251777600] CURRENT SERVICE STATE: HOST A;capacity;OK;HARD;1;SNMP OK - 100
[1251777600] CURRENT SERVICE STATE: HOST A;output_current;OK;HARD;1;SNMP OK
- 32 0.1 RMS Amp
[1251777600] CURRENT SERVICE STATE: HOST A;search_aid;OK;HARD;1;(null)
[1251777600] CURRENT SERVICE STATE: HOST A;temp;OK;HARD;1;SNMP OK - 32
[1251914190] SERVICE ALERT: HOST A;batteryliebert;WARNING;SOFT;1;Status is a
WARNING level - SNMP agent not responding
[1251914200] HOST ALERT: HOST A;DOWN;SOFT;1;FPING CRITICAL - HOST A
(loss=100% )
[1251914310] SERVICE ALERT: HOST A;batteryliebert;WARNING;HARD;1;Status is a
WARNING level - SNMP OID does not exist
[1251914390] HOST ALERT: HOST A;DOWN;SOFT;2;FPING CRITICAL - HOST A
(loss=100% )
[1251914580] HOST ALERT: HOST A;UP;SOFT;3;FPING WARNING - HOST A
[1251914600] SERVICE ALERT: HOST A;batteryliebert;OK;HARD;1;Status is OK -
GXT2-2000RT120 - STATUS NORMAL -
[1251915210] SERVICE ALERT: HOST A;batteryliebert;WARNING;SOFT;1;Status is a
WARNING level - SNMP OID does not exist
[1251915330] SERVICE ALERT: HOST A;batteryliebert;WARNING;SOFT;2;Status is a
WARNING level - SNMP agent not responding
[1251915440] SERVICE ALERT: HOST A;batteryliebert;OK;SOFT;3;Status is OK -
GXT2-2000RT120 - STATUS NORMAL -
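
For anyone comparing these entries with the human-readable timestamps in the first message, here is a minimal Python sketch that splits a SERVICE ALERT line into its fields and converts the epoch timestamp to UTC. The function name and regular expression are illustrative assumptions, not part of Nagios, and the sketch assumes each entry has been rejoined onto a single line (the archive wrapped them).

```python
import re
from datetime import datetime, timezone

# Field layout taken from the entries above:
# [epoch] SERVICE ALERT: host;service;state;state_type;attempt;plugin output
ALERT_RE = re.compile(
    r"^\[(?P<ts>\d+)\] SERVICE ALERT: "
    r"(?P<host>[^;]+);(?P<service>[^;]+);(?P<state>[^;]+);"
    r"(?P<state_type>[^;]+);(?P<attempt>\d+);(?P<output>.*)$"
)

def parse_service_alert(line):
    """Return a dict of fields for a SERVICE ALERT line, or None otherwise."""
    m = ALERT_RE.match(line.strip())
    if m is None:
        return None
    entry = m.groupdict()
    entry["ts"] = int(entry["ts"])
    entry["attempt"] = int(entry["attempt"])
    # Nagios logs epoch seconds; convert to an aware UTC datetime.
    entry["time_utc"] = datetime.fromtimestamp(entry["ts"], tz=timezone.utc)
    return entry

if __name__ == "__main__":
    lines = [
        "[1251914190] SERVICE ALERT: HOST A;batteryliebert;WARNING;SOFT;1;"
        "Status is a WARNING level - SNMP agent not responding",
        "[1251914310] SERVICE ALERT: HOST A;batteryliebert;WARNING;HARD;1;"
        "Status is a WARNING level - SNMP OID does not exist",
    ]
    for raw in lines:
        e = parse_service_alert(raw)
        print(e["time_utc"].isoformat(), e["state"], e["state_type"], e["attempt"])
```

The two sample entries are 120 seconds apart, which matches the retry_interval of 2 minutes in the service definition below.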


*service definition*

define service {
host_name                     HOST A
service_description           batteryliebert
check_period                  noncritical
check_command                 check_liebert_ups
contact_groups                netsys
notification_period           extended
initial_state                 o
check_interval                5.00
retry_interval                2.00
max_check_attempts            3
is_volatile                   0
parallelize_check             1
active_checks_enabled         1
passive_checks_enabled        1
obsess_over_service           1
event_handler_enabled         1
low_flap_threshold            0.00
high_flap_threshold           0.00
flap_detection_enabled        1
flap_detection_options        o,c
freshness_threshold           0
check_freshness               0
notification_options          c,r,f
notifications_enabled         1
notification_interval         30.00
first_notification_delay      0.00
stalking_options              n
process_perf_data             1
failure_prediction_enabled    1
retain_status_information     1
retain_nonstatus_information  1
}




-- 
Cordially,
Shadhin Rahman

Re: [Nagios-users] nagios scheduling and Hard/soft state question

2009-09-03 Thread Marc Powell

On Sep 3, 2009, at 3:38 PM, shadih rahman wrote:

 I don't have  is_volatile enabled.  Below I am pasting my log  
 entries and service definition.  Thanks

Weren't you asking about HOST B;batteryliebert?

 log entries

 [1251914190] SERVICE ALERT: HOST A;batteryliebert;WARNING;SOFT;1;Status is a WARNING level - SNMP agent not responding

The service isn't responding, so it gets a WARNING (that must be the
default for that plugin?). Nagios now checks the host:

 [1251914200] HOST ALERT: HOST A;DOWN;SOFT;1;FPING CRITICAL - HOST A  
 (loss=100% )

Host is down!

 [1251914310] SERVICE ALERT: HOST A;batteryliebert;WARNING;HARD;1;Status is a WARNING level - SNMP OID does not exist

If a service has a problem and its host is down, retries aren't needed,
so a HARD state results.

I haven't looked in depth at the new check logic with the introduction  
of parallel host checks to be absolutely certain but the above seems  
reasonable based on what nagios did in the past.
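
The rule described above can be sketched in Python as follows. This is a simplified model of the behavior as described in this message, not code taken from or checked against the Nagios source. The function name and parameters are hypothetical, and it covers only non-OK service results (recoveries are left out).

```python
def problem_state_type(host_is_up, attempt, max_check_attempts):
    """State type for a non-OK service check result (simplified model).

    host_is_up         -- result of the on-demand host check (assumed input)
    attempt            -- current check attempt number, starting at 1
    max_check_attempts -- value from the service definition
    """
    # Host not up: retrying the service is pointless, so go HARD immediately.
    if not host_is_up:
        return "HARD"
    # Host up: stay SOFT until the retries are exhausted.
    if attempt >= max_check_attempts:
        return "HARD"
    return "SOFT"


if __name__ == "__main__":
    # Service problem on attempt 1 while the host is seen as down.
    print(problem_state_type(host_is_up=False, attempt=1, max_check_attempts=3))
    # Same problem while the host is up: still SOFT on attempts 1 and 2.
    print(problem_state_type(host_is_up=True, attempt=2, max_check_attempts=3))
```

Under this model, the WARNING;HARD;1 entry is what you would expect when the host check has just come back DOWN, even though max_check_attempts is 3.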

--
Marc

