[ 
https://issues.apache.org/jira/browse/UIMA-4829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15269479#comment-15269479
 ] 

Lou DeGenaro edited comment on UIMA-4829 at 5/4/16 11:06 AM:
-------------------------------------------------------------

When SM sets the state of a service, it now considers whether the pinger is 
stale.

If the current state is Available and the pinger is stale, then force the state 
to become Waiting.  This will block allocation of resources for those 
Jobs/Services that have a dependency on the now Waiting service.  Once the 
pinger becomes un-stale, the SM will change the service's state back to 
Available.

Pinger staleness occurs when the last update time from the pinger exceeds 
ducc.sm.meta.ping.stability * ducc.sm.meta.ping.rate, which are already 
existing ducc.properties variables.

How many consecutive missed pings are required to mark a service unresponsive
ducc.sm.meta.ping.stability = 10

How often services are monitored to insure they're responding, in milliseconds
ducc.sm.meta.ping.rate = 60000





was (Author: lou.degenaro):
When SM sets the state of a service, it now considers whether the pinger is 
stale.

If the current state is Available and the pinger is stale, then force the state 
to become Waiting.  This will block allocation of resources for those 
Jobs/Services that have a dependency on the now Waiting service.  Once the 
pinger becomes un-stale, the SM will change the service's state back to 
Available.

Pinger staleness occurs when the last update time from the pinger exceeds 
ducc.sm.meta.ping.stability * ducc.sm.meta.ping.rate, which are already 
existing ducc.properties variables.

# How many consecutive missed pings are required to mark a service unresponsive
ducc.sm.meta.ping.stability = 10

# How often services are monitored to insure they're responding, in milliseconds
ducc.sm.meta.ping.rate = 60000




> When a DUCC pinger hangs it is reported as Available
> ----------------------------------------------------
>
>                 Key: UIMA-4829
>                 URL: https://issues.apache.org/jira/browse/UIMA-4829
>             Project: UIMA
>          Issue Type: Bug
>          Components: DUCC
>            Reporter: Burn Lewis
>            Assignee: Lou DeGenaro
>            Priority: Minor
>             Fix For: 2.1.0-Ducc
>
>
> The DUCC pinger can hang and the main evidence is the stale ping statistics.  
> The SM should mark it as unavailable and indicate that pinger is active but 
> not responding.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to