Re: HealthCheckExecutor.execute(ServiceReference) ??

Georg Henzler Tue, 31 Dec 2013 03:08:54 -0800


Hi Carsten,

If - for whatever reason - I have an arbitrary set of HCs to executeinparallel, this is not possible atm. So what about having just asingle
method?

execute(ServiceReference<HealthCheck>[] hcs)


This is an option, but I would at least add the convenience method

execute(String... tags)

that handles the lookup - I can't think of a use case for the arbitraryset of HCs at the moment, execution for tags or all checks is really allI would need and I'd love to see the most useful method in the API ;-)

I also think we should move the executor service interface and theresult
interface (regardless of how this looks) into a separate package.


Looks good.

For the result, I think we should leave the Result class untouched asthisis the result as produces by the HC - which is not exactly the sameas the
result produced by the executor.
We can rename the result class from the Executor to
HealthCheckExecutionResult - to make this clear and instead of havingthesame methods as the Result class, simple add a method which returnstheResult object that was returned by the HealthCheck and just addadditionalmethods for the additional information. Instead of returning the metadata(like tags, name etc.) we could also simply return theServiceReference -as we now have the service reference as input, we can return it aswell.

I would NOT return the ServiceReference... the descriptor is a lotnicer as it handles the reading of the HC service properties (sort of adata binding). Also from an implementors point of view, it is a lotnicer to be able to use a typed class than a service reference+accessingproperties (having to find the prop names from somewhere, and thenhandling the complexity of reading an array making sure also getting theone-entry-array right).


Also we should not mix different data types:
1) The actual result being returned by the HC (unique per result)

2) HealthCheckDescriptor is unique per HC and gives easy, typed accessto the properties3) The timing information (unique per result) can go inHealthCheckExecutionResult (I don't really like to have to usegetHealthCheckResult().getStatus(), I would rather extend an interfacefrom org.apache.sling.hc.api, but then it gets all a bit morecomplicated than it needs to be - that's why I used one interface +UnsupportedOperationException)

@Betrand: In your wiki page you suggest mixing 2) and 3) in an untypedmap - I think this is not a good idea (this is mixing information thatis unique per HC and unique per execution).@Carsten: If we stay with a separate execution result, we shoulddefinitely use get rid of getHealthCheckName() and getHealthCheckTags()and use getHealthCheckDescriptor() (or meta data if you prefer) toseparate data type 2) from 3)


Regards
Georg

P.S. Happy & *healthy* new year! Last email from my side this year :)

I've committed the changes except the last point to make it clearerwhat I
mean - we can easily revert if we disagree (rev 1554396)

WDYT?

Regards
Carsten


2013/12/30 Georg Henzler <[email protected]>
Hi Betrand and Christian,
now I'm afraid we might go a step backward... some things that yousuggest
to be unnecessary are essentially important from my point of view -
supported by one year experience getting to this stage :). Here aremy
explanations:

--- RE: executeXXX Methods
We definitely need the executeForTags(String... tags) andexecuteAll()methods because only this way we get parallel execution withintelligent
timeout handling and current results:
- If all checks are quick (let's say the the longest check requires100msso finish), executeXxx() will return after 100ms having all currentresults(Bertrand, your patch was returning the results of the last call, ifthe
last call was 2h ago, all results would be two hours old!)
- If all checks except one are quick and let's say the one checktakes 3stiming out, we get all current results for the quick checks and oneWARNtimeout result for one check (if we wanted we could change it to thelastcached result, but for my last project this behaviour worked better,also
see next bullet point)
- The global timeout is normally configured reasonably ensuring thatwhen
things are ok, checks never time out
- Sometimes, if things go bad, a check can be stuck completely (Iknowthis shouldn't happen, but I have seen it happening). For this caseaspecial timeout can be configured marking checks as critical if theyexceed
this timeout (10sec default)

The current implementation is robust and correct in many ways:
- no matter how many calls to the health check executor are made(e.g. 100per sec or only 1 every day), we get the latest results (respectingthecache TTL) but without ever executing the same check twice (tworequest tothe web console started at the exact same time will only cause eachcheckto be called once and the two requests will return the exact sameresult)
- it uses never more threads than checks exist in the system
- no matter how bad one check is implemented or how "sick" theaspect thecheck observes is, it will never in any way cause the executor tostopworking except that it returns red for the check (returning theresults for
the other checks correctly)

--- RE: Health Check MetaData

It is important to distinguish two aspects:

1) defining the meta data
2) allow access to the meta data
to 1) I definitely think this should be done in a declarative way(notprogrammatically by adding a method to the HealthCheck interface!).There
is just no need for touring complete code here. Using the service
properties as it's done in the moment is IMHO the best way.
to 2) There should be definitely a way to access the meta data fromtheresult. My way was to use the class HealthCheckDescriptor and add amethodresult.getHealthCheckDescriptor() to the result (funny enough thatclasswas called HealthCheckMetaData in my first version but I renamed itlaterbecause I thought Descriptor is better). Having a flat resultstructure wasrequested by Carsten, that's why we have the methodsgetHealthCheckName()and getHealthCheckTags() now. The cleanest solution my opinion is togo
back to result.getHealthCheckDescriptor() - or we can call it
getHealthCheckMetaData() if you prefer. Also we can create aninterfaceHealthCheckMetaData() and keep that in line with the serviceproperties.
--- RE: "why can't we provide these values from the HealthCheck thatwas
used to create it?" (these values=timing information)

Because
- every check would have to do it (code duplication, pushingcomplexity to
the implementor)
- it cannot be guaranteed, that every health check does it correctly


--- RE: UnsupportedOperationException in Result
Using the UnsupportedOperationException is IMHO an elegant solution.Theexception is part of the JDK and for instance the Collection API isusing
it as well. It is elegant because
- From a conceptual/object oriented view, it totally makes sense tohavegetElapsedTime() or getFinishedAt() as properties of a result.However, theHC itself cannot provide this information (see last section) andit's fine
to throw UnsupportedOperationException for these results then.
- If the methods would be taken away from the interface, a cast to
ExecutionResult would be needed (that is hidden at the momentanyway) to
show this information in the web console
- From an implementors point of view, it is quickly obvious that the
methods are only available when using the executor (the exceptionmessage
is pretty clear)
- Typical implementors would not even notice as they just constructtheresult, but don't call methods on it (that is the job of the webconsole
and JMX)
- We should have getElapsedTime() and getFinishedAt() in the
HealthCheckResult interface because this information is useful forhuman
observers (and should be shown in web console / JMX)


-Georg


Am 30.12.2013 11:30, schrieb Carsten Ziegeler:
The more I think about it, the more I get the feeling that if wecan't
away
with execute(ServiceReference) in the executor api, this should bethe
only
method of that service.
The two others can easily be replaced by using the HCFilter andthencalling the method above for each service reference. It's just oneline
more code, but this would completely remove the need to add meta
information about the HC in the result and make the wholeimplementation
easier.

WDYT?

Carsten


2013/12/30 Carsten Ziegeler <[email protected]>

 2013/12/27 Bertrand Delacretaz <[email protected]>
 Hi Carsten and Georg,
I was going to say that I still disagree, but being outnumberedI'dlet it go if it's just about a single method that I don'tlike...andthen I reviewed the org.apache.sling.hc.api package once againand
it's worse than I thought.
The problem is really, if we don't have a method with thissignature
how
can the jmx implementation make use of the executor while theexecutor isstill able to cache the result, do async processing etc. ? I don'tsee
any
other solution atm, apart from moving the jmx implementation intothe
core.
You guys both say you don't want metadata in APIs...and the
HealthCheckResult interface that you added has these two methods
(which make sense BTW):

  String getHealthCheckName();
  List<String> getHealthCheckTags();

So a HC is not brave enough to have this in its API, but a Result
needs them? This clearly demonstrates that we need a
HealthCheckMetadata interface that provides those values.
I think these are different concerns, a service(!) does not needan api
to expose the metadata because it's already in the serviceregistration.When we return a set of results, there need to be way ofcorrelationbetween a single result and an actual check, otherwise you get abunch of
results without any additional information about the check.
The way out of this, would be to just have a single method
execute(ServiceReference) in the executor :) - but as soon as weallowexecution of more than a single HC we need this additionalinformation.
 On top of that, the Result throws UnsupportedException on these
methods...how ugly is that, why can't it provide these valuesfrom the
HealthCheck that was used to create it?

 I haven't looked deeply at this part yet...
To me this smells of implementation-driven APIs, we are used tobetter
design in Sling.
As pointed out, I'm all ears for a better solution of theexecutor
interface that allows us to get the above mentioned features. Andas
pointed out below, if we simplify the executor service to a single
method,
we don't need to make any changes to Result or add any additional
information.

Carsten

What do others think?
-Bertrand

[1]



https://svn.apache.org/repos/asf/sling/trunk/bundles/
extensions/healthcheck/core/src/main/java/org/apache/sling/hc/api
--
Carsten Ziegeler
[email protected]

Re: HealthCheckExecutor.execute(ServiceReference) ??

Reply via email to