Re: mod_jk 1.2.17+ Recover time

Jean-frederic Clere Wed, 19 Jul 2006 03:59:41 -0700

Rainer Jung wrote:

No, I think it's not:
1) This is not a regression, it was always implemented like that.
2) The recover feature is used in the load balancer and the first wayof avoiding errors is meant to be retries, the second way is failover.Only then comes recovery.
3) A worker that goes into error state is somethingserious/heavy-weight. Timeouts leading to error state should not bechosen to small, so that workers go into errors just because ofregular long running requests.
4) Recovering a worker is not something lightweight, because a stucktomcat might mean, that every recovery times out at full length.Remember: we are doing recovery with real requests. I think it's not agood idea to try recovering with real requests very often. That's thereason for only trying to recover rarely.
5) Once we might have seperate management threads in mod_proxy_ itwould make sense to probe failed workers more often.

I am preparing a health checker separed process from httpd to healthcheck the workers. If not healthy no retries failover directly, therecovering will only occurs when the worker is marked healty again bythe health checker process.

6) We could make the interval configurable, but there is a real dangerof users thinking, that a low recovery interval, like 10 seconds wouldmake things better, whereas it is very likely, that it would makethere whole system kind of oscillate.

The next problem is to find a way to tell TC that its connexions havebeen closed (by a stupid firewall that eats the closes for example).That is nice to recover but how to make sure the TC part knows thatsomething has went wrong.


Cheers

Jean-Frederic


Reagrds

Rainer

 to the full timeouts in the worker

Henri Gomez wrote:

Well a new show stopper for 1.2.18 ;(

2006/7/18, David Rees <[EMAIL PROTECTED]>:

On 7/18/06, Jess Holle <[EMAIL PROTECTED]> wrote:
> Is the 60 seconds hard-coded?
>
> I'd hope not...
>

> Once you have some interesting web apps in Tomcat it often takes abit> longer than 10 seconds -- and on my laptop just took a full 60seconds,

> but that is rather unusual (a restart thereafter only took 18).

Yes, it's hard-coded. See my references in my first post.

-Dave

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: mod_jk 1.2.17+ Recover time

Reply via email to