Re: Having a backoff while experiencing checkpointing failures

2018-06-11 Thread Stefan Richter
Hi, I think the behaviour of min_pause_between_checkpoints is either buggy, or we should at least discuss whether it would be better to respect the pause for failed checkpoints as well. As far as I know there is no ongoing work to add backoff, so I suggest you open a Jira issue and make a case for thi

Having a backoff while experiencing checkpointing failures

2018-06-07 Thread vipul singh
Hello all, are there any recommendations on using a backoff when experiencing checkpointing failures? What we have seen is that when a checkpoint starts to expire, the next checkpoint doesn't take the previous failure into account and starts soon after. We experimented with *min_pause_between_checkpoints*, ho
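
For readers wondering what a backoff could look like here, the following is a minimal sketch of a capped exponential delay computed from the number of consecutive checkpoint failures. It is not part of Flink: the class name `CheckpointBackoff`, the method `nextDelayMillis`, and all parameter values are hypothetical and chosen only for illustration.

```java
// Hypothetical helper, not a Flink API: computes how long to wait before
// triggering the next checkpoint after a run of consecutive failures.
public final class CheckpointBackoff {

    private CheckpointBackoff() {}

    /**
     * Returns baseMillis * 2^consecutiveFailures, capped at maxMillis.
     * With zero failures the delay is simply baseMillis.
     */
    public static long nextDelayMillis(long baseMillis, long maxMillis, int consecutiveFailures) {
        if (baseMillis <= 0 || maxMillis < baseMillis || consecutiveFailures < 0) {
            throw new IllegalArgumentException("invalid backoff parameters");
        }
        long delay = baseMillis;
        for (int i = 0; i < consecutiveFailures; i++) {
            // Stop doubling once another doubling would exceed the cap;
            // this also keeps the multiplication from overflowing.
            if (delay > maxMillis / 2) {
                return maxMillis;
            }
            delay *= 2;
        }
        return Math.min(delay, maxMillis);
    }

    public static void main(String[] args) {
        // Assumed values: 1 s base pause, 10 s upper bound.
        for (int failures = 0; failures <= 5; failures++) {
            System.out.println(failures + " failure(s) -> wait "
                    + nextDelayMillis(1_000L, 10_000L, failures) + " ms");
        }
    }
}
```

The cap matters because an unbounded delay could leave a job without a completed checkpoint for a long time after a transient problem clears. Resetting the failure count to zero after a successful checkpoint would return the delay to the base pause.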