Re: Flink 1.9.2 why always checkpoint expired

Congxian Qiu Sun, 03 May 2020 07:03:16 -0700

Hi

From the picture and the previous email, you use RocksDBStateBackend, all
the operators are chained together, and the checkpoint timeout is set to 2 min.


Do you have keyed state in your job (that is, do you call `keyBy` anywhere in it)?

Here are some things I would check to find the cause of the checkpoint
timeout:
1. Can the snapshot thread acquire the checkpoint lock (if you run on a
version < 1.10)?
2. Does the main thread consume so much CPU that barriers cannot be
handled?
3. Could you please enable debug logging to find out more information?
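
For point 3, a minimal sketch of what this could look like in
conf/log4j.properties, assuming the default log4j 1.x setup that Flink 1.9
ships with. Which loggers are worth raising is a guess on my side; the
packages below are the ones that contain the checkpoint coordinator, the
stream tasks, and the RocksDB state backend:

```properties
# Assumption: default Flink 1.9 logging (log4j 1.x, conf/log4j.properties).
# Leave the root logger at INFO and raise only checkpoint-related packages.

# Checkpoint coordinator (trigger, acknowledge, expire decisions)
log4j.logger.org.apache.flink.runtime.checkpoint=DEBUG
# Stream tasks (barrier handling and snapshot execution on the task side)
log4j.logger.org.apache.flink.streaming.runtime.tasks=DEBUG
# RocksDB state backend (snapshot of keyed state)
log4j.logger.org.apache.flink.contrib.streaming.state=DEBUG
```

Raising only these packages should keep the log volume manageable compared
with switching the root logger to DEBUG. As far as I know, the JobManager and
TaskManagers need a restart for the change to take effect.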


Best,
Congxian


On Tue, Apr 28, 2020 at 9:20 AM qq <471237...@qq.com> wrote:

>
>
> On Apr 27, 2020, at 12:40, Jiayi Liao <buptliaoji...@gmail.com> wrote:
>
> Hi,
>
> The picture in your attachment is too blurry to show any detail. Besides
> the overview, could you take a look at the details of a specific expired
> checkpoint in the history tab? From my experience, expiration is usually
> caused by:
>
> 1. Data skew, which you can spot in the checkpoint details.
> 2. Processing that is too slow (or a back-pressured job) while the
> checkpoint timeout is set too short.
>
> Best Regards,
> Jiayi Liao
>
> On Mon, Apr 27, 2020 at 12:34 PM qq <471237...@qq.com> wrote:
>
>> Hi all,
>>
>> Why do my Flink checkpoints always expire? I use RocksDB for
>> checkpointing, and I can't find any useful messages about this. Could you
>> help me? Thanks very much.
>>
>>
>>
>> <粘贴的图形-1.tiff>
>>
>
>
