Re: S3 Checkpointing taking long time with stateful operations

2020-06-20 Thread Congxian Qiu
Hi Sandeep, the picture isn't shown. First, try to find out whether any operator has a large end-to-end (e2e) snapshot time. The e2e snapshot time for one operator is time{barrier-align} + time{sync-snapshot} + time{async-snapshot}. Exactly-once and at-least-once both need to wait for barrier al
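
To make the e2e breakdown concrete, here is a minimal sketch that sums the three phases per operator and picks the slowest one. The operator names and millisecond values are made up for illustration and are not taken from this job; the class and function names are hypothetical as well.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class OperatorSnapshot:
    """Per-operator checkpoint timings in milliseconds (hypothetical container)."""
    name: str
    align_ms: int   # time spent waiting for barriers
    sync_ms: int    # synchronous part of the snapshot
    async_ms: int   # asynchronous part of the snapshot

    @property
    def e2e_ms(self) -> int:
        # e2e = barrier-align + sync-snapshot + async-snapshot
        return self.align_ms + self.sync_ms + self.async_ms


def slowest(snapshots: List[OperatorSnapshot]) -> OperatorSnapshot:
    """Return the operator with the largest e2e snapshot time."""
    return max(snapshots, key=lambda s: s.e2e_ms)


# Illustrative numbers only, not measurements from the job in this thread.
samples = [
    OperatorSnapshot("source", align_ms=50, sync_ms=20, async_ms=300),
    OperatorSnapshot("window-agg", align_ms=4000, sync_ms=150, async_ms=90000),
    OperatorSnapshot("sink", align_ms=100, sync_ms=10, async_ms=500),
]

worst = slowest(samples)
print(f"{worst.name}: e2e={worst.e2e_ms} ms "
      f"(align={worst.align_ms}, sync={worst.sync_ms}, async={worst.async_ms})")
```

Whichever phase dominates for the slowest operator shows where to look next.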

S3 Checkpointing taking long time with stateful operations

2020-06-18 Thread Kathula, Sandeep
Hi, we are running a stateful Flink application with RocksDB as the state backend, incremental checkpointing enabled, and checkpoints written to S3.
* 10 task managers, each with 2 task slots
* Checkpoint interval: 3 minutes
* Checkpointing mode: at-least-once processing
After running app for