Re: Random incorrect checkpoint existence check

2022-04-25 Thread Hangxiang Yu
Hi, Chen-che,
I think this may be similar to FLINK-12381.
You could adopt the suggestion from the comments below the ticket, such as
setting the job-id.
You could also share your environment details in that ticket so that we
have more information.
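
As a rough illustration of the job-id suggestion, the sketch below generates a
fresh job id for each deployment, so that checkpoint paths from an earlier run
are not reused. Everything in it is an assumption rather than something
confirmed in this thread: the option key `$internal.pipeline.job-id` is the
internal setting as I recall it from the FLINK-12381 discussion, and the helper
names `make_job_id` and `job_id_flag` are hypothetical. Please check the ticket
for the exact key and how to pass it in your deployment mode.

```python
import uuid

# Assumption: internal Flink option key recalled from the FLINK-12381 comments.
# It is not a public, documented option; verify it against the ticket.
JOB_ID_OPTION = "$internal.pipeline.job-id"


def make_job_id() -> str:
    """Return a random 32-character lowercase hex string (16 bytes)."""
    return uuid.uuid4().hex


def job_id_flag(job_id: str) -> str:
    """Format the job id as a -D dynamic property (hypothetical helper)."""
    return f"-D{JOB_ID_OPTION}={job_id}"


if __name__ == "__main__":
    # Print a flag that could be appended to the job's startup arguments.
    print(job_id_flag(make_job_id()))
```

If the flag is passed through a shell, the leading `$` in the key has to be
escaped or single-quoted so the shell does not expand it.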

On Mon, Apr 25, 2022 at 9:58 AM Chen-Che Huang wrote:

> Hi all,
>
> We recently encountered an intermittent issue. When our Flink application
> creates a checkpoint, it occasionally fails because it thinks the
> checkpoint's metadata file already exists, even though the file does not
> actually exist. We use Flink 1.14.4 and store the checkpoints in Google
> Cloud Storage. Has anyone encountered the same problem? Any comments are
> appreciated.
>
> Caused by: org.apache.hadoop.fs.FileAlreadyExistsException: 
> 'gs://bucket/flink-app//chk-5812/_metadata' 
> already exists
>
> Best wishes,
>
> Chen-Che Huang
>
>


Random incorrect checkpoint existence check

2022-04-24 Thread Chen-Che Huang
Hi all,

We recently encountered an intermittent issue. When our Flink application
creates a checkpoint, it occasionally fails because it thinks the
checkpoint's metadata file already exists, even though the file does not
actually exist. We use Flink 1.14.4 and store the checkpoints in Google
Cloud Storage. Has anyone encountered the same problem? Any comments are
appreciated.

Caused by: org.apache.hadoop.fs.FileAlreadyExistsException:
'gs://bucket/flink-app//chk-5812/_metadata'
already exists

Best wishes,

Chen-Che Huang