Re: Duplicate copies of job in Flink UI/API

2021-09-09 Thread Chesnay Schepler
Schepler *Date: *Thursday, September 9, 2021 at 9:11 AM *To: *Peter Westermann , Piotr Nowojski , user@flink.apache.org *Subject: *Re: Duplicate copies of job in Flink UI/API Just to double-check that I'm understanding things correctly: You have a job with HA, then Zookeeper breaks down, the job gets

Re: Duplicate copies of job in Flink UI/API

2021-09-09 Thread Peter Westermann
: Chesnay Schepler Date: Thursday, September 9, 2021 at 9:11 AM To: Peter Westermann , Piotr Nowojski , user@flink.apache.org Subject: Re: Duplicate copies of job in Flink UI/API Just to double-check that I'm understanding things correctly: You have a job with HA, then Zookeeper breaks down, the job

Re: Duplicate copies of job in Flink UI/API

2021-09-09 Thread Chesnay Schepler
: Duplicate copies of job in Flink UI/API Hi Peter, Can you provide relevant JobManager logs? And can you write down what steps have you taken before the failure happened? Did this failure occur during upgrading Flink, or after the upgrade etc. Best, Piotrek śr., 8 wrz 2021 o 16:11 Peter

Re: Duplicate copies of job in Flink UI/API

2021-09-09 Thread Peter Westermann
election is expected. Thanks, Peter From: Piotr Nowojski Date: Thursday, September 9, 2021 at 12:39 AM To: Peter Westermann Cc: user@flink.apache.org Subject: Re: Duplicate copies of job in Flink UI/API Hi Peter, Can you provide relevant JobManager logs? And can you write down what steps have you

Re: Duplicate copies of job in Flink UI/API

2021-09-08 Thread Piotr Nowojski
Hi Peter, Can you provide relevant JobManager logs? And can you write down what steps have you taken before the failure happened? Did this failure occur during upgrading Flink, or after the upgrade etc. Best, Piotrek śr., 8 wrz 2021 o 16:11 Peter Westermann napisał(a): > We recently upgraded

Duplicate copies of job in Flink UI/API

2021-09-08 Thread Peter Westermann
We recently upgraded from Flink 1.12.4 to 1.12.5 and are seeing some weird behavior after a change in jobmanager leadership: We’re seeing two copies of the same job, one of those is in SUSPENDED state and has a start time of zero. Here’s the output from the /jobs/overview endpoint: { "jobs":