Flink 1.13.1 PartitionNotFoundException

2021-07-14 Thread Debraj Manna
Hi, I am observing that my Flink jobs are failing with the error below: 2021-07-14T12:07:00.918Z INFO runtime.executiongraph.Execution flink-akka.actor.default-dispatcher-29 transitionState:1446 MetricAggregateFunction -> (Sink: LateMetricSink10, Sink: TSDBSink9) (12/30) (3489393394c13fd1ad85136e11d67deb

Re: Flink 1.13.1 PartitionNotFoundException

2021-07-14 Thread Timo Walther
Hi Debraj, I could find quite a few older emails that suggested experimenting with the `taskmanager.network.request-backoff.max` option. This was also recommended in the link that you shared. Have you tried it? Here is some background: http://deprecated-apache-flink-user-mailing-list-a

Re: Flink 1.13.1 PartitionNotFoundException

2021-07-14 Thread Debraj Manna
Yes, I forgot to mention this in my first email. I have tried increasing taskmanager.network.request-backoff.max to 3 in flink-conf.yaml, but I am getting the same error. On Wed, Jul 14, 2021 at 7:10 PM Timo Walther wrote: > Hi Debraj, > > I could find quite a few older emails that were suggesting

Re: Flink 1.13.1 PartitionNotFoundException

2021-07-14 Thread Debraj Manna
I have increased it to 9 and it seems to be running fine. If I still see the failure when I add some load, I will post back in this thread. On Wed, Jul 14, 2021 at 7:19 PM Debraj Manna wrote: > Yes I forgot to mention in my first email. I have tried increasing > taskmanager.network.request-back
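
Editorial note: the thread turns on why raising `taskmanager.network.request-backoff.max` can make a PartitionNotFoundException go away. The sketch below is a simplified model, not Flink's actual code. It assumes that a partition request is retried with a wait that starts at an initial value, doubles after each failed attempt, is capped at the configured maximum, and is abandoned once the cap has been used. The function names `backoff_schedule` and `total_wait_ms` are hypothetical, and the numbers in the usage lines are illustrative placeholders rather than the values used in the thread.

```python
def backoff_schedule(initial_ms, max_ms):
    """Return the list of waits (in ms) under a capped doubling backoff.

    Assumed model (not taken from the thread): the first wait is
    initial_ms, each later wait doubles the previous one, waits are
    capped at max_ms, and no further retry happens after the cap.
    """
    if initial_ms <= 0 or max_ms < initial_ms:
        raise ValueError("need 0 < initial_ms <= max_ms")
    waits = [initial_ms]
    current = initial_ms
    while current < max_ms:
        current = min(current * 2, max_ms)  # double, but never exceed the cap
        waits.append(current)
    return waits


def total_wait_ms(initial_ms, max_ms):
    """Total time a consumer would keep retrying before giving up."""
    return sum(backoff_schedule(initial_ms, max_ms))


if __name__ == "__main__":
    # Illustrative values only: compare a smaller and a larger cap.
    for cap in (10_000, 30_000):
        print(cap, backoff_schedule(100, cap), total_wait_ms(100, cap))
```

Under this model, a larger cap adds both more retries and longer waits, so a producer task that is slow to register its partition has more time to do so before the consumer fails.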