Re: [External Sender] Re: ERROR org.apache.flink.runtime.io.network.netty.PartitionRequestQueue

2020-12-09 Thread Piotr Nowojski
tart / crash) >>>> 5. Network problems >>>> >>>> Piotrek >>>> >>>> On Mon, 7 Dec 2020 at 23:31, Kye Bae wrote: >>>> >>>>> I forgot to mention: this is Flink 1.10. >>>>> >>>>>

Re: [External Sender] Re: ERROR org.apache.flink.runtime.io.network.netty.PartitionRequestQueue

2020-12-08 Thread Piotr Nowojski
> Then, we began to get the exception below from taskmanagers (random) >>>> since yesterday, and the job began to fail/restart every hour or so. >>>> >>>> The job does recover after each restart, but sometimes it takes more >>>> time to recover than a

Re: [External Sender] Re: ERROR org.apache.flink.runtime.io.network.netty.PartitionRequestQueue

2020-12-08 Thread Kye Bae
allowed in our environment. On a few occasions, it >>> took more than a few restarts to fully recover. >>> >>> Can you provide some insight into what this error means and also what we >>> can do to prevent this in future? >>

Re: ERROR org.apache.flink.runtime.io.network.netty.PartitionRequestQueue

2020-12-08 Thread Piotr Nowojski
>> >> Can you provide some insight into what this error means and also what we >> can do to prevent this in future? >> >> Thank you! >> >> +++ >> ERROR org.apache.flink.runtime.io.network.netty.PartitionRequestQueue - >> Encountered

Re: ERROR org.apache.flink.runtime.io.network.netty.PartitionRequestQueue

2020-12-07 Thread Kye Bae
some insight into what this error means and also what we > can do to prevent this in future? > > Thank you! > > +++ > ERROR org.apache.flink.runtime.io.network.netty.PartitionRequestQueue - > Encountered error while consuming partitions > java.io.IOException: Connection reset

ERROR org.apache.flink.runtime.io.network.netty.PartitionRequestQueue

2020-12-07 Thread Kye Bae
time to recover than allowed in our environment. On a few occasions, it took more than a few restarts to fully recover. Can you provide some insight into what this error means and also what we can do to prevent this in future? Thank you! +++ ERROR