yamanda created FLINK-34567: ------------------------------- Summary: flink task manager error occur, msg: Encountered error while consuming partitions Key: FLINK-34567 URL: https://issues.apache.org/jira/browse/FLINK-34567 Project: Flink Issue Type: Bug Affects Versions: 1.16.2 Reporter: yamanda Attachments: pic1.jpg
I deploy flink cluster and it run normally about 2 months, but recently i meet a problem. I see some sub tasks back pressure is high and the flink job is totally blocked(in pic1.jpg), these sub tasks are all in one task manager. so i stop the abnormal task manager and deploy flink job again, the problem is solved. I find some error log in the abnormal task manager: 2024-03-03 15:57:25,088 ERROR org.apache.flink.runtime.io.network.netty.PartitionRequestQueue [] - Encountered error while consuming partitions org.apache.flink.shaded.netty4.io.netty.channel.unix.Errors$NativeIoException: readAddress(..) failed: Connection timed out What does it mean? What should i do to solve this problem completely? -- This message was sent by Atlassian Jira (v8.20.10#820010)