Hi,

I'm running a simple, "embarassingly parallel" ETL-type job. I noticed that
a failure in one subtask causes the entire job to restart. Even with the
region failover strategy, all subtasks of this task and connected ones
would fail. Is there any way to limit restarting to only the single subtask
that failed, so all other subtasks can stay alive and keep working?

For context, I use Flink 1.11 in AWS Kinesis Data Analytics, so some
configuration is not controlled by me
<https://docs.aws.amazon.com/kinesisanalytics/latest/java/reference-flink-settings.title.html>
.

Thanks

Reply via email to