Hi,

In case of a node failure, what does "Fault tolerance for programs in the DataSet API works by retrying failed executions" [1] mean?
- Work already done by the rest of the nodes is not lost; only the work of the lost node is recomputed, and job execution continues, or
- the entire job execution is retried?

[1] https://ci.apache.org/projects/flink/flink-docs-master/apis/batch/fault_tolerance.html

Best,
Ovidiu