Hello team,

I am aware that in case of memory issues when a task fails, it will try to
restart 4 times since it is a default number and if it still fails then it
will cause the entire job to fail.

But suppose if I am reading a file that is distributed across nodes in
partitions. So, what will happen if a partition fails that holds some data?
Will it re-read the entire file and get that specific subset of data since
the driver has the complete information? or will it copy the data to the
other working nodes or tasks and try to run it?

Reply via email to