yaooqinn commented on PR #43746:
URL: https://github.com/apache/spark/pull/43746#issuecomment-1813660864

   > What do you mean by this, are you saying the Spark on YARN handling of 
preempted containers is not working properly? Meaning if the container is 
preempted it should not show up as an executor failure. Are you seeing those 
preempted containers show up as failed?
   Or are you saying that yes Spark on YARN doesn't mark preempted as failed?
   
   PREEMPTED is ok, and its cases are not counted by executor failure tracker, 
I was wrong about this, sorry to bother.
   
   > If that is the case then Spark should allow users to turn 
spark.executor.maxNumFailures off or I assume you could do the same thing by 
setting it to int.maxvalue.
   
   There are pros and cons to this suggestion, I guess. Disabling the executor 
failure tracker certainly keeps the app alive, but at the same time invalidates 
fast fail.
   
   > As implemented this seems very arbitrary and I would think hard for a 
normal user to set and use this feature.
   
   Most of configurations with numeric value and the defaults in spark are 
arbitrary?
   
   
   > I don't understand why this isn't the same as minimum number of executors 
as that seems more in line - saying you need some minimum number for this 
application to run and by the way its ok to keep running with this is launching 
new executors is failing.
   
   minimum number of executors can be 0
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to