yangwwei commented on pull request #34672:
URL: https://github.com/apache/spark/pull/34672#issuecomment-983067546


   > so I'm a bit on the fence about this. I hesitate to change this API and 
cause folks to recompile without really more investigation to properly make 
shuffle pluggable. The shuffleManager here is private[spark] so the fact we are 
adding a function for 3rd party libraries doesn't really match well with that 
by itself. Ideally we really make api's pluggable and classes developers apis 
for 3rd parties but I also realize that is a much larger task to find how to do 
that properly. I just hate to change the api multiple times if we are planning 
on doing more work there. At the same time, I think other attempts at that have 
stalled, so open here to thoughts from others?
   
   Thanks for sharing your thoughts @tgravescs . By far, there are several 3rd 
shuffle service implementations out there, and it seems the existing APIs are 
pluggable enough. Well, I may miss some facts, but at least we are pretty happy 
to run with Uber's RSS with minimal configs. This is the only issue we found 
that we cannot bypass, we need to have some changes in the Spark in order not 
to "always" register with the local shuffle service. This approach is the 
simplest and safest solution I can think of. I would be loving to hear thoughts 
from others as well.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to