GitHub user pwendell commented on the pull request: https://github.com/apache/spark/pull/1486#issuecomment-56484097

@mridulm the intent here was to avoid introducing a generalized/public API for these and instead do something simple/internal for the case of Hadoop RDD. The underscore is not a valid character in a hostname, so we can use it safely and continue to support it going forward at low cost. This just piggy-backs on the existing support we already have for in-memory input blocks.

I'd like to see us add a complete, publicly documented interface for specifying task locality levels, as you suggested, and support them in a general way in the TaskSetManager. URIs could be good for this, or some other structured format. But that is a much more complicated proposition, and one that requires some design discussion. The purpose of this patch is to do something more surgical in the short term.
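
To illustrate the underscore argument, here is a minimal sketch of how a location string could carry an "in-memory" marker without colliding with real hostnames. It is not the code from the patch: the prefix `cached_` and the names `Location`, `encode`, and `decode` are hypothetical and chosen only for this example.

```python
from dataclasses import dataclass

# Hypothetical marker (an assumption, not taken from the patch). Because it
# contains "_", which is not valid in a hostname, no real host can start with it.
CACHED_PREFIX = "cached_"


@dataclass(frozen=True)
class Location:
    host: str     # plain hostname with any marker stripped
    cached: bool  # True if the block is reported as held in memory on that host


def encode(host: str, cached: bool = False) -> str:
    """Render a preferred location as a single string."""
    return CACHED_PREFIX + host if cached else host


def decode(raw: str) -> Location:
    """Parse a location string back into its host and cached flag."""
    if raw.startswith(CACHED_PREFIX):
        return Location(host=raw[len(CACHED_PREFIX):], cached=True)
    return Location(host=raw, cached=False)
```

Plain hostnames pass through unchanged, so existing callers that only ever see ordinary host strings would be unaffected by a scheme like this.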