Let's suppose we're dealing with a non-secured (i.e. not Kerberized) YARN cluster. When I invoke spark-submit, is there a practical difference between specifying --proxy-user=foo (supposing impersonation is properly set up) or setting the environment variable HADOOP_USER_NAME=foo? Thanks for any insights or links to docs.
--------------------------------------------------------------------- To unsubscribe e-mail: user-unsubscr...@spark.apache.org