GitHub user zjffdu opened a pull request: https://github.com/apache/zeppelin/pull/2577
ZEPPELIN-2898. Support Yarn-Cluster for Spark Interpreter ### What is this PR for? This is the first version for supporting yarn-cluster of `SparkInterpreter`. I just delegate all the function to `spark-submit` as yarn-cluster is natively supported by spark, we don't need to reinvent the wheel. But there's still improvement to be done in future, such as I put some spark specific login in `InterpreterSetting` which is not a good practise. I plan to improve it when I refactor the `Interpreter` class (ZEPPELIN-2685). Besides that, I also add `MiniHadoopCluster` & `MiniZeppelin` which help for the integration test of yarn-client & yarn-cluster mode, otherwise I have to manually verify yarn-client & yarn-cluster mode which would easily cause regression issue in future. ### What type of PR is it? [Feature] ### Todos * [ ] - Task ### What is the Jira issue? https://github.com/zjffdu/zeppelin/tree/ZEPPELIN-2898 ### How should this be tested? System test is added in `SparkInterpreterIT`. ### Questions: * Does the licenses files need update? No * Is there breaking changes for older versions? No * Does this needs documentation? No You can merge this pull request into a Git repository by running: $ git pull https://github.com/zjffdu/zeppelin ZEPPELIN-2898 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/zeppelin/pull/2577.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #2577 ---- commit 9ec46d931149eddc0a227cec00e1bee76180176d Author: Jeff Zhang <zjf...@apache.org> Date: 2017-09-04T13:54:56Z ZEPPELIN-2898. Support Yarn-Cluster for Spark Interpreter ---- ---