GitHub user zjffdu opened a pull request:

    https://github.com/apache/zeppelin/pull/2577

    ZEPPELIN-2898. Support Yarn-Cluster for Spark Interpreter

    ### What is this PR for?
    This is the first version for supporting yarn-cluster of 
`SparkInterpreter`.   I just delegate all the function to `spark-submit` as 
yarn-cluster is natively supported by spark, we don't need to reinvent the 
wheel. But there's still improvement to be done in future, such as I put some 
spark specific login in `InterpreterSetting` which is not a good practise.  I 
plan to improve it when I refactor the `Interpreter` class (ZEPPELIN-2685). 
    
    Besides that, I also add `MiniHadoopCluster` & `MiniZeppelin` which help 
for the integration test of yarn-client & yarn-cluster mode, otherwise I have 
to manually verify yarn-client & yarn-cluster mode which would easily cause 
regression issue in future.
    
    ### What type of PR is it?
    [Feature]
    
    ### Todos
    * [ ] - Task
    
    ### What is the Jira issue?
    https://github.com/zjffdu/zeppelin/tree/ZEPPELIN-2898
    
    
    ### How should this be tested?
    System test is added in `SparkInterpreterIT`. 
    
    ### Questions:
    * Does the licenses files need update?  No
    * Is there breaking changes for older versions? No
    * Does this needs documentation? No


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/zjffdu/zeppelin ZEPPELIN-2898

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/zeppelin/pull/2577.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #2577
    
----
commit 9ec46d931149eddc0a227cec00e1bee76180176d
Author: Jeff Zhang <zjf...@apache.org>
Date:   2017-09-04T13:54:56Z

    ZEPPELIN-2898. Support Yarn-Cluster for Spark Interpreter

----


---

Reply via email to