[ https://issues.apache.org/jira/browse/SPARK-8596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14604440#comment-14604440 ]
Shivaram Venkataraman commented on SPARK-8596:
----------------------------------------------

Thanks! These are very useful instructions. We can break up this JIRA into a bunch of smaller issues.

1. Opening the RStudio port in the EC2 cluster. For this we need to add the right port number to the Spark EC2 script at https://github.com/apache/spark/blob/0b5abbf5f96a5f6bfd15a65e8788cf3fa96fe54c/ec2/spark_ec2.py#L507. This should be a pretty simple change. Would you like to open a PR for this?

2. We need to add code to install RStudio and add a new user (let's say username rstudio, password rstudio). To do this we will need to modify scripts in the spark-ec2 repo at https://github.com/mesos/spark-ec2. At a high level, these scripts are run on the master node after the cluster is launched, and they install Spark, Hadoop, etc. on the AMI. So we can just add a new module to spark-ec2 called rstudio, and then in rstudio/setup.sh we can add code to set up the new users etc. as well.

Let me know if you want to take a shot at the second one as well.

> Install and configure RStudio server on Spark EC2
> -------------------------------------------------
>
>                 Key: SPARK-8596
>                 URL: https://issues.apache.org/jira/browse/SPARK-8596
>             Project: Spark
>          Issue Type: Improvement
>          Components: EC2, SparkR
>            Reporter: Shivaram Venkataraman
>
> This will make it convenient for R users to use SparkR from their browsers
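
For item 1 in the comment, the change might look roughly like the sketch below. It assumes the EC2 script opens ports with boto-style calls of the form group.authorize(protocol, from_port, to_port, cidr), and that RStudio Server listens on its default port 8787. The names RSTUDIO_PORT, open_rstudio_port, and RecordingGroup are hypothetical and exist only for illustration; RecordingGroup is a stand-in so the snippet runs without AWS credentials.

```python
# Sketch only, not the actual patch. RSTUDIO_PORT, open_rstudio_port and
# RecordingGroup are hypothetical names introduced for this example.

RSTUDIO_PORT = 8787  # RStudio Server's default listening port


class RecordingGroup:
    """Stand-in for a boto security group; it only records requested rules."""

    def __init__(self):
        self.rules = []

    def authorize(self, ip_protocol, from_port, to_port, cidr_ip):
        # A real security group would call the EC2 API here.
        self.rules.append((ip_protocol, from_port, to_port, cidr_ip))
        return True


def open_rstudio_port(master_group, authorized_address="0.0.0.0/0"):
    """Allow inbound TCP traffic to the RStudio port on the master group."""
    return master_group.authorize(
        "tcp", RSTUDIO_PORT, RSTUDIO_PORT, authorized_address
    )


if __name__ == "__main__":
    group = RecordingGroup()
    open_rstudio_port(group)
    print(group.rules)
```

In the real script, the master security group object would be passed in place of RecordingGroup, next to the existing rules for the other master ports.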