[ 
https://issues.apache.org/jira/browse/SPARK-8596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14604440#comment-14604440
 ] 

Shivaram Venkataraman commented on SPARK-8596:
----------------------------------------------

Thanks ! These are very useful instructions. We can break up this jira into a 
bunch of smaller issues.

1. Opening the RStudio port in the EC2 cluster. For this we need to add the 
right port number to the Spark EC2 script at 
https://github.com/apache/spark/blob/0b5abbf5f96a5f6bfd15a65e8788cf3fa96fe54c/ec2/spark_ec2.py#L507.This
 should be a pretty simple change -- Would you like to open a PR for this ? 

2. We need to add code to install rstudio, add a new user (lets say username 
rstudio, password rstudio) -- To do this we will need to modify scripts in the 
spark-ec2 repo 
at https://github.com/mesos/spark-ec2. At a high-level these scripts are run on 
the master node after the cluster is launched and these scripts install Spark, 
Hadoop etc. on the AMI. So we can just add a new module to spark-ec2 called 
rstudio and then in rstudio/setup.sh we can add code to setup the new users 
etc. as well. 

Let me know if you want to take a shot at the second one as well


> Install and configure RStudio server on Spark EC2
> -------------------------------------------------
>
>                 Key: SPARK-8596
>                 URL: https://issues.apache.org/jira/browse/SPARK-8596
>             Project: Spark
>          Issue Type: Improvement
>          Components: EC2, SparkR
>            Reporter: Shivaram Venkataraman
>
> This will make it convenient for R users to use SparkR from their browsers 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to