[ 
https://issues.apache.org/jira/browse/SPARK-8596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14611832#comment-14611832
 ] 

Vincent Warmerdam edited comment on SPARK-8596 at 7/2/15 11:55 AM:
-------------------------------------------------------------------

By the way, I now have scripts that install RStudio (I just ran them and 
confirmed that they work). 

The code is here: 

https://github.com/koaning/spark-ec2/tree/rstudio-install (added rstudio as a 
module) 
https://github.com/koaning/spark/tree/rstudio-install

When initializing with this command: 

./spark-ec2 --key-pair=spark-df \
  --identity-file=/Users/code/Downloads/spark-df.pem \
  --region=eu-west-1 -s 1 \
  --instance-type=c3.2xlarge \
  --spark-ec2-git-repo=https://github.com/koaning/spark-ec2 \
  --spark-ec2-git-branch=rstudio-install \
  launch mysparkr

I can confirm that RStudio is installed and that a correct user is added. 
There are two concerns:

- Should we not require the user to supply the password themselves? Setting a 
standard password seems like a security vulnerability. 
- I am not sure whether this gets installed on all the slave nodes. I added 
this module 
(https://github.com/koaning/spark-ec2/blob/rstudio-install/rstudio/init.sh) and 
we only need it on the master node. I wonder what the best way is to ensure 
that it runs only there.
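
One possible way to address both concerns is sketched below. This is not what 
the linked init.sh does; it is a minimal illustration, and every name in it 
(require_password, is_master, rstudio_install_decision) is hypothetical. The 
hostnames and the password are passed in as arguments, so the sketch makes no 
assumption about where spark-ec2 records the master or how a password option 
would be exposed.

```shell
#!/usr/bin/env bash
# Hypothetical helpers for a module script such as rstudio/init.sh.
# None of these functions exist in spark-ec2; they only illustrate the idea.

# Succeed only when the caller supplied a non-empty password.
require_password() {
  local password="${1:-}"
  if [ -z "$password" ]; then
    echo "error: no RStudio password supplied; refusing to set a default" >&2
    return 1
  fi
  return 0
}

# Succeed only when this host is the master.
# Both hostnames are arguments; how the master hostname is obtained is left open.
is_master() {
  local this_host="$1" master_host="$2"
  [ -n "$master_host" ] && [ "$this_host" = "$master_host" ]
}

# Print "install" on the master when a password is given, "skip" on any other
# node, and fail (non-zero status) on the master when the password is missing.
rstudio_install_decision() {
  local this_host="$1" master_host="$2" password="${3:-}"
  if ! is_master "$this_host" "$master_host"; then
    echo "skip"
    return 0
  fi
  require_password "$password" || return 1
  echo "install"
}
```

A module script could call rstudio_install_decision with the local hostname, 
the master hostname, and a user-supplied password, and run the actual RStudio 
installation only when the result is "install". With this approach a missing 
password stops the setup instead of silently falling back to a default.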


was (Author: cantdutchthis):
By the way, I now have scripts that install RStudio (I just ran them and 
confirmed that they work). 

The code is here: 

https://github.com/koaning/spark-ec2/tree/rstudio-install
https://github.com/koaning/spark/tree/rstudio-install

When initializing with this command: 

./spark-ec2 --key-pair=spark-df \
  --identity-file=/Users/code/Downloads/spark-df.pem \
  --region=eu-west-1 -s 1 \
  --instance-type=c3.2xlarge \
  --spark-ec2-git-repo=https://github.com/koaning/spark-ec2 \
  --spark-ec2-git-branch=rstudio-install \
  launch mysparkr

I can confirm that RStudio is installed and that a correct user is added. 
There are two concerns:

- Should we not require the user to supply the password themselves? Setting a 
standard password seems like a security vulnerability. 
- I am not sure whether this gets installed on all the slave nodes. I added 
this module 
(https://github.com/koaning/spark-ec2/blob/rstudio-install/rstudio/init.sh) and 
we only need it on the master node. I wonder what the best way is to ensure 
that it runs only there.

> Install and configure RStudio server on Spark EC2
> -------------------------------------------------
>
>                 Key: SPARK-8596
>                 URL: https://issues.apache.org/jira/browse/SPARK-8596
>             Project: Spark
>          Issue Type: Improvement
>          Components: EC2, SparkR
>            Reporter: Shivaram Venkataraman
>
> This will make it convenient for R users to use SparkR from their browsers.



