[ 
https://issues.apache.org/jira/browse/SPARK-10971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14951696#comment-14951696
 ] 

Sun Rui commented on SPARK-10971:
---------------------------------

I agree that it is more flexible to allow configuring the location of Rscript 
in both client and cluster modes. But I am not sure it makes sense to 
distribute R itself to the worker nodes with each job instead of having it 
installed on them: the R binary is platform-specific (and may require 
platform-specific installation steps), and shipping R binaries has a 
performance cost.

> sparkR: RRunner should allow setting path to Rscript
> ----------------------------------------------------
>
>                 Key: SPARK-10971
>                 URL: https://issues.apache.org/jira/browse/SPARK-10971
>             Project: Spark
>          Issue Type: Bug
>          Components: SparkR
>    Affects Versions: 1.5.1
>            Reporter: Thomas Graves
>
> I'm running Spark on YARN and trying to use R in cluster mode. RRunner seems 
> to just call Rscript and assumes it's on the path. But on our YARN deployment 
> R isn't installed on the nodes, so it needs to be distributed along with the 
> job, and we need the ability to point to where it gets installed. SparkR in 
> client mode has the config spark.sparkr.r.command to point to Rscript. 
> RRunner should have something similar so it works in cluster mode.
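
For illustration only, the behavior requested above could look roughly like
the sketch below: read the Rscript location from configuration and fall back
to the bare command name (resolved through the path) when nothing is set.
This is not Spark's actual code. The helper names resolve_r_command and
build_r_argv are hypothetical, the plain dict stands in for the Spark
configuration, and reusing the client-mode key spark.sparkr.r.command for
cluster mode is an assumption.

```python
from typing import Mapping, Sequence

# Fallback used when no path is configured; relies on Rscript being on the path.
DEFAULT_R_COMMAND = "Rscript"

# Key mentioned in the issue for client mode. Reusing it here is an assumption.
R_COMMAND_KEY = "spark.sparkr.r.command"


def resolve_r_command(conf: Mapping[str, str], key: str = R_COMMAND_KEY) -> str:
    """Return the configured Rscript path, or the default if unset or blank."""
    value = conf.get(key, "").strip()
    return value or DEFAULT_R_COMMAND


def build_r_argv(conf: Mapping[str, str], script: str,
                 args: Sequence[str] = ()) -> list:
    """Build the argument vector for launching an R script (hypothetical helper)."""
    return [resolve_r_command(conf), script, *args]


if __name__ == "__main__":
    # No setting: falls back to whatever "Rscript" resolves to on the path.
    print(build_r_argv({}, "job.R"))
    # Setting present: uses the location where R was shipped or installed.
    print(build_r_argv({R_COMMAND_KEY: "./R/bin/Rscript"}, "job.R", ["arg1"]))
```

With a lookup like this, a deployment that ships R with the job only has to
set one configuration value, while clusters with R preinstalled need no
change.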



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

