[ https://issues.apache.org/jira/browse/SPARK-10971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14951696#comment-14951696 ]
Sun Rui commented on SPARK-10971: --------------------------------- I agree that it is more flexible to allow configuration of location of RScript on both client and cluster modes. But I am not sure if it makes sense to distribute R itself onto worker nodes for jobs instead of have it installed on worker nodes, as R binary is platform specific (also may require platform specific installation steps), as well as performance cost of shipping R binaries. > sparkR: RRunner should allow setting path to Rscript > ---------------------------------------------------- > > Key: SPARK-10971 > URL: https://issues.apache.org/jira/browse/SPARK-10971 > Project: Spark > Issue Type: Bug > Components: SparkR > Affects Versions: 1.5.1 > Reporter: Thomas Graves > > I'm running spark on yarn and trying to use R in cluster mode. RRunner seems > to just call Rscript and assumes its in the path. But on our YARN deployment > R isn't installed on the nodes so it needs to be distributed along with the > job and we need the ability to point to where it gets installed. sparkR in > client mode has the config spark.sparkr.r.command to point to Rscript. > RRunner should have something similar so it works in cluster mode -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org