[ https://issues.apache.org/jira/browse/SPARK-16611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15383206#comment-15383206 ]
Weiqiang Zhuang commented on SPARK-16611: ----------------------------------------- To answer @shivaram's question: we are calling lapply function to transform the dataset so that the system ml can run algorithms on it. The lapply accepts RDD. Hence the requirement for the exposure of these APIs and data types. We will investigate whether the dapply and gapply will work for the same purpose. > Expose several hidden DataFrame/RDD functions > --------------------------------------------- > > Key: SPARK-16611 > URL: https://issues.apache.org/jira/browse/SPARK-16611 > Project: Spark > Issue Type: Improvement > Components: SparkR > Reporter: Oscar D. Lara Yejas > > Expose the following functions: > - lapply or map > - lapplyPartition or mapPartition > - flatMap > - RDD > - toRDD > - getJRDD > - cleanup.jobj > cc: > [~javierluraschi] [~j...@rstudio.com] [~shivaram] -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org