[jira] [Commented] (SPARK-12144) Implement DataFrameReader and DataFrameWriter API in SparkR
[ https://issues.apache.org/jira/browse/SPARK-12144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15046376#comment-15046376 ] Sun Rui commented on SPARK-12144: - [~yanboliang] go ahead. Could you please change the title of this JIRA according to the discussion, and summarize what to be done in the description of this JIRA (for example, add read.parquet(), ...)? If needed, you can break this one into a number of sub-tasks. > Implement DataFrameReader and DataFrameWriter API in SparkR > --- > > Key: SPARK-12144 > URL: https://issues.apache.org/jira/browse/SPARK-12144 > Project: Spark > Issue Type: New Feature > Components: SparkR >Affects Versions: 1.5.2 >Reporter: Sun Rui > > DataFrameReader API: > http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.DataFrameReader > DataFrameWriter API: > http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.DataFrameWriter -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-12144) Implement DataFrameReader and DataFrameWriter API in SparkR
[ https://issues.apache.org/jira/browse/SPARK-12144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15046311#comment-15046311 ] Yanbo Liang commented on SPARK-12144: - I'm working on SPARK-12146 which is similar with this one. I can also take this if no one worked on it. > Implement DataFrameReader and DataFrameWriter API in SparkR > --- > > Key: SPARK-12144 > URL: https://issues.apache.org/jira/browse/SPARK-12144 > Project: Spark > Issue Type: New Feature > Components: SparkR >Affects Versions: 1.5.2 >Reporter: Sun Rui > > DataFrameReader API: > http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.DataFrameReader > DataFrameWriter API: > http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.DataFrameWriter -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-12144) Implement DataFrameReader and DataFrameWriter API in SparkR
[ https://issues.apache.org/jira/browse/SPARK-12144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15042103#comment-15042103 ] Felix Cheung commented on SPARK-12144: -- +1 [~shivaram] The style {code} read.format("json").option(options).load("people.json") {code} doesn't really fit R instead, why don't we have {code} read(format, options_named_list, ...) {code} in fact we could leave/add convenience functions like {code} read.csv(options) read.json(options) read.parquet(options) {code} > Implement DataFrameReader and DataFrameWriter API in SparkR > --- > > Key: SPARK-12144 > URL: https://issues.apache.org/jira/browse/SPARK-12144 > Project: Spark > Issue Type: New Feature > Components: SparkR >Affects Versions: 1.5.2 >Reporter: Sun Rui > > DataFrameReader API: > http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.DataFrameReader > DataFrameWriter API: > http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.DataFrameWriter -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-12144) Implement DataFrameReader and DataFrameWriter API in SparkR
[ https://issues.apache.org/jira/browse/SPARK-12144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15041177#comment-15041177 ] Sun Rui commented on SPARK-12144: - @shivaram, your opinion is reasonable. It seems that read.df and write.df does not cover all functionalities exposed by DataFrameReader and DataFrameWriter. Maybe we don't need to provide these two API, but provide more wrapping functions in SparkR. > Implement DataFrameReader and DataFrameWriter API in SparkR > --- > > Key: SPARK-12144 > URL: https://issues.apache.org/jira/browse/SPARK-12144 > Project: Spark > Issue Type: New Feature > Components: SparkR >Affects Versions: 1.5.2 >Reporter: Sun Rui > > DataFrameReader API: > http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.DataFrameReader > DataFrameWriter API: > http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.DataFrameWriter -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-12144) Implement DataFrameReader and DataFrameWriter API in SparkR
[ https://issues.apache.org/jira/browse/SPARK-12144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15040773#comment-15040773 ] Shivaram Venkataraman commented on SPARK-12144: --- Personally I don't think this API is a good fit for SparkR -- This introduces a lot of chaining based methods which are awkward to write in R without using something like magrittr. I think the existing `read.df` and having `option = value` is more user-friendly. Is there any functionality we will gain from this ? > Implement DataFrameReader and DataFrameWriter API in SparkR > --- > > Key: SPARK-12144 > URL: https://issues.apache.org/jira/browse/SPARK-12144 > Project: Spark > Issue Type: New Feature > Components: SparkR >Affects Versions: 1.5.2 >Reporter: Sun Rui > > DataFrameReader API: > http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.DataFrameReader > DataFrameWriter API: > http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.DataFrameWriter -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org