[ https://issues.apache.org/jira/browse/SPARK-19773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15888827#comment-15888827 ]
Apache Spark commented on SPARK-19773: -------------------------------------- User 'actuaryzhang' has created a pull request for this issue: https://github.com/apache/spark/pull/17105 > SparkDataFrame should not allow duplicate names > ----------------------------------------------- > > Key: SPARK-19773 > URL: https://issues.apache.org/jira/browse/SPARK-19773 > Project: Spark > Issue Type: Bug > Components: SparkR > Affects Versions: 2.1.0 > Reporter: Wayne Zhang > Priority: Minor > > SparkDataFrame in SparkR seems to accept duplicate names at creation, but > incurs error when calling methods downstream. For example, we can do: > {{{code}}} > l <- list(list(1, 2), list(3, 4)) > df <- createDataFrame(l, c("a", "a")) > head(df) > {{{code}}} > But an error occurs when we do df$a = df$a * 2.0. > I suggest we add validity check for duplicate names at initialization. -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org