[ https://issues.apache.org/jira/browse/SPARK-19773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15890740#comment-15890740 ]
Felix Cheung commented on SPARK-19773: -------------------------------------- Let's close this unless you want to look into getting mutate to support duplicated columns? ;) > SparkDataFrame should not allow duplicate names > ----------------------------------------------- > > Key: SPARK-19773 > URL: https://issues.apache.org/jira/browse/SPARK-19773 > Project: Spark > Issue Type: Bug > Components: SparkR > Affects Versions: 2.1.0 > Reporter: Wayne Zhang > Priority: Minor > > SparkDataFrame in SparkR seems to accept duplicate names at creation, but > incurs error when calling methods downstream. For example, we can do: > {{{code}}} > l <- list(list(1, 2), list(3, 4)) > df <- createDataFrame(l, c("a", "a")) > head(df) > {{{code}}} > But an error occurs when we do df$a = df$a * 2.0. > I suggest we add validity check for duplicate names at initialization. -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org