Wayne Zhang created SPARK-19773: ----------------------------------- Summary: SparkDataFrame should not allow duplicate names Key: SPARK-19773 URL: https://issues.apache.org/jira/browse/SPARK-19773 Project: Spark Issue Type: Bug Components: SparkR Affects Versions: 2.1.0 Reporter: Wayne Zhang Priority: Minor
SparkDataFrame in SparkR seems to accept duplicate names at creation, but incurs error when calling methods downstream. For example, we can do: {{{code}}} l <- list(list(1, 2), list(3, 4)) df <- createDataFrame(l, c("a", "a")) head(df) {{{code}}} But an error occurs when we do df$a = df$a * 2.0. I suggest we add validity check for duplicate names at initialization. -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org