Wayne Zhang created SPARK-19773:
-----------------------------------

             Summary: SparkDataFrame should not allow duplicate names
                 Key: SPARK-19773
                 URL: https://issues.apache.org/jira/browse/SPARK-19773
             Project: Spark
          Issue Type: Bug
          Components: SparkR
    Affects Versions: 2.1.0
            Reporter: Wayne Zhang
            Priority: Minor


SparkDataFrame in SparkR seems to accept duplicate names at creation, but 
incurs error when calling methods downstream. For example, we can do: 
{{{code}}}
l <- list(list(1, 2), list(3, 4))
df <- createDataFrame(l, c("a", "a"))
head(df)
{{{code}}}
But an error occurs when we do df$a = df$a * 2.0. 

I suggest we add validity check for duplicate names at initialization.  




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to