Ruifeng Zheng created SPARK-54691:
-------------------------------------

             Summary: DataFrame Column Resolution conflicts with delta
                 Key: SPARK-54691
                 URL: https://issues.apache.org/jira/browse/SPARK-54691
             Project: Spark
          Issue Type: Bug
          Components: Connect, SQL
    Affects Versions: 4.1.0, 4.2.0
            Reporter: Ruifeng Zheng


some delta queries fail with 4.1+

```
df = spark.read.option("readChangeFeed", True).option("startingVersion", 
0).table("sample_table")
 
df.select('_commit_version').show() <- expected output
 
df.select(df._commit_version).show() <- fail with 
[CANNOT_RESOLVE_DATAFRAME_COLUMN] Cannot resolve dataframe column 
"_commit_version". It's probably because of illegal references like 
`df1.select(df2.col("a"))`. SQLSTATE: 42704
```

 

It seems due to the lazy dataframe column resolution conflicts with delta rules 
and cause rule order dependency issue.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to