Xinyi Yu created SPARK-49454:
--------------------------------

             Summary: Avoid double normalization in the cache process
                 Key: SPARK-49454
                 URL: https://issues.apache.org/jira/browse/SPARK-49454
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 4.0.0
            Reporter: Xinyi Yu


There is an issue introduced in 
[#46465|https://github.com/apache/spark/pull/46465], which is that 
normalization is applied twice during the cache process. Some normalization 
rules may not be idempotent, so applying them repeatedly may break the plan 
shape and cause an unexpected cache miss.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to