Xinyi Yu created SPARK-49454:
--------------------------------
Summary: Avoid double normalization in the cache process
Key: SPARK-49454
URL: https://issues.apache.org/jira/browse/SPARK-49454
Project: Spark
Issue Type: Bug
Components: SQL
Affects Versions: 4.0.0
Reporter: Xinyi Yu
There is an issue introduced in
[#46465|https://github.com/apache/spark/pull/46465], which is that
normalization is applied twice during the cache process. Some normalization
rules may not be idempotent, so applying them repeatedly may break the plan
shape and cause an unexpected cache miss.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]