andygrove commented on code in PR #4374:
URL: https://github.com/apache/datafusion-comet/pull/4374#discussion_r3284854997


##########
spark/src/test/resources/tpcds-plan-stability/approved-plans-v1_4-spark4_0/q83.ansi/extended.txt:
##########
@@ -101,4 +127,4 @@ CometNativeColumnarToRow
                                                    +- CometFilter
                                                       +- CometNativeScan parquet spark_catalog.default.date_dim
 

Review Comment:
   I tried the narrower form (`originalPlan.canonicalized.output` in the 
canonical key, plus `mode.canonicalized`, with `equals`/`hashCode` widened to 
include `output` and `mode`), and the count-bug regression still reproduces: 
the `CometAggregateSuite` test returns 1 row where Spark returns 2. So the 
discriminator isn't (just) in `output`; the count+1 / count-1 projection lives 
deeper in the plan and is structurally indistinguishable at the output level 
after canonicalization. I'm looking into whether a tighter key can recover 
q83's reuse without losing the count-bug guard.
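
To make the collision concrete, here is a minimal toy sketch in Python. It is not Spark or Comet code: `ToyPlan`, `canonical_output`, `narrow_key`, `structural_key`, and the `"some_mode"` placeholder are all hypothetical names, and it assumes canonicalization reduces output columns to positional placeholders. Under those assumptions, two plans that differ only in an inner projection get the same key when the key is built from output and mode alone.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class ToyPlan:
    """Hypothetical stand-in for a plan node (not a Spark or Comet class)."""
    expressions: Tuple[str, ...]  # what the node computes internally
    output: Tuple[str, ...]       # names of the columns it produces


def canonical_output(plan: ToyPlan) -> Tuple[str, ...]:
    # Assumed model of canonicalization: column names become positional placeholders.
    return tuple(f"col#{i}" for i in range(len(plan.output)))


def narrow_key(plan: ToyPlan, mode: str):
    # Key built only from the canonicalized output and the mode.
    return (canonical_output(plan), mode)


def structural_key(plan: ToyPlan, mode: str):
    # Key that also includes the expressions evaluated inside the plan.
    return (plan.expressions, canonical_output(plan), mode)


# Two plans with identical output shape but different inner projections.
plus_one = ToyPlan(expressions=("count + 1",), output=("c",))
minus_one = ToyPlan(expressions=("count - 1",), output=("c",))
mode = "some_mode"  # placeholder; the real mode object is not modeled here

narrow_collides = narrow_key(plus_one, mode) == narrow_key(minus_one, mode)
structural_collides = structural_key(plus_one, mode) == structural_key(minus_one, mode)
```

In this toy model `narrow_collides` is true and `structural_collides` is false, which mirrors the observation above: widening the key with `output` and `mode` does not separate the two plans, while a key that sees the inner expressions does. Whether such a key can still permit q83's reuse is the open question.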



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]