andygrove opened a new issue, #3315: URL: https://github.com/apache/datafusion-comet/issues/3315
## Summary 4 Spark SQL tests fail because `native_datafusion` produces different plan nodes and partitioning info than expected. ## Failing Tests - `ParquetV2Suite`: "Fallback Parquet V2 to V1" — expects `FileSourceScanExec` or `CometScanExec` in plan, but `native_datafusion` uses `CometNativeScan` - `BroadcastJoinSuite`: "broadcast join where streamed side's output partitioning is HashPartitioning" (x2) — `UnknownPartitioning(8)` instead of `PartitioningCollection` - `FileStreamSinkSuite`: "self-union, DSv1, read via DataStreamReader API" / "self-union, DSv1, read via table API" — streaming query expects specific plan structure ## Root Cause `native_datafusion` uses `CometNativeScan` instead of `CometScanExec`/`FileSourceScanExec` and reports `UnknownPartitioning` instead of preserving the original partitioning information. Tests that inspect plan internals fail. ## Related Discovered in CI for #3307 (enable native_datafusion in auto scan mode). -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
