[I] [native_datafusion] [Spark SQL Tests] Plan structure differences cause test failures [datafusion-comet]

via GitHub Wed, 28 Jan 2026 16:42:30 -0800


andygrove opened a new issue, #3315:
URL: https://github.com/apache/datafusion-comet/issues/3315


   ## Summary
   
   4 Spark SQL tests fail because `native_datafusion` produces different plan nodes and partitioning info than expected.
   
   ## Failing Tests
   
   - `ParquetV2Suite`: "Fallback Parquet V2 to V1": expects `FileSourceScanExec` or `CometScanExec` in plan, but `native_datafusion` uses `CometNativeScan`
   - `BroadcastJoinSuite`: "broadcast join where streamed side's output partitioning is HashPartitioning" (x2): `UnknownPartitioning(8)` instead of `PartitioningCollection`
   - `FileStreamSinkSuite`: "self-union, DSv1, read via DataStreamReader API" / "self-union, DSv1, read via table API": streaming query expects specific plan structure
   
   ## Root Cause
   
   `native_datafusion` uses `CometNativeScan` instead of `CometScanExec`/`FileSourceScanExec` and reports `UnknownPartitioning` instead of preserving the original partitioning information. Tests that assert on specific plan node types or on the reported output partitioning therefore fail.
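
To illustrate why a type-based plan assertion breaks, here is a minimal sketch in Python. It is a toy model only: the classes below are hypothetical stand-ins that borrow the node names from this issue, not the real Spark or Comet classes, and `collect` / `has_expected_scan` are made-up helpers that mimic how a test might search a plan tree for scan nodes.

```python
from dataclasses import dataclass, field
from typing import List


# Toy stand-ins for physical plan nodes. These are NOT the Spark/Comet classes;
# only the names are borrowed from the issue text.
@dataclass
class PlanNode:
    children: List["PlanNode"] = field(default_factory=list)


class FileSourceScanExec(PlanNode):
    pass


class CometScanExec(PlanNode):
    pass


class CometNativeScan(PlanNode):
    pass


class ProjectExec(PlanNode):
    pass


def collect(plan, node_types):
    """Return every node in the tree that is an instance of one of node_types."""
    found = [plan] if isinstance(plan, node_types) else []
    for child in plan.children:
        found.extend(collect(child, node_types))
    return found


# The scan node types the (hypothetical) test accepts.
EXPECTED_SCANS = (FileSourceScanExec, CometScanExec)


def has_expected_scan(plan, node_types=EXPECTED_SCANS):
    """Mimic a test assertion that the plan contains an accepted scan node."""
    return len(collect(plan, node_types)) > 0


v1_plan = ProjectExec(children=[FileSourceScanExec()])
native_plan = ProjectExec(children=[CometNativeScan()])

print(has_expected_scan(v1_plan))      # True
print(has_expected_scan(native_plan))  # False: the scan node has a different type
print(has_expected_scan(native_plan, EXPECTED_SCANS + (CometNativeScan,)))  # True
```

The last call shows one possible direction, widening the set of accepted scan node types. Whether the affected tests should be relaxed in that way, or the native scan should instead expose the plan shape and partitioning the tests expect, is not decided in this issue.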
   
   ## Related
   
   Discovered in CI for #3307 (enable native_datafusion in auto scan mode).


