2010YOUY01 opened a new issue, #18261: URL: https://github.com/apache/datafusion/issues/18261
### Is your feature request related to a problem or challenge? I noticed it when reviewing https://github.com/apache/datafusion/pull/18207 When writing spill files in the sort executor, it requires some pre-processing steps for the batches to spill to avoid duplicate writes, specifically https://github.com/apache/datafusion/blob/167baf718278112d427910a0296f366909bc3384/datafusion/physical-plan/src/sorts/sort.rs#L409 This step should apply to all spilling usages, not only limited to the sort executor, so a better way is to put it inside `SpillManager`/`InProgressSpillFile`, or maybe `Arrow` IPC writer if possible. ### Describe the solution you'd like _No response_ ### Describe alternatives you've considered _No response_ ### Additional context _No response_ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
