2010YOUY01 opened a new issue, #18261:
URL: https://github.com/apache/datafusion/issues/18261

   ### Is your feature request related to a problem or challenge?
   
   I noticed it when reviewing https://github.com/apache/datafusion/pull/18207
   
   When writing spill files in the sort executor, it requires some 
pre-processing steps for the batches to spill to avoid duplicate writes, 
specifically 
https://github.com/apache/datafusion/blob/167baf718278112d427910a0296f366909bc3384/datafusion/physical-plan/src/sorts/sort.rs#L409
   
   This step should apply to all spilling usages, not only limited to the sort 
executor, so a better way is to put it inside 
`SpillManager`/`InProgressSpillFile`, or maybe `Arrow` IPC writer if possible.
   
   ### Describe the solution you'd like
   
   _No response_
   
   ### Describe alternatives you've considered
   
   _No response_
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to