cetra3 opened a new pull request, #20159:
URL: https://github.com/apache/datafusion/pull/20159

   ## Which issue does this PR close?
   
   Closes https://github.com/apache/datafusion/issues/19425
   
   ## Rationale for this change
   
   This adjusts the way that the spill channel works.  Currently we have a 
spill writer & reader pairing which uses a mutex to coordindate when a file is 
ready to be read.
   
   What happens is, that because we were using a `spawn_buffered` call, the 
read task would race ahead trying to read a file which is yet to be written out 
completely.
   
   Alongside this, we need to flush each write to the file, as there is a 
chance that another thread may see stale data.
   
   ## What changes are included in this PR?
   
   Adds a flush on write, and converts the read task to not buffer reads.
   
   ## Are these changes tested?
   
   I haven't written a test, but I have been running the example in the 
attached issue.  While it now fails with allocation errors, the original error 
goes away.
   
   ## Are there any user-facing changes?
   
   Nope


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to