wiedld opened a new pull request, #11399:
URL: https://github.com/apache/datafusion/pull/11399

   ## Which issue does this PR close?
   
   Closes #11397 
   
   ## Rationale for this change
   
   During the parallel writes in ParquetSink, we spawn a series of parallel 
tasks and then message pass the outcome from one task to the next. In 
abstraction:
   read_batches => channel => `Vec<col_write_tasks>` => channel => 
`Vec<serialize_rowgroup_tasks>`
   
   When we encounter an error in one of the `Vec<x_tasks>` we are first surface 
an error on the channel.send() rather than on the task join.
   
   ## What changes are included in this PR?
   
   Don't surface the errors on the channel send.
   This results in the proper error returned, as can be seen on the updated 
test.
   
   ## Are these changes tested?
   
   Yes.
   
   ## Are there any user-facing changes?
   
   No.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to