2010YOUY01 commented on PR #17029:
URL: https://github.com/apache/datafusion/pull/17029#issuecomment-3153115527

   If it fails, I think this approach will make debugging very painful.
   
   I have an alternative idea to make this validation more fine-grained.
   Say there are 3 spills to merge, with estimated max batch sizes of
   10M, 15M, and 12M. Then, during merging, we only check that each
   stream's batch size never exceeds its own estimate, i.e. the limits are
   [10M, 15M, 12M] for streams 0, 1, and 2 respectively.
   
   This approach is less comprehensive, and it can be a bit hacky to
   implement (the operator has to be extended directly for this check),
   but it makes troubleshooting much easier.
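
   To make the idea concrete, here is a rough sketch of the per-stream check.
   It is not taken from the PR: `PerStreamBatchLimits`, `BatchSizeViolation`,
   and `check` are hypothetical names, "10M" is assumed to mean 10 MiB, and a
   batch exactly equal to its estimate is assumed to be acceptable. The point
   is only that a failure names the offending stream instead of reporting one
   global number.

```rust
/// Assumption: "10M" in the example above means 10 MiB.
pub const MB: usize = 1 << 20;

/// Hypothetical error type: records which spill stream broke its estimate.
#[derive(Debug, PartialEq)]
pub struct BatchSizeViolation {
    pub stream_idx: usize,
    pub actual_bytes: usize,
    pub limit_bytes: usize,
}

/// Hypothetical helper: one estimated max batch size per spill stream.
pub struct PerStreamBatchLimits {
    limits: Vec<usize>,
}

impl PerStreamBatchLimits {
    pub fn new(limits: Vec<usize>) -> Self {
        Self { limits }
    }

    /// Validate one batch read from `stream_idx` during the merge.
    /// Panics if `stream_idx` is out of range, since that would be a
    /// programming error rather than a bad estimate.
    pub fn check(
        &self,
        stream_idx: usize,
        actual_bytes: usize,
    ) -> Result<(), BatchSizeViolation> {
        let limit_bytes = self.limits[stream_idx];
        if actual_bytes <= limit_bytes {
            Ok(())
        } else {
            Err(BatchSizeViolation {
                stream_idx,
                actual_bytes,
                limit_bytes,
            })
        }
    }
}
```

   With limits of `[10 * MB, 15 * MB, 12 * MB]`, a 13 MiB batch from stream 2
   would be rejected with an error that carries `stream_idx: 2`, which is the
   information needed to find the bad estimate quickly.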


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

