ding-young commented on issue #17334: URL: https://github.com/apache/datafusion/issues/17334#issuecomment-3235438790
> I think some specific timing can cause the non-spillable operators to OOM, however if spillable operators can spill earlier, the execution should be possible to finish. ([@ding-young](https://github.com/ding-young) Just want to make sure, is such race condition possible? 🤔 ) Yes, scheduling / timing matters here definitely since we don't have any mechanism to reclaim memory reservation from other spillable operators when non-spillable operators fail to `try_grow` its reservation. > I could definitely foresee race-conditions being possible, but I can also imagine deterministic failures. For example in case of a single partition: we deterministically fill up memory with the batches to sort, and let's say we get to the point where we need one more batch to trigger sort spilling. At this point the non-spillable input operator needs to allocate memory to provide that last batch, and errors because not enough memory available. But yes, this is also possible. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
