pedroerp commented on PR #43632: URL: https://github.com/apache/arrow/pull/43632#issuecomment-2339503509
@leventov the Velox part seems mostly accurate, with the exception we don't do a lot of cross-language flow control. The common pattern is to delegate the execution of an entire query fragment to Velox, and let Velox use exchanges to shuffle data across different hosts running the same Velox process. But then, Exchange* is just one implementation based on Presto's requirements; one could provide their own shuffle operators implementing specialized behavior. Also, in Velox parlance, Task is how you execute query plans; Drivers are the threads of execution inside Tasks; Splits are the unit of processing (one split can be a segment of a Parquet file, for example), and it ends up being processed by one Driver. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
