Shekharrajak opened a new issue, #19499: URL: https://github.com/apache/druid/issues/19499
### Description Introduce Apache Arrow as a first-class in-memory representation inside Druid's MSQ engine by adding ArrowRowsAndColumns . The #19456 discussion identified that swapping Druid's row-at-a-time JVM hot path requires touching multiple load-bearing abstractions (Frame, FrameProcessor, channels, leaf segment readers, shuffle, planner). As @gianm , @jtuglu1 mentioned - The first move is the operator with the smallest blast radius (one non-leaf single-input FrameProcessor) backed by a reusable abstraction (ArrowRowsAndColumns) — so every new implementations phases reuses it. ### Motivation introduce ArrowRowsAndColumns and adopt it inside GroupByPostShuffleFrameProcessor — first concrete step of the #19456 modernisation program. Related • #19456 — Native, vectorised, zero-copy execution path (this is the first concrete step) • #13458 — RowsAndColumns introduction (the abstraction this extends) • #18909 — WireTransferable (the seam that Phase C will use) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
