Shekharrajak opened a new issue, #19499:
URL: https://github.com/apache/druid/issues/19499

   ### Description
   Introduce Apache Arrow as a first-class in-memory representation inside 
Druid's MSQ engine by adding ArrowRowsAndColumns .
   
   The #19456 discussion identified that swapping Druid's row-at-a-time JVM hot 
path requires touching multiple load-bearing abstractions (Frame, 
FrameProcessor, channels, leaf segment readers, shuffle, planner).
   
   As @gianm , @jtuglu1 mentioned -  The first move is the operator with the 
smallest blast radius (one non-leaf single-input FrameProcessor) backed by a 
reusable abstraction (ArrowRowsAndColumns) — so every new implementations 
phases reuses  it.
   
   ### Motivation
   
   introduce ArrowRowsAndColumns and adopt it inside 
GroupByPostShuffleFrameProcessor — first concrete step of the #19456 
modernisation program.
   
   Related
   
   •  #19456 — Native, vectorised, zero-copy execution path (this is the first 
concrete step)
   •  #13458 — RowsAndColumns introduction (the abstraction this extends)
   •  #18909 — WireTransferable (the seam that Phase C will use)
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to