alamb opened a new issue, #9128:
URL: https://github.com/apache/arrow-rs/issues/9128

   **Is your feature request related to a problem or challenge? Please describe 
what you are trying to do.**
   - related to https://github.com/apache/arrow-rs/issues/9061
   
   I noticed on https://github.com/apache/arrow-rs/issues/9061 that there is 
non-trivial overhead in creating arrays via ArrayData. In parallel, I am trying 
to improve make_array, but @tustvold had an even better idea in 
https://github.com/apache/arrow-rs/pull/9058#issuecomment-3712272488
   
   > My 2 cents is it would be better to move the codepaths relying on 
ArrayData over to using the typed arrays directly, this should not only cut 
down on allocations but unnecessary validation and dispatch overheads.
   
   An ArrayData has at least one extra allocation (for the Vec that holds 
Buffers) as well as a bunch of dynamic function calls. While this overhead is 
small individually, it is paid for every array, so in aggregate it can be 
substantial.
   
   **Describe the solution you'd like**
   Move the codepaths that rely on ArrayData over to using the typed arrays 
directly. This should cut down not only on allocations but also on unnecessary 
validation and dispatch overheads.
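
To make the cost difference concrete, here is a minimal, std-only sketch. It is a simplified model, not arrow-rs code: `TypeTag`, `UntypedData`, `TypedInt32`, and `int32_via_untyped` are hypothetical stand-ins for the roles played by a runtime type tag, an untyped container such as ArrayData, and a typed array. The untyped route pays for a `Vec` of buffers plus runtime checks, while the typed route constructs the array in one step.

```rust
use std::sync::Arc;

/// Hypothetical runtime type tag (a stand-in, not arrow's `DataType`).
#[derive(Debug, Clone, Copy, PartialEq)]
enum TypeTag {
    Int32,
    Utf8,
}

/// Hypothetical untyped container: the buffers live in a `Vec`,
/// which costs one extra heap allocation per array.
struct UntypedData {
    tag: TypeTag,
    len: usize,
    buffers: Vec<Arc<[u8]>>,
}

/// Hypothetical typed array that owns its values directly.
#[derive(Debug, PartialEq)]
struct TypedInt32 {
    values: Arc<[i32]>,
}

impl TypedInt32 {
    /// Direct construction: no buffer `Vec`, no runtime type checks.
    fn new(values: Vec<i32>) -> Self {
        Self { values: values.into() }
    }

    /// Construction through the untyped container: the tag, the buffer
    /// count, and the byte length must all be checked at runtime.
    fn try_from_untyped(data: UntypedData) -> Result<Self, String> {
        if data.tag != TypeTag::Int32 {
            return Err(format!("expected Int32, got {:?}", data.tag));
        }
        if data.buffers.len() != 1 {
            return Err("expected exactly one buffer".to_string());
        }
        let buffer = &data.buffers[0];
        if buffer.len() != data.len * 4 {
            return Err("buffer length does not match array length".to_string());
        }
        // Copying the bytes is a simplification of this sketch only.
        let values: Vec<i32> = buffer
            .chunks_exact(4)
            .map(|b| i32::from_le_bytes([b[0], b[1], b[2], b[3]]))
            .collect();
        Ok(Self::new(values))
    }
}

/// Builds the same array by going through the untyped container first.
fn int32_via_untyped(values: &[i32]) -> Result<TypedInt32, String> {
    let bytes: Vec<u8> = values.iter().flat_map(|v| v.to_le_bytes()).collect();
    let data = UntypedData {
        tag: TypeTag::Int32,
        len: values.len(),
        buffers: vec![bytes.into()], // the extra `Vec` allocation
    };
    TypedInt32::try_from_untyped(data)
}
```

Both routes produce the same array; the difference is only in the work done to get there, which is why the per-array overhead adds up when many small arrays are created.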
   
   **Describe alternatives you've considered**
   
   **Additional context**
   

