tobixdev opened a new pull request, #9535:
URL: https://github.com/apache/arrow-rs/pull/9535

   # Which issue does this PR close?
   
   - Related to https://github.com/apache/arrow-rs/issues/279
   
   # Rationale for this change
   
   The `take` kernel is very important for many operations (e.g., `HashJoin` in 
DataFusion IIRC). Currently, there is a gap between the performance of the take 
kernel for primitive arrays (e.g., `DataType::UInt32`) and fixed size binary 
arrays of the same length (e.g., `FixedSizeBinary<4>`). 
   
   In our case this lead to a performance reduction when moving from an 
integer-based id column to a fixed-size-binary-based id column. This PR aims to 
address parts of this gap.
   
   # What changes are included in this PR?
   
   - Add `take_fixed_size` that can be called for set of predefined fsb-lengths 
that we want to support. This is a "flat buffer" version of the `take_native` 
kernel.
   
   # Are these changes tested?
   
   I've added another test that still exercises the non-optimized code path.
   
   # Are there any user-facing changes?
   
   No


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to