js8544 commented on issue #37055:
URL: https://github.com/apache/arrow/issues/37055#issuecomment-1688582657

   Ah, I had done some research on this issue but forgot to post my findings. I 
think @rok's comment 
[here](https://github.com/apache/arrow/blob/main/cpp/src/arrow/compute/kernels/vector_hash.cc#L452)
 and the discussion 
[here](https://github.com/apache/arrow/pull/9683#issuecomment-800442398) 
explain it well. We can optimize it by first computing the result over each 
chunk and then hash-aggregating the per-chunk results. However, I don't think 
we can call hash aggregate functions directly from compute kernels without 
depending on Acero. 
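
   To make the chunk-then-merge idea concrete, here is a rough sketch. It assumes the kernel in question is a counting operation along the lines of `value_counts`. Plain Python lists stand in for chunks and `collections.Counter` stands in for the hash aggregation. None of this is Arrow API, and the function name `chunked_value_counts` is hypothetical.

```python
from collections import Counter


def chunked_value_counts(chunks):
    """Count values per chunk, then merge the partial counts by key.

    Hypothetical helper for illustration only; not part of Arrow.
    """
    # Step 1: compute the counts independently over each chunk.
    partials = [Counter(chunk) for chunk in chunks]

    # Step 2: merge the partial results. Summing counts per distinct value
    # plays the role of a hash aggregation (group by value, sum the counts).
    total = Counter()
    for partial in partials:
        total.update(partial)
    return dict(total)


# Three "chunks" standing in for the chunks of a chunked array.
chunks = [["a", "b", "a"], ["b", "c"], ["a"]]
counts = chunked_value_counts(chunks)
```

   The merge step only touches the distinct values of each chunk rather than every row, which is where the saving would come from if the number of distinct values is small relative to the chunk length.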
   
   cc @westonpace Can you confirm?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
