Chendi.Xue created ARROW-7165: --------------------------------- Summary: [C++] Arrow Compute Group By Support Key: ARROW-7165 URL: https://issues.apache.org/jira/browse/ARROW-7165 Project: Apache Arrow Issue Type: New Feature Components: C++ - Compute Reporter: Chendi.Xue
Not sure if there is any plan to support groupby in arrow? Here is some to do in my mind: # To make current arrow/compute/kernels/hash supporting received a memo_table as input, so multiple array will be able to get dictencode and valuecount based on same hashmap with a unified index. # To add a split array function instead of using take multiple time to split one array to several ones. # so the output array can use current funcs under compute/kernels, such as sum/count/sort to support group by. But this is some of my basic idea, wanna know if there is a on going plan on this? -- This message was sent by Atlassian Jira (v8.3.4#803005)