[ https://issues.apache.org/jira/browse/ARROW-14035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17419366#comment-17419366 ]
David Li commented on ARROW-14035: ---------------------------------- {{value_counts}} gives you a histogram where the x-axis are the distinct values and the y-axis is the number of occurrences of that value. {{count_distinct}} is just {{COUNT(DISTINCT *)}}. Also, {{value_counts}} is a vector kernel whereas this should be a scalar aggregate kernel. > [C++][Compute] Implement non-hash count_distinct aggregate kernel > ----------------------------------------------------------------- > > Key: ARROW-14035 > URL: https://issues.apache.org/jira/browse/ARROW-14035 > Project: Apache Arrow > Issue Type: New Feature > Components: C++ > Reporter: Ian Cook > Assignee: Percy Camilo Triveño Aucahuasi > Priority: Critical > Labels: kernel > Fix For: 6.0.0 > > > ARROW-12728 added a {{hash_count_distinct}} hash aggregate kernel, but there > is no non-hash {{count_distinct}} aggregate kernel. -- This message was sent by Atlassian Jira (v8.3.4#803005)