[ 
https://issues.apache.org/jira/browse/ARROW-14035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17419366#comment-17419366
 ] 

David Li commented on ARROW-14035:
----------------------------------

{{value_counts}} gives you a histogram where the x-axis are the distinct values 
and the y-axis is the number of occurrences of that value. {{count_distinct}} 
is  just {{COUNT(DISTINCT *)}}.

Also, {{value_counts}} is a vector kernel whereas this should be a scalar 
aggregate kernel.

> [C++][Compute] Implement non-hash count_distinct aggregate kernel
> -----------------------------------------------------------------
>
>                 Key: ARROW-14035
>                 URL: https://issues.apache.org/jira/browse/ARROW-14035
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++
>            Reporter: Ian Cook
>            Assignee: Percy Camilo Triveño Aucahuasi
>            Priority: Critical
>              Labels: kernel
>             Fix For: 6.0.0
>
>
> ARROW-12728 added a {{hash_count_distinct}} hash aggregate kernel, but there 
> is no non-hash {{count_distinct}} aggregate kernel.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to