[ 
https://issues.apache.org/jira/browse/HIVE-12763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15106304#comment-15106304
 ] 

Pengcheng Xiong commented on HIVE-12763:
----------------------------------------

Thanks [~gopalv], I plan to first leverage existing ndv computation mechanism 
in HIVE. It is similar to what DataScketches uses and I assume there is not too 
much performance difference for hive, especially when they are stored in 
HBase.. DataScketches is interesting to me too and may be a good candidate for 
further improvement.

> Use bit vector to track per partition NDV
> -----------------------------------------
>
>                 Key: HIVE-12763
>                 URL: https://issues.apache.org/jira/browse/HIVE-12763
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Pengcheng Xiong
>            Assignee: Pengcheng Xiong
>         Attachments: HIVE-12763.01.patch, HIVE-12763.02.patch
>
>
> This will improve merging of per partitions stats.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to