[
https://issues.apache.org/jira/browse/HIVE-12763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15106304#comment-15106304
]
Pengcheng Xiong commented on HIVE-12763:
----------------------------------------
Thanks [~gopalv]. I plan to first leverage the existing NDV computation
mechanism in Hive. It is similar to what DataSketches uses, and I assume there
is not much performance difference for Hive, especially when they are stored
in HBase. DataSketches is interesting to me too and may be a good candidate
for further improvement.
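
To illustrate why keeping a bit vector per partition helps with merging, here is
a minimal sketch in Java. It is not Hive's actual NDV estimator and not the
code in the attached patches; the class name FmSketch, the hash mixer, and the
single-bitmap layout are assumptions made only for this example. It uses a
Flajolet-Martin-style bitmap: each value sets one bit chosen from its hash, and
two partitions are merged with a bitwise OR. The merged bitmap is identical to
the one built from all rows at once, whereas adding up two per-partition NDV
numbers would double-count values that appear in both partitions.

```java
import java.util.BitSet;

/** Illustrative Flajolet-Martin-style sketch; hypothetical, not Hive's class. */
final class FmSketch {
    private static final int WIDTH = 64;
    // Standard Flajolet-Martin correction factor.
    private static final double PHI = 0.77351;

    private final BitSet bits = new BitSet(WIDTH);

    /** 64-bit mixing function (assumed here; any well-distributed hash works). */
    private static long mix(long x) {
        x ^= x >>> 30;
        x *= 0xbf58476d1ce4e5b9L;
        x ^= x >>> 27;
        x *= 0x94d049bb133111ebL;
        x ^= x >>> 31;
        return x;
    }

    /** Records a value by setting the bit at the index of its hash's lowest set bit. */
    void add(long value) {
        int rank = Long.numberOfTrailingZeros(mix(value)); // 64 when the hash is 0
        bits.set(Math.min(rank, WIDTH - 1));
    }

    /** Merges another partition's sketch into this one with a bitwise OR. */
    void merge(FmSketch other) {
        bits.or(other.bits);
    }

    /** Rough NDV estimate from the index of the lowest unset bit. */
    double estimate() {
        return Math.pow(2, bits.nextClearBit(0)) / PHI;
    }

    /** Returns a copy of the underlying bit vector. */
    BitSet toBitSet() {
        return (BitSet) bits.clone();
    }
}
```

A single bitmap like this gives only a coarse estimate; practical estimators
keep many bitmaps and average them. The merge property is the same either way:
OR the per-partition bit vectors, then estimate once from the result.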
> Use bit vector to track per partition NDV
> -----------------------------------------
>
> Key: HIVE-12763
> URL: https://issues.apache.org/jira/browse/HIVE-12763
> Project: Hive
> Issue Type: Improvement
> Reporter: Pengcheng Xiong
> Assignee: Pengcheng Xiong
> Attachments: HIVE-12763.01.patch, HIVE-12763.02.patch
>
>
> This will improve merging of per-partition stats.