[ 
https://issues.apache.org/jira/browse/KYLIN-3453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16722859#comment-16722859
 ] 

Chao Long commented on KYLIN-3453:
----------------------------------

Because the estimated split region number is used to create Hbase pre splitting 
table and "create hbase table" step is before "convert cuboid data to HFile" 
step, we can't get the real size of cube data.

> Improve cube size estimation for TOPN, COUNT DISTINCT
> -----------------------------------------------------
>
>                 Key: KYLIN-3453
>                 URL: https://issues.apache.org/jira/browse/KYLIN-3453
>             Project: Kylin
>          Issue Type: Improvement
>            Reporter: Chao Long
>            Assignee: Chao Long
>            Priority: Major
>             Fix For: v2.5.0
>
>         Attachments: image-2018-07-24-16-29-07-359.png, 
> image-2018-07-24-16-30-50-804.png, image-2018-07-24-16-33-43-231.png, 
> image-2018-07-24-16-37-09-199.png, image-2018-07-24-17-11-26-283.png, 
> image-2018-07-24-17-11-27-829.png, image-2018-07-24-17-12-25-880.png
>
>
> Currently, Kylin has poor cube size estimation for TOPN, COUNT DISTINCT. We 
> should improve it, then we can get a reasonable split num when cube building. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to