[ 
https://issues.apache.org/jira/browse/FLINK-5566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

zhangjing updated FLINK-5566:
-----------------------------
    Description: 
We define two structure mode to hold statistics
1. TableStats: contain stats for table level, now only one element: rowCount
2. ColumnStats: contain stats of column level. 
for numeric column type: including ndv, nullCount, max, min, histogram
for string type: including ndv, nullCount, avgLen,maxLen
for boolean:including ndv, nullCount, trueCount, falseCount
for date/time/timestamp:  including ndv, nullCount, max, min, histogram 


> Introduce structure to hold table and column level statistics
> -------------------------------------------------------------
>
>                 Key: FLINK-5566
>                 URL: https://issues.apache.org/jira/browse/FLINK-5566
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Table API & SQL
>            Reporter: Kurt Young
>            Assignee: zhangjing
>
> We define two structure mode to hold statistics
> 1. TableStats: contain stats for table level, now only one element: rowCount
> 2. ColumnStats: contain stats of column level. 
> for numeric column type: including ndv, nullCount, max, min, histogram
> for string type: including ndv, nullCount, avgLen,maxLen
> for boolean:including ndv, nullCount, trueCount, falseCount
> for date/time/timestamp:  including ndv, nullCount, max, min, histogram 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to