Hyunsik Choi created TAJO-951:
---------------------------------

             Summary: Making good use of statistic of Parquet
                 Key: TAJO-951
                 URL: https://issues.apache.org/jira/browse/TAJO-951
             Project: Tajo
          Issue Type: Improvement
          Components: storage
            Reporter: Hyunsik Choi


Parquet already provides per-column statistics, including the number of null 
values, min, and max value. It seems to be easily integrated with Tajo's 
statistic system. Also, if query execution exploits it well, query performance 
would be improved.


* Added statistics to Parquet pages and rowGroups 
(https://github.com/Parquet/parquet-mr/commit/621cf4e92be3dd3f2dd1a92a8dd12f244a7d7be3)
* 
https://git-wip-us.apache.org/repos/asf?p=incubator-parquet-mr.git;a=blob;f=parquet-column/src/main/java/parquet/column/statistics/Statistics.java;h=2c5ac14a262c69a9a6545243330bbc3a77812c9b;hb=HEAD



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to