Dong Chen created HIVE-10257: -------------------------------- Summary: Ensure Parquet Hive has null optimization Key: HIVE-10257 URL: https://issues.apache.org/jira/browse/HIVE-10257 Project: Hive Issue Type: Sub-task Reporter: Dong Chen Assignee: Dong Chen
In Parquet statistics, a boolean value {{hasNonNullValue}} is used for each column chunk. Hive could use this value to skip a column, avoid null-checking logic, and speed up vectorization like HIVE-4478 (in the future, it is not completed yet). In this Jira we could check whether this null optimization works, and make changes if any. -- This message was sent by Atlassian JIRA (v6.3.4#6332)