Yongzhi Chen created HIVE-13200: ----------------------------------- Summary: Aggregation functions returning empty rows on partitioned columns Key: HIVE-13200 URL: https://issues.apache.org/jira/browse/HIVE-13200 Project: Hive Issue Type: Bug Components: Physical Optimizer Affects Versions: 2.0.0, 1.0.0 Reporter: Yongzhi Chen Assignee: Yongzhi Chen
Running aggregation functions like MAX, MIN, DISTINCT against partitioned columns will return empty rows if table has property: 'skip.header.line.count'='1' Reproduce: {noformat} DROP TABLE IF EXISTS test; CREATE TABLE test (a int) PARTITIONED BY (b int) ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' TBLPROPERTIES('skip.header.line.count'='1'); INSERT OVERWRITE TABLE test PARTITION (b = 1) VALUES (1), (2), (3), (4); INSERT OVERWRITE TABLE test PARTITION (b = 2) VALUES (1), (2), (3), (4); SELECT * FROM test; SELECT DISTINCT b FROM test; SELECT MAX(b) FROM test; SELECT DISTINCT a FROM test; {noformat} The output: {noformat} 0: jdbc:hive2://localhost:10000/default> SELECT * FROM test; +---------+---------+--+ | test.a | test.b | +---------+---------+--+ | 2 | 1 | | 3 | 1 | | 4 | 1 | | 2 | 2 | | 3 | 2 | | 4 | 2 | +---------+---------+--+ 6 rows selected (0.631 seconds) 0: jdbc:hive2://localhost:10000/default> SELECT DISTINCT b FROM test; +----+--+ | b | +----+--+ +----+--+ No rows selected (47.229 seconds) 0: jdbc:hive2://localhost:10000/default> SELECT MAX(b) FROM test; +-------+--+ | _c0 | +-------+--+ | NULL | +-------+--+ 1 row selected (49.508 seconds) 0: jdbc:hive2://localhost:10000/default> SELECT DISTINCT a FROM test; +----+--+ | a | +----+--+ | 2 | | 3 | | 4 | +----+--+ 3 rows selected (46.859 seconds) {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)