Panagiotis Garefalakis created HIVE-23036:
---------------------------------------------
Summary: Incorrect ORC PPD eval with sub-millisecond timestamps
Key: HIVE-23036
URL: https://issues.apache.org/jira/browse/HIVE-23036
Project: Hive
Issue Type: Bug
Reporter: Panagiotis Garefalakis
Assignee: Panagiotis Garefalakis
See [ORC-611|https://issues.apache.org/jira/browse/ORC-611] for more details.
ORC stores timestamps with:
- nanosecond precision for the data itself
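- millisecond precision for min-max statistics
Because both min and max are rounded to the same millisecond value, a
predicate on a timestamp with a sub-millisecond part falls outside the
recorded [min, max] range even though the row exists, so the PPD
evaluator wrongly skips the data that contains it.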
{code:sql}
create table tsstat (ts timestamp) stored as orc;
insert into tsstat values ("1970-01-01 00:00:00.0005");
select * from tsstat where ts = "1970-01-01 00:00:00.0005";
-- returns 0 rows, although the inserted row matches the predicate
{code}
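The empty result above can be reproduced outside Hive with a rough sketch of
the range check. This is not the ORC or Hive code: the class
TimestampPpdSketch and the methods truncateToMillis and mightContain are
hypothetical names used only for illustration, and the sketch assumes the
statistics simply drop the sub-millisecond part. The last check shows one
possible way to avoid the false negative (comparing at the precision of the
statistics); it is not necessarily the fix adopted in ORC-611 or here.

```java
import java.sql.Timestamp;

public class TimestampPpdSketch {

    // Hypothetical helper: keeps only millisecond precision, mimicking
    // min/max statistics that drop the sub-millisecond part (assumption).
    public static Timestamp truncateToMillis(Timestamp ts) {
        return new Timestamp(ts.getTime());
    }

    // Hypothetical equality check against min/max statistics:
    // the data can only match if min <= literal <= max.
    public static boolean mightContain(Timestamp min, Timestamp max, Timestamp literal) {
        return literal.compareTo(min) >= 0 && literal.compareTo(max) <= 0;
    }

    public static void main(String[] args) {
        // The single stored value from the repro above (500 microseconds).
        Timestamp value = Timestamp.valueOf("1970-01-01 00:00:00.0005");

        // With one row, min and max both collapse to the same millisecond.
        Timestamp min = truncateToMillis(value);
        Timestamp max = truncateToMillis(value);

        // Full-precision literal vs. millisecond statistics: the literal is
        // greater than max, so the data is skipped.
        System.out.println("full-precision literal: " + mightContain(min, max, value));

        // Comparing at the precision of the statistics keeps the data.
        System.out.println("truncated literal:      "
                + mightContain(min, max, truncateToMillis(value)));
    }
}
```

Truncating the literal can only widen the set of data that is read, so the
row-level filter still has to apply the original full-precision predicate.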
ORC PPD evaluation currently happens as part of OrcInputFormat:
[https://github.com/apache/hive/blob/7e39a2c13711f9377c9ce1edb4224880421b1ea5/ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcInputFormat.java#L2314]