Panagiotis Garefalakis created HIVE-23036:
---------------------------------------------

             Summary: Incorrect ORC PPD eval with sub-millisecond timestamps
                 Key: HIVE-23036
                 URL: https://issues.apache.org/jira/browse/HIVE-23036
             Project: Hive
          Issue Type: Bug
            Reporter: Panagiotis Garefalakis
            Assignee: Panagiotis Garefalakis


See [ORC-611|https://issues.apache.org/jira/browse/ORC-611] for more details

ORC stores timestamps with:
 - nanosecond precision for the data itself
 - millisecond precision for min-max statistics

When min and max are both rounded to the same millisecond value, a
predicate literal with sub-millisecond precision can fall outside the
[min, max] range, so the PPD evaluator skips data that actually matches.
{code:sql}
create table tsstat (ts timestamp) stored as orc;
insert into tsstat values ("1970-01-01 00:00:00.0005");
select * from tsstat where ts = "1970-01-01 00:00:00.0005";
-- returns 0 rows, although the inserted row matches the predicate
{code}
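
The failure in the query above can be reproduced outside Hive with a minimal sketch. This is not the Hive or ORC code: the class and method names are hypothetical, and it assumes statistics are truncated to milliseconds (the exact rounding mode in ORC may differ). The widened check is only one possible mitigation that holds under that truncation assumption, not necessarily the fix adopted in ORC-611.

```java
import java.time.Instant;

public class TimestampStatsSketch {

    // Hypothetical stand-in for a millisecond-precision statistic.
    // Assumption: sub-millisecond digits are truncated.
    static long toStatMillis(Instant ts) {
        return ts.toEpochMilli();
    }

    // Naive range check: compares a nanosecond-precision literal directly
    // against millisecond-precision min/max values.
    static boolean naiveMightMatch(Instant literal, long minMillis, long maxMillis) {
        Instant min = Instant.ofEpochMilli(minMillis);
        Instant max = Instant.ofEpochMilli(maxMillis);
        return !literal.isBefore(min) && !literal.isAfter(max);
    }

    // Precision-aware check: treats the stored max as covering the whole
    // millisecond, i.e. up to max + 999,999 ns (valid only under truncation).
    static boolean widenedMightMatch(Instant literal, long minMillis, long maxMillis) {
        Instant min = Instant.ofEpochMilli(minMillis);
        Instant max = Instant.ofEpochMilli(maxMillis).plusNanos(999_999);
        return !literal.isBefore(min) && !literal.isAfter(max);
    }

    public static void main(String[] args) {
        // Same value as in the reproduction: 0.5 ms after the epoch.
        Instant ts = Instant.parse("1970-01-01T00:00:00.000500Z");
        long stat = toStatMillis(ts); // min and max collapse to the same value

        System.out.println("naive:   " + naiveMightMatch(ts, stat, stat));
        System.out.println("widened: " + widenedMightMatch(ts, stat, stat));
    }
}
```

The naive check rejects the only row in the file because the literal lies strictly above the truncated max, which matches the empty result of the query.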

ORC PPD evaluation currently happens as part of OrcInputFormat:
[https://github.com/apache/hive/blob/7e39a2c13711f9377c9ce1edb4224880421b1ea5/ql/src/java/org/apache/hadoop/hive/ql/io/orc/OrcInputFormat.java#L2314]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
