[ https://issues.apache.org/jira/browse/IMPALA-8184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Work on IMPALA-8184 started by Csaba Ringhofer. ----------------------------------------------- > Add timestamp validation to Orc scanner > --------------------------------------- > > Key: IMPALA-8184 > URL: https://issues.apache.org/jira/browse/IMPALA-8184 > Project: IMPALA > Issue Type: Bug > Components: Backend > Reporter: Csaba Ringhofer > Assignee: Csaba Ringhofer > Priority: Critical > > Similarly to Parquet, Orc can also contain timestamps that are not valid in > Impala, e.g. Hive can insert timestamps before 1400 while these are invalid > in Impala. These invalid timestamps are often handled similarly to NULL, bur > are actually not "real" NULLs, which can lead to some some weird behavior: > Hive: > create table orcts (ts timestamp) stored as orc; > insert into orcts values ("1200-01-01"); > Impala: > select * from orcts where ts is not null; > Returns 1 row: > NULL -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org For additional commands, e-mail: issues-all-h...@impala.apache.org