[ https://issues.apache.org/jira/browse/HIVE-8732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14199353#comment-14199353 ]
Dain Sundstrom commented on HIVE-8732: -------------------------------------- DoubleStatisticsImpl merge and update methods don't handle NaN properly. Any comparison with NaN returns false, so if the first value is NaN you end up with min and max of NaN, which implies that the column only contains NaNs. We should consider tracking NaN specially in the stats. Regardless, for now any code reading the DoubleStatistic should discard a stat containing a NaN. > ORC string statistics are not merged correctly > ---------------------------------------------- > > Key: HIVE-8732 > URL: https://issues.apache.org/jira/browse/HIVE-8732 > Project: Hive > Issue Type: Bug > Components: File Formats > Reporter: Owen O'Malley > Assignee: Owen O'Malley > Priority: Blocker > Fix For: 0.14.0 > > Attachments: HIVE-8732.patch > > > Currently ORC's string statistics do not merge correctly causing incorrect > maximum values. -- This message was sent by Atlassian JIRA (v6.3.4#6332)