[ https://issues.apache.org/jira/browse/HIVE-8746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14200658#comment-14200658 ]

Dain Sundstrom commented on HIVE-8746:
--------------------------------------

A good first step would be to record the writer's timezone in the file
postscript.  The current reader could then throw an exception if the JVM
timezone doesn't match the timezone declared in the postscript.  Later, when
someone has more time, the reader could adjust the base epoch to the file's
timezone.
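
To make the proposed check concrete, here is a minimal sketch of what the
reader-side guard might look like. It is not the ORC reader API: the class
name, the method name, and the idea that the postscript exposes the writer
zone as a zone ID string are all assumptions made for illustration.

```java
import java.time.ZoneId;

// Hypothetical sketch; not part of Hive or ORC.
class WriterZoneCheck {
    // writerZoneId stands in for a zone ID string that a writer would record
    // in the file postscript (an assumed field, it does not exist today).
    static void checkWriterZone(String writerZoneId, ZoneId readerZone) {
        ZoneId writerZone = ZoneId.of(writerZoneId);
        // Compares zone IDs only; two differently named zones with identical
        // rules would still be reported as a mismatch.
        if (!writerZone.equals(readerZone)) {
            throw new IllegalStateException(
                "File was written in " + writerZone
                + " but is being read in " + readerZone);
        }
    }

    public static void main(String[] args) {
        // Same zone on both sides: the check passes silently.
        checkWriterZone("America/Los_Angeles", ZoneId.of("America/Los_Angeles"));
        // Different zones: the check throws.
        try {
            checkWriterZone("America/Los_Angeles", ZoneId.of("UTC"));
        } catch (IllegalStateException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

Failing loudly is deliberately conservative: it rejects some reads that would
have produced correct values, in exchange for never returning a silently
shifted timestamp.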

What do you think?

> ORC timestamp columns are sensitive to daylight savings time
> ------------------------------------------------------------
>
>                 Key: HIVE-8746
>                 URL: https://issues.apache.org/jira/browse/HIVE-8746
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Owen O'Malley
>            Assignee: Owen O'Malley
>
> Hive uses Java's Timestamp class to manipulate timestamp columns.
> Unfortunately, the textual parsing in Timestamp is done in local time, while
> the internal storage is in UTC.
> ORC mostly sidesteps this issue by computing the difference between the time
> and a base time, both in local time, and storing that difference in the file.
> Reading the file in a different timezone mostly works correctly:
> "2014-01-01 12:34:56" will read correctly in every timezone.
> However, moving between timezones with different daylight saving rules
> creates trouble. In particular, a file written on a computer in PST and read
> in UTC will read "2014-06-06 12:34:56" as "2014-06-06 11:34:56".
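
The one-hour shift described in the issue can be reproduced with a small
simulation. This is only a sketch of the encoding as the description states
it (store the difference from a base time, with both values interpreted in
local time), not the actual ORC reader or writer code. The base time of
2015-01-01 00:00:00 and the names encode/decode are assumptions for
illustration; any base that falls outside daylight saving time in the
writer's zone gives the same result.

```java
import java.time.LocalDateTime;
import java.time.ZoneId;
import java.time.temporal.ChronoUnit;

// Hypothetical sketch; not part of Hive or ORC.
class OrcTimestampDrift {
    // Assumed base time, interpreted in whatever zone the JVM is using.
    static final LocalDateTime BASE = LocalDateTime.of(2015, 1, 1, 0, 0, 0);

    // Writer side: elapsed seconds from the base to the value, with both
    // wall-clock times resolved in the writer's zone.
    static long encode(LocalDateTime wallClock, ZoneId writerZone) {
        return ChronoUnit.SECONDS.between(
            BASE.atZone(writerZone), wallClock.atZone(writerZone));
    }

    // Reader side: resolve the base in the reader's zone, then add the
    // stored number of seconds.
    static LocalDateTime decode(long seconds, ZoneId readerZone) {
        return BASE.atZone(readerZone).plusSeconds(seconds).toLocalDateTime();
    }

    public static void main(String[] args) {
        ZoneId writer = ZoneId.of("America/Los_Angeles");
        ZoneId reader = ZoneId.of("UTC");

        // Winter value: value and base share the same UTC offset in the
        // writer's zone, so the round trip is exact.
        LocalDateTime winter = LocalDateTime.of(2014, 1, 1, 12, 34, 56);
        System.out.println(decode(encode(winter, writer), reader));
        // prints 2014-01-01T12:34:56

        // Summer value: the writer's zone is one hour ahead of its winter
        // offset, and a reader zone without daylight saving loses that hour.
        LocalDateTime summer = LocalDateTime.of(2014, 6, 6, 12, 34, 56);
        System.out.println(decode(encode(summer, writer), reader));
        // prints 2014-06-06T11:34:56
    }
}
```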



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
