Sunil Kumar created HIVE-16056:
----------------------------------
Summary: Hive Changing Future Times tamp Values column values when
any clause or filter applied
Key: HIVE-16056
URL: https://issues.apache.org/jira/browse/HIVE-16056
Project: Hive
Issue Type: Bug
Components: Beeline, Database/Schema
Affects Versions: 1.2.1
Reporter: Sunil Kumar
Hi,
We are observing different behavior of Hive for the timestamp column values.
When we apply clause like order by, distinct on same or other other column in
the hive query it print different result for the timestamp value for year which
start after 2300..
Steps:
1. Create a hive table
create table cutomer_sample(id int, arrival_time timestamp, dob date) stored
as ORC;
2. Populate some data with future timestamp values
insert into table cutomer_sample values (1,'2015-01-01
00:00:00.0','2015-01-01'), (2,'2018-01-01 00:00:00.0','2018-01-01') ,
(3,'2099-01-01 00:00:00.0','2099-01-01'), (4,'2100-01-01
00:00:00.0','2100-01-01'),(5,'2500-01-01
00:00:00.0','2500-01-01'),(6,'2200-01-01
00:00:00.0','2200-01-01'),(7,'2300-01-01
00:00:00.0','2300-01-01'),(8,'2400-01-01 00:00:00.0','2400-01-01');
3. Select all data with any clause
select * from cutomer_sample;
Output:
select * from cutomer_sample;
+--------------------+------------------------------+---------------------+--+
| cutomer_sample.id | cutomer_sample.arrival_time | cutomer_sample.dob |
+--------------------+------------------------------+---------------------+--+
| 1 | 2015-01-01 00:00:00.0 | 2015-01-01 |
| 2 | 2018-01-01 00:00:00.0 | 2018-01-01 |
| 3 | 2099-01-01 00:00:00.0 | 2099-01-01 |
| 4 | 2100-01-01 00:00:00.0 | 2100-01-01 |
| 5 | 2500-01-01 00:00:00.0 | 2500-01-01 |
| 6 | 2200-01-01 00:00:00.0 | 2200-01-01 |
| 7 | 2300-01-01 00:00:00.0 | 2300-01-01 |
| 8 | 2400-01-01 00:00:00.0 | 2400-01-01 |
+--------------------+------------------------------+---------------------+--+
4. Apply order by on timestamp column
select * from cutomer_sample order by arrival_time ;
+--------------------+--------------------------------+---------------------+--+
| cutomer_sample.id | cutomer_sample.arrival_time | cutomer_sample.dob |
+--------------------+--------------------------------+---------------------+--+
| 7 | 1715-06-13 00:25:26.290448384 | 2300-01-01 |
| 8 | 1815-06-13 00:25:26.290448384 | 2400-01-01 |
| 5 | 1915-06-14 00:48:46.290448384 | 2500-01-01 |
| 1 | 2015-01-01 00:00:00.0 | 2015-01-01 |
| 2 | 2018-01-01 00:00:00.0 | 2018-01-01 |
| 3 | 2099-01-01 00:00:00.0 | 2099-01-01 |
| 4 | 2100-01-01 00:00:00.0 | 2100-01-01 |
| 6 | 2200-01-01 00:00:00.0 | 2200-01-01 |
+--------------------+--------------------------------+---------------------+--+
you can see value of timestamp got changed after 2300 year..
5. Apply order by on some other column still same behavior
+--------------------+--------------------------------+---------------------+--+
| cutomer_sample.id | cutomer_sample.arrival_time | cutomer_sample.dob |
+--------------------+--------------------------------+---------------------+--+
| 1 | 2015-01-01 00:00:00.0 | 2015-01-01 |
| 2 | 2018-01-01 00:00:00.0 | 2018-01-01 |
| 3 | 2099-01-01 00:00:00.0 | 2099-01-01 |
| 4 | 2100-01-01 00:00:00.0 | 2100-01-01 |
| 6 | 2200-01-01 00:00:00.0 | 2200-01-01 |
| 7 | 1715-06-13 00:25:26.290448384 | 2300-01-01 |
| 8 | 1815-06-13 00:25:26.290448384 | 2400-01-01 |
| 5 | 1915-06-14 00:48:46.290448384 | 2500-01-01 |
+--------------------+--------------------------------+---------------------+--+
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)