Nishant Bangarwa created HIVE-20014:
---------------------------------------
Summary: Druid SECOND/HOUR/MINUTE does not return correct values
when applied to String Columns
Key: HIVE-20014
URL: https://issues.apache.org/jira/browse/HIVE-20014
Project: Hive
Issue Type: Bug
Reporter: Nishant Bangarwa
Assignee: Nishant Bangarwa
Query SELECT MINUTE(`time1`) FROM calcs; returns null when the String column
only contains timestamp and does not contain any date information in the
column. The Druid parser fails to parse the time string values and returns
null.
{code}
1: jdbc:hive2://ctr-e138-1518143905142-379982> SELECT MINUTE(`time1`) FROM
calcs;
INFO : Compiling
command(queryId=hive_20180627145215_05147329-b8d8-491c-9bab-6fd5045542db):
SELECT MINUTE(`time1`) FROM calcs
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:vc,
type:int, comment:null)], properties:null)
INFO : Completed compiling
command(queryId=hive_20180627145215_05147329-b8d8-491c-9bab-6fd5045542db); Time
taken: 0.134 seconds
INFO : Executing
command(queryId=hive_20180627145215_05147329-b8d8-491c-9bab-6fd5045542db):
SELECT MINUTE(`time1`) FROM calcs
INFO : Completed executing
command(queryId=hive_20180627145215_05147329-b8d8-491c-9bab-6fd5045542db); Time
taken: 0.002 seconds
INFO : OK
+-------+
| vc |
+-------+
| NULL |
| NULL |
| NULL |
| NULL |
| NULL |
| NULL |
| NULL |
| NULL |
| NULL |
| NULL |
| NULL |
| NULL |
| NULL |
| NULL |
| NULL |
| NULL |
| NULL |
+-------+
17 rows selected (0.266 seconds)
1: jdbc:hive2://ctr-e138-1518143905142-379982> SELECT time1 from calcs;
INFO : Compiling
command(queryId=hive_20180627145225_93b872de-a698-4859-9730-983eede6935d):
SELECT time1 from calcs
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:time1,
type:string, comment:null)], properties:null)
INFO : Completed compiling
command(queryId=hive_20180627145225_93b872de-a698-4859-9730-983eede6935d); Time
taken: 0.116 seconds
INFO : Executing
command(queryId=hive_20180627145225_93b872de-a698-4859-9730-983eede6935d):
SELECT time1 from calcs
INFO : Completed executing
command(queryId=hive_20180627145225_93b872de-a698-4859-9730-983eede6935d); Time
taken: 0.003 seconds
INFO : OK
+-----------+
| time1 |
+-----------+
| 22:20:14 |
| 22:50:16 |
| 19:36:22 |
| 19:48:23 |
| 00:05:57 |
| NULL |
| 04:48:07 |
| NULL |
| 19:57:33 |
| NULL |
| 04:40:49 |
| 02:05:25 |
| NULL |
| NULL |
| 12:33:57 |
| 18:58:41 |
| 09:33:31 |
+-----------+
17 rows selected (0.202 seconds)
1: jdbc:hive2://ctr-e138-1518143905142-379982> EXPLAIN SELECT MINUTE(`time1`)
FROM calcs;
INFO : Compiling
command(queryId=hive_20180627145237_39e53a7e-35cb-4e17-8ccb-884c6f6358cd):
EXPLAIN SELECT MINUTE(`time1`) FROM calcs
INFO : Semantic Analysis Completed (retrial = false)
INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:Explain,
type:string, comment:null)], properties:null)
INFO : Completed compiling
command(queryId=hive_20180627145237_39e53a7e-35cb-4e17-8ccb-884c6f6358cd); Time
taken: 0.107 seconds
INFO : Executing
command(queryId=hive_20180627145237_39e53a7e-35cb-4e17-8ccb-884c6f6358cd):
EXPLAIN SELECT MINUTE(`time1`) FROM calcs
INFO : Starting task [Stage-1:EXPLAIN] in serial mode
INFO : Completed executing
command(queryId=hive_20180627145237_39e53a7e-35cb-4e17-8ccb-884c6f6358cd); Time
taken: 0.003 seconds
INFO : OK
+----------------------------------------------------+
| Explain |
+----------------------------------------------------+
| Plan optimized by CBO. |
| |
| Stage-0 |
| Fetch Operator |
| limit:-1 |
| Select Operator [SEL_1] |
| Output:["_col0"] |
| TableScan [TS_0] |
|
Output:["vc"],properties:{"druid.fieldNames":"vc","druid.fieldTypes":"int","druid.query.json":"{\"queryType\":\"scan\",\"dataSource\":\"druid_tableau.calcs\",\"intervals\":[\"1900-01-01T00:00:00.000Z/3000-01-01T00:00:00.000Z\"],\"virtualColumns\":[{\"type\":\"expression\",\"name\":\"vc\",\"expression\":\"timestamp_extract(timestamp_parse(\\\"time1\\\",null,'UTC'),'MINUTE','UTC')\",\"outputType\":\"LONG\"}],\"columns\":[\"vc\"],\"resultFormat\":\"compactedList\"}","druid.query.type":"scan"}
|
| |
+----------------------------------------------------+
10 rows selected (0.136 seconds)
{code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)