Hao Zhu created DRILL-3578: ------------------------------ Summary: UnsupportedOperationException: Unable to get value vector class for minor type [FIXEDBINARY] and mode [OPTIONAL] Key: DRILL-3578 URL: https://issues.apache.org/jira/browse/DRILL-3578 Project: Apache Drill Issue Type: Bug Components: Execution - Data Types Affects Versions: 1.1.0 Reporter: Hao Zhu Assignee: Hanifi Gunes
The issue is Drill fails to read "timestamp" type in parquet file generated by Hive. How to reproduce: 1. Create a external Hive CSV table in hive 1.0: {code} create external table type_test_csv ( id1 int, id2 string, id3 timestamp, id4 double ) ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' STORED AS TEXTFILE LOCATION '/xxx/testcsv'; {code} 2. Put sample data for above external table: {code} 1,One,2015-01-01 00:01:00,1.0 2,Two,2015-01-02 00:02:00,2.0 {code} 3. Create a parquet hive table: {code} create external table type_test ( id1 int, id2 string, id3 timestamp, id4 double ) STORED AS PARQUET LOCATION '/xxx/type_test'; INSERT OVERWRITE TABLE type_test SELECT * FROM type_test_csv; {code} 4. Then querying the parquet file directly through filesystem storage plugin: {code} > select * from dfs.`xxx/type_test`; Error: SYSTEM ERROR: UnsupportedOperationException: Unable to get value vector class for minor type [FIXEDBINARY] and mode [OPTIONAL] Fragment 0:0 [Error Id: fccfe8b2-6427-46e5-8bfd-cac639e526e8 on h3.poc.com:31010] (state=,code=0) {code} 5. If the sample data is only 1 row: {code} 1,One,2015-01-01 00:01:00,1.0 {code} Then the error message would become: {code} > select * from dfs.`xxx/type_test`; Error: SYSTEM ERROR: UnsupportedOperationException: Unsupported type:INT96 [Error Id: b52b5d46-63a8-4be6-a11d-999a1b46c7c2 on h3.poc.com:31010] (state=,code=0) {code} Using Hive storage plugin works fine. This issue only applies to filesystem storage plugin. -- This message was sent by Atlassian JIRA (v6.3.4#6332)