Jerry Ylilammi created DRILL-4193:
-------------------------------------

             Summary: All Parquet columns are loaded when querying S3
                 Key: DRILL-4193
                 URL: https://issues.apache.org/jira/browse/DRILL-4193
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Parquet
    Affects Versions: 1.3.0
            Reporter: Jerry Ylilammi

Drill starts downloading all of the data from S3 instead of taking advantage of Parquet's columnar format and loading only the required column.

Query:
{code}
SELECT DISTINCT data.measurement.cid FROM mys3bucket.`test/datatable` AS data;
{code}

Parquet file:
{code}
...
measurement:
.cid: INT64 GZIP DO:0 FPO:9380624 SZ:110/76/0.69 VC:32327 ENC:BIT_PACKED,PLAIN_DICTIONARY,RLE
...
{code}
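
To illustrate how little data the query should need, here is a rough Python sketch that turns the metadata line above into the single HTTP byte range a column-aware reader would request for `measurement.cid`. It is not Drill code. It assumes the usual reading of the dump fields: DO is the dictionary page offset, FPO is the first data page offset, the first number in SZ is the compressed size in bytes, and VC is the value count. The names `ColumnChunk`, `parse_chunk_meta` and `range_header` are made up for this example.

```python
import re
from typing import NamedTuple


class ColumnChunk(NamedTuple):
    dictionary_offset: int  # DO (assumed: dictionary page offset, 0 if unset)
    first_page_offset: int  # FPO (assumed: first data page offset)
    compressed_size: int    # first number in SZ (assumed: compressed bytes)
    value_count: int        # VC


def parse_chunk_meta(line: str) -> ColumnChunk:
    """Extract offsets and sizes from one column line of the metadata dump."""
    m = re.search(r"DO:(\d+) FPO:(\d+) SZ:(\d+)/(\d+)/[\d.]+ VC:(\d+)", line)
    if m is None:
        raise ValueError(f"unrecognized column metadata: {line!r}")
    do, fpo, compressed, _uncompressed, vc = (int(g) for g in m.groups())
    return ColumnChunk(do, fpo, compressed, vc)


def range_header(chunk: ColumnChunk) -> str:
    """Build the HTTP Range header value covering just this column chunk."""
    # The chunk starts at the dictionary page when one is recorded before
    # the first data page; otherwise it starts at the first data page.
    if 0 < chunk.dictionary_offset < chunk.first_page_offset:
        start = chunk.dictionary_offset
    else:
        start = chunk.first_page_offset
    end = start + chunk.compressed_size - 1  # Range end is inclusive
    return f"bytes={start}-{end}"


META = (".cid: INT64 GZIP DO:0 FPO:9380624 SZ:110/76/0.69 VC:32327 "
        "ENC:BIT_PACKED,PLAIN_DICTIONARY,RLE")

chunk = parse_chunk_meta(META)
print(range_header(chunk))  # bytes=9380624-9380733
```

Under those assumptions, the column chunk for this query is 110 bytes, so apart from the file footer a reader that honors column projection should only need that one small range from S3 rather than the whole file.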