Jerry Ylilammi created DRILL-4193:
-------------------------------------

             Summary: All Parquet columns are loaded when querying S3
                 Key: DRILL-4193
                 URL: https://issues.apache.org/jira/browse/DRILL-4193
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Parquet
    Affects Versions: 1.3.0
            Reporter: Jerry Ylilammi


Drill starts downloading all data from S3 not making use of Parquet being 
columnar format and only loading required column.

Query:
{code}SELECT DISTINCT data.measurement.cid 
FROM mys3bucket.`test/datatable` AS data;{code}

Parquet file:
{code}...
measurement:
.cid:                  INT64 GZIP DO:0 FPO:9380624 SZ:110/76/0.69 VC:32327 
ENC:BIT_PACKED,PLAIN_DICTIONARY,RLE
...{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to