[ https://issues.apache.org/jira/browse/DRILL-7238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Venkata Jyothsna Donapati updated DRILL-7238: --------------------------------------------- Component/s: Metadata > Drill does not use DirectScan for non-existent columns > ------------------------------------------------------ > > Key: DRILL-7238 > URL: https://issues.apache.org/jira/browse/DRILL-7238 > Project: Apache Drill > Issue Type: Bug > Components: Metadata > Reporter: Venkata Jyothsna Donapati > Assignee: Venkata Jyothsna Donapati > Priority: Major > > This query does not use the summary metadata cache file: > select count(int_nulls_id), count(int_id), count(ss_ticket_number), > count(extra) from store_sales_null_blocks_int; > In this query, extra is a column that does not exist (non-existent column). > Here is the explain plan: > {noformat} > | 00-00 Screen > 00-01 Project(EXPR$0=[$0], EXPR$1=[$1], EXPR$2=[$2], EXPR$3=[$3]) > 00-02 StreamAgg(group=[{}], EXPR$0=[$SUM0($0)], EXPR$1=[$SUM0($1)], > EXPR$2=[$SUM0($2)], EXPR$3=[$SUM0($3)]) > 00-03 UnionExchange > 01-01 StreamAgg(group=[{}], EXPR$0=[COUNT($0)], > EXPR$1=[COUNT($1)], EXPR$2=[COUNT($2)], EXPR$3=[COUNT($3)]) > 01-02 Scan(table=[[dfs, parquet_metadata_cache, > store_sales_null_blocks_int]], groupscan=[ParquetGroupScan > [entries=[ReadEntryWithPath > [path=/drill/testdata/metadata_cache/store_sales_null_blocks_int]], > selectionRoot=/drill/testdata/metadata_cache/store_sales_null_blocks_int, > numFiles=1, numRowGroups=11, usedMetadataFile=true, > cacheFileRoot=/drill/testdata/metadata_cache/store_sales_null_blocks_int, > columns=[`int_nulls_id`, `int_id`, `ss_ticket_number`, `extra`]]]) > {noformat} > This is a regression from Drill 1.15. -- This message was sent by Atlassian JIRA (v7.6.3#76005)