[ 
https://issues.apache.org/jira/browse/DRILL-3892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14944138#comment-14944138
 ] 

Rahul Challapalli commented on DRILL-3892:
------------------------------------------

Verified and added a testcase. 

> Metadata cache not being leveraged when partition pruning is taking place
> -------------------------------------------------------------------------
>
>                 Key: DRILL-3892
>                 URL: https://issues.apache.org/jira/browse/DRILL-3892
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Metadata
>    Affects Versions: 1.2.0
>            Reporter: Rahul Challapalli
>            Assignee: Aman Sinha
>             Fix For: 1.3.0
>
>         Attachments: 
> 0001-DRILL-3892-Once-usedMetadataFile-is-set-to-true-don-.patch, 
> lineitem_deletecache.tgz
>
>
> git.commit.id.abbrev=92638dc
> As we can see from the below plan, metadata cache is not being leveraged even 
> when the cache file is being present
> {code}
> 0: jdbc:drill:zk=10.10.100.190:5181> refresh table metadata 
> dfs.`/drill/testdata/metadata_caching/lineitem_deletecache`;
> +-------+-------------------------------------------------------------------------------------------------+
> |  ok   |                                             summary                 
>                             |
> +-------+-------------------------------------------------------------------------------------------------+
> | true  | Successfully updated metadata for table 
> /drill/testdata/metadata_caching/lineitem_deletecache.  |
> +-------+-------------------------------------------------------------------------------------------------+
> 1 row selected (0.402 seconds)
> 0: jdbc:drill:zk=10.10.100.190:5181> explain plan for select count(*) from 
> dfs.`/drill/testdata/metadata_caching/lineitem_deletecache` where dir0=2006 
> group by l_linestatus;
> +------+------+
> | text | json |
> +------+------+
> | 00-00    Screen
> 00-01      Project(EXPR$0=[$1])
> 00-02        HashAgg(group=[{0}], EXPR$0=[COUNT()])
> 00-03          Project(l_linestatus=[$0])
> 00-04            Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath 
> [path=maprfs:/drill/testdata/metadata_caching/lineitem_deletecache/2006/1/lineitem_999.parquet]],
>  selectionRoot=maprfs:/drill/testdata/metadata_caching/lineitem_deletecache, 
> numFiles=1, usedMetadataFile=false, columns=[`l_linestatus`, `dir0`]]])
> {code}
> I attached the data set used. Let me know if you need anything more



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to