[jira] [Updated] (DRILL-7028) Reduce the planning time of queries on large Parquet tables with large metadata cache files

2019-11-04 Thread Arina Ielchiieva (Jira)


 [ 
https://issues.apache.org/jira/browse/DRILL-7028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arina Ielchiieva updated DRILL-7028:

Fix Version/s: (was: 1.16.0)

> Reduce the planning time of queries on large Parquet tables with large 
> metadata cache files
> ---
>
> Key: DRILL-7028
> URL: https://issues.apache.org/jira/browse/DRILL-7028
> Project: Apache Drill
>  Issue Type: Improvement
>  Components: Metadata
>Reporter: Venkata Jyothsna Donapati
>Assignee: Venkata Jyothsna Donapati
>Priority: Major
>  Labels: performance
> Fix For: 1.17.0
>
>
> If the Parquet table has a large number of small files, the metadata cache 
> files grow larger and the planner tries to read the large metadata cache file 
> which leads to the planning time overhead. Most of the time of execution is 
> spent during the planning phase.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (DRILL-7028) Reduce the planning time of queries on large Parquet tables with large metadata cache files

2019-04-10 Thread Sorabh Hamirwasia (JIRA)


 [ 
https://issues.apache.org/jira/browse/DRILL-7028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sorabh Hamirwasia updated DRILL-7028:
-
Fix Version/s: 1.17.0

> Reduce the planning time of queries on large Parquet tables with large 
> metadata cache files
> ---
>
> Key: DRILL-7028
> URL: https://issues.apache.org/jira/browse/DRILL-7028
> Project: Apache Drill
>  Issue Type: Improvement
>  Components: Metadata
>Reporter: Venkata Jyothsna Donapati
>Assignee: Venkata Jyothsna Donapati
>Priority: Major
>  Labels: performance
> Fix For: 1.16.0, 1.17.0
>
>
> If the Parquet table has a large number of small files, the metadata cache 
> files grow larger and the planner tries to read the large metadata cache file 
> which leads to the planning time overhead. Most of the time of execution is 
> spent during the planning phase.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)