[ 
https://issues.apache.org/jira/browse/DRILL-4577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15243067#comment-15243067
 ] 

ASF GitHub Bot commented on DRILL-4577:
---------------------------------------

Github user zfong commented on the pull request:

    https://github.com/apache/drill/pull/461#issuecomment-210493669
  
    Yes, 8 seconds may still be too long, but it's certainly better than what 
was there before.  We've also been working with Simba to add an "includeSchema" 
option to the ODBC driver so tools that query from INFORMATION_SCHEMA can 
further restrict the query to only the relevant tables within the specified 
schema.  That together with this patch should reduce the time significantly.  
Further improvements may still be needed even beyond this, but I suggest we 
address those incrementally rather than all at once.


> Improve performance for query on INFORMATION_SCHEMA when HIVE is plugged in
> ---------------------------------------------------------------------------
>
>                 Key: DRILL-4577
>                 URL: https://issues.apache.org/jira/browse/DRILL-4577
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Storage - Hive
>            Reporter: Sean Hsuan-Yi Chu
>            Assignee: Sean Hsuan-Yi Chu
>             Fix For: 1.7.0
>
>
> A query such as 
> {code}
> select * from INFORMATION_SCHEMA.`TABLES` 
> {code}
> is converted as calls to fetch all tables from storage plugins. 
> When users have Hive, the calls to hive metadata storage would be: 
> 1) get_table
> 2) get_partitions
> However, the information regarding partitions is not used in this type of 
> queries. Beside, a more efficient way is to fetch tables is to use 
> get_multi_table call.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to