[ https://issues.apache.org/jira/browse/DRILL-4577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15267771#comment-15267771 ]
ASF GitHub Bot commented on DRILL-4577: --------------------------------------- Github user jinfengni commented on the pull request: https://github.com/apache/drill/pull/461#issuecomment-216399417 +1. The patch looks good to me. Internally performance measurement shows orders of magnitude improvement for hive schema with up to 32k tables. > Improve performance for query on INFORMATION_SCHEMA when HIVE is plugged in > --------------------------------------------------------------------------- > > Key: DRILL-4577 > URL: https://issues.apache.org/jira/browse/DRILL-4577 > Project: Apache Drill > Issue Type: Improvement > Components: Storage - Hive > Reporter: Sean Hsuan-Yi Chu > Assignee: Sean Hsuan-Yi Chu > Fix For: 1.7.0 > > > A query such as > {code} > select * from INFORMATION_SCHEMA.`TABLES` > {code} > is converted as calls to fetch all tables from storage plugins. > When users have Hive, the calls to hive metadata storage would be: > 1) get_table > 2) get_partitions > However, the information regarding partitions is not used in this type of > queries. Beside, a more efficient way is to fetch tables is to use > get_multi_table call. -- This message was sent by Atlassian JIRA (v6.3.4#6332)