[ https://issues.apache.org/jira/browse/HIVE-23721?focusedWorklogId=477545&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-477545 ]
ASF GitHub Bot logged work on HIVE-23721: ----------------------------------------- Author: ASF GitHub Bot Created on: 02/Sep/20 00:41 Start Date: 02/Sep/20 00:41 Worklog Time Spent: 10m Work Description: github-actions[bot] commented on pull request #1202: URL: https://github.com/apache/hive/pull/1202#issuecomment-685209535 This pull request has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Feel free to reach out on the d...@hive.apache.org list if the patch is in need of reviews. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking ------------------- Worklog Id: (was: 477545) Time Spent: 0.5h (was: 20m) > MetaStoreDirectSql.ensureDbInit() need to optimize QuerySQL > ----------------------------------------------------------- > > Key: HIVE-23721 > URL: https://issues.apache.org/jira/browse/HIVE-23721 > Project: Hive > Issue Type: Bug > Affects Versions: 4.0.0, 3.1.2 > Environment: Hadoop 3.1(1700+ nodes) > YARN 3.1 (with timelineserver enabled,https enabled) > Hive 3.1 (15 HS2 instance) > 60000+ YARN Applications every day > Reporter: YulongZ > Assignee: zhangbutao > Priority: Critical > Labels: pull-request-available > Fix For: 4.0.0 > > Attachments: HIVE-23721.01.patch > > Time Spent: 0.5h > Remaining Estimate: 0h > > From Hive3.0,catalog added to hivemeta,many schema of metastore added column > “catName”,and index for table added column “catName”。 > In MetaStoreDirectSql.ensureDbInit() ,two queries below > “ > initQueries.add(pm.newQuery(MTableColumnStatistics.class, "dbName == > ''")); > initQueries.add(pm.newQuery(MPartitionColumnStatistics.class, "dbName > == ''")); > ” > should use "catName == ''" instead of "dbName == ''",because “catName” is the > first index column。 > When data of metastore become large,for example, table of > MPartitionColumnStatistics have millions of lines。The > “newQuery(MPartitionColumnStatistics.class, "dbName == ''")” for metastore > executed very slowly,and the query “show tables“ for hiveserver2 executed > very slowly too。 -- This message was sent by Atlassian Jira (v8.3.4#803005)