[ https://issues.apache.org/jira/browse/HIVE-26437?focusedWorklogId=814234&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-814234 ]
ASF GitHub Bot logged work on HIVE-26437:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 06/Oct/22 06:14
            Start Date: 06/Oct/22 06:14
    Worklog Time Spent: 10m
      Work Description: atsaonerk opened a new pull request, #3644:
URL: https://github.com/apache/hive/pull/3644

   Currently, the partitions of a table are dumped in parallel, but a table that is not partitioned is dumped serially. This change introduces parallelism at the table level as well. The single thread pool that is currently used at the partition level is also used at the table level: the table-level dump task is added to the same thread pool. The degree of parallelism depends on the config parameter REPL_PARTITIONS_DUMP_PARALLELISM, whose default value is 100.

   This change introduces a new ExportService, which is responsible for exporting tables and partitions during repl dump. The ExportService is initialized and configured with thread pools by the HiveServer2 service. A new Hive config variable, REPL_TABLE_DUMP_PARALLELISM, is introduced to define the number of threads created in the thread pool. The ExportService, which is created as a singleton instance, is used by ReplDumpTask.

Issue Time Tracking
-------------------

    Worklog Id:     (was: 814234)
    Time Spent: 0.5h  (was: 20m)

> dump unpartitioned Tables in parallel
> -------------------------------------
>
>                 Key: HIVE-26437
>                 URL: https://issues.apache.org/jira/browse/HIVE-26437
>             Project: Hive
>          Issue Type: Improvement
>          Components: Hive
>            Reporter: Amit Saonerkar
>            Assignee: Amit Saonerkar
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
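
For readers unfamiliar with the pattern the pull request describes, here is a minimal sketch of a singleton service that owns a fixed-size thread pool and accepts table-level dump tasks. It is not Hive's actual ExportService: the class name ExportServiceSketch, its methods, and the "dumped:" result strings are hypothetical, and the `threads` argument only stands in for a parallelism setting such as the one named in the description.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical stand-in for the service described above; NOT Hive's real class.
final class ExportServiceSketch {
    private static final ExportServiceSketch INSTANCE = new ExportServiceSketch();
    private ExecutorService pool;

    private ExportServiceSketch() {}

    static ExportServiceSketch getInstance() {
        return INSTANCE;
    }

    // Meant to be called once at server start-up; `threads` plays the role
    // of the configured degree of parallelism (an assumption of this sketch).
    synchronized void configure(int threads) {
        if (pool == null) {
            pool = Executors.newFixedThreadPool(threads, r -> {
                Thread t = new Thread(r, "export-sketch");
                t.setDaemon(true); // do not keep the JVM alive in this demo
                return t;
            });
        }
    }

    synchronized <T> Future<T> submit(Callable<T> task) {
        if (pool == null) {
            throw new IllegalStateException("ExportServiceSketch is not configured");
        }
        return pool.submit(task);
    }

    synchronized void shutdown() {
        if (pool != null) {
            pool.shutdown();
            pool = null;
        }
    }

    // Submits one task per table to the shared pool, then waits for all of
    // them. Results come back in submission order, not completion order.
    static List<String> dumpTables(List<String> tables) {
        ExportServiceSketch service = getInstance();
        List<Future<String>> futures = new ArrayList<>();
        for (String table : tables) {
            futures.add(service.submit(() -> "dumped:" + table));
        }
        List<String> results = new ArrayList<>();
        for (Future<String> future : futures) {
            try {
                results.add(future.get());
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new IllegalStateException("interrupted while dumping", e);
            } catch (ExecutionException e) {
                throw new IllegalStateException("table dump failed", e.getCause());
            }
        }
        return results;
    }
}
```

In this sketch the caller configures the singleton once, for example `ExportServiceSketch.getInstance().configure(4)`, and then calls `ExportServiceSketch.dumpTables(...)`. Partition-level tasks could be submitted to the same pool, which would match the shared-pool design the description mentions.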