[
https://issues.apache.org/jira/browse/HIVE-9588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14313261#comment-14313261
]
Mithun Radhakrishnan commented on HIVE-9588:
--------------------------------------------
A minor update:
1. Dropping 2K partitions using HCatClient.dropPartitions() used to take 204
seconds for a managed table on my test setup (with an Oracle backend, and
remote metastore). This now takes 83 seconds.
2. Dropping 5K partitions used to take about 7 minutes. It now takes 4.
> Reimplement HCatClientHMSImpl.dropPartitions() with HMSC.dropPartitions()
> -------------------------------------------------------------------------
>
> Key: HIVE-9588
> URL: https://issues.apache.org/jira/browse/HIVE-9588
> Project: Hive
> Issue Type: Bug
> Components: HCatalog, Metastore, Thrift API
> Affects Versions: 0.14.0
> Reporter: Mithun Radhakrishnan
> Assignee: Mithun Radhakrishnan
> Attachments: HIVE-9588.1.patch, HIVE-9588.2.patch
>
>
> {{HCatClientHMSImpl.dropPartitions()}} currently has an embarrassingly
> inefficient implementation. The partial partition-spec is converted into a
> filter-string. The partitions are fetched from the server, and then dropped
> one by one.
> Here's a reimplementation that uses the {{ExprNode}}-based
> {{HiveMetaStoreClient.dropPartitions()}}. It cuts out the excessive
> back-and-forth between the HMS and the client-side. It also reduces the
> memory footprint (from loading all the partitions that are to be dropped).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)