[ 
https://issues.apache.org/jira/browse/IMPALA-7854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17036429#comment-17036429
 ] 

Vihang Karajgaonkar commented on IMPALA-7854:
---------------------------------------------

I think we should look at this. I will put this on my plate for now, but I 
don't think I will be able to take this up soon.

> Slow ALTER TABLE and LOAD DATA statements for tables with large number of 
> partitions
> ------------------------------------------------------------------------------------
>
>                 Key: IMPALA-7854
>                 URL: https://issues.apache.org/jira/browse/IMPALA-7854
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Catalog
>    Affects Versions: Impala 2.12.0
>         Environment: 14 Nodes
> Table in question has 20 columns, 3 partition columns, and 57,475 partitions
>            Reporter: vietn
>            Priority: Major
>              Labels: impala, performance
>
> ALTER TABLE and LOAD DATA statements take minutes (9 minutes for ALTER TABLE 
> and 6 minutes for LOAD DATA) for tables with a large number of partitions.
> Our workaround was to use Hive to perform the LOAD DATA and then perform a 
> REFRESH PARTITION using Impala.
>  * 14 Nodes
>  * Table in question has 20 columns, 3 partition columns, and 57,475 
> partitions



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org

Reply via email to