[ https://issues.apache.org/jira/browse/HIVE-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030836#comment-13030836 ]
Steven Wong commented on HIVE-2087:
-----------------------------------
This problem seems to occur only when the insert specifies no static partition column (that is, when every partition column is dynamic).
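
To make the distinction concrete, here is a minimal HiveQL sketch of the two insert forms. It is an illustration under assumptions, not a verified reproduction: table T2, its partition columns (ds, p), the columns col1 and col2, and the date literal are all hypothetical, and only the first statement comes from the issue description.

```sql
-- Enable dynamic partition inserts for this session.
SET hive.exec.dynamic.partition=true;
-- nonstrict mode is needed when every partition column is dynamic.
SET hive.exec.dynamic.partition.mode=nonstrict;

-- Case 1: no static partition column (the statement from the report).
-- The value of p is taken from the last column of the SELECT.
INSERT OVERWRITE TABLE T PARTITION (p)
SELECT * FROM another_table;

-- Case 2: a static partition column (ds) ahead of the dynamic one (p).
-- Assumes a hypothetical table T2 partitioned by (ds STRING, p STRING);
-- col1, col2, and the date value are placeholders.
INSERT OVERWRITE TABLE T2 PARTITION (ds='2011-01-01', p)
SELECT col1, col2, p FROM another_table;
```

If the observation above holds, the up-front loading of all existing partitions would be expected in Case 1 but not in Case 2.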
> Dynamic partition insert performance problem
> --------------------------------------------
>
> Key: HIVE-2087
> URL: https://issues.apache.org/jira/browse/HIVE-2087
> Project: Hive
> Issue Type: Bug
> Components: Metastore
> Affects Versions: 0.7.0
> Environment: Amazon EMR, S3
> Reporter: Q Long
>
> Create an external table T (backed by S3), partitioned by column P.
> Populate table T so that it has a large number of partitions (say 100), then
> execute a statement like
> insert overwrite table T partition (p) select * from another_table
> Check the Hive server log: it shows that all existing partitions are
> read and loaded before any mapper starts working. This seems excessive, given
> that the insert statement may create or overwrite only a very small number of
> partitions. Is there another reason that an insert using dynamic partitions
> requires loading the whole table?
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira