Sahil Takiar has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/15998


Change subject: IMPALA-9777: Set hive.optimize.sort.dynamic.partition to true 
for dynamic inserts
......................................................................

IMPALA-9777: Set hive.optimize.sort.dynamic.partition to true for dynamic 
inserts

This sets hive.optimize.sort.dynamic.partition to true by default during
data load. This option takes effect during Hive dynamic partitioning
inserts. It introduces a sort into the insert query so that all data is
sorted on the partition key. This allows the reducers to only open a single
file at a time when writing out files. When this config is set to false,
dynamic partitioning inserts will be run as a map-only job that
potentially opens hundreds of files per partition.

Testing:
* Ran core tests for Impala-EC

Change-Id: Ic2b7c0ec40a02da2640fae20cf640517fd1f4fef
---
M testdata/bin/generate-schema-statements.py
M testdata/datasets/tpcds/tpcds_schema_template.sql
2 files changed, 5 insertions(+), 0 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/98/15998/1
--
To view, visit http://gerrit.cloudera.org:8080/15998
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ic2b7c0ec40a02da2640fae20cf640517fd1f4fef
Gerrit-Change-Number: 15998
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar <[email protected]>

Reply via email to