Amogh Margoor has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17592 )

Change subject: IMPALA-7635: Reducing HashTable size by packing it's buckets 
efficiently.
......................................................................


Patch Set 5:

> (1 comment)
 >
 > I grabbed this change and was doing some local testing, and I ran
 > into a scenario that crashes Impala.
 >
 > Here's what I did:
 > 1. I loaded a larger TPC-H dataset in parquet (takes quite a while
 > and will use up some disk space):
 > bin/load-data.py -w tpch -s 42 
 > --table_formats=text/none/none,parquet/none/none
 > 2. Ran the following multiple times (it is intermittent):
 > use tpch42_parquet;
 > SELECT l_orderkey,
 > count(*) AS cnt
 > FROM lineitem
 > GROUP BY l_orderkey
 > HAVING count(*) > 9999999999999;
 >
 > That intermittently crashes with this stack:
 > C  [impalad+0x27e8546]  long impala::HashTable::Probe<true,
 > false>(impala::HashTable::Bucket*, long, impala::HashTableCtx*,
 > unsigned int, bool*, impala::HashTable::BucketData*)+0x28c
 > C  [impalad+0x27e2141]  impala::HashTable::ResizeBuckets(long,
 > impala::HashTableCtx*, bool*)+0x7d1
 > C  [impalad+0x27e194d]  impala::HashTable::CheckAndResize(unsigned
 > long, impala::HashTableCtx*, bool*)+0xcb
 > C  [impalad+0x27c8a48]  
 > impala::GroupingAggregator::CheckAndResizeHashPartitions(bool,
 > int, impala::HashTableCtx*)+0x176
 >
 > This may reproduce on smaller datasets.

Thanks for checking this out. I will check it and revert back.


--
To view, visit http://gerrit.cloudera.org:8080/17592
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I72912ae9353b0d567a976ca712d2d193e035df9b
Gerrit-Change-Number: 17592
Gerrit-PatchSet: 5
Gerrit-Owner: Amogh Margoor <amarg...@gmail.com>
Gerrit-Reviewer: Amogh Margoor <amarg...@gmail.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Joe McDonnell <joemcdonn...@cloudera.com>
Gerrit-Reviewer: Zoltan Borok-Nagy <borokna...@cloudera.com>
Gerrit-Comment-Date: Thu, 08 Jul 2021 17:06:11 +0000
Gerrit-HasComments: No

Reply via email to