[ https://issues.apache.org/jira/browse/DRILL-6032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16379694#comment-16379694 ]
ASF GitHub Bot commented on DRILL-6032: --------------------------------------- Github user Ben-Zvi commented on a diff in the pull request: https://github.com/apache/drill/pull/1101#discussion_r171130622 --- Diff: exec/java-exec/src/main/resources/drill-module.conf --- @@ -427,8 +427,8 @@ drill.exec.options: { exec.enable_union_type: false, exec.errors.verbose: false, exec.hashagg.mem_limit: 0, - exec.hashagg.min_batches_per_partition: 2, - exec.hashagg.num_partitions: 32, + exec.hashagg.min_batches_per_partition: 1, --- End diff -- This option was meant to create a "slack". **1** is the lowest value - requiring only 1 batch per each partition, i.e., no slack; so that requires the memory computations to be more precise now !! > Use RecordBatchSizer to estimate size of columns in HashAgg > ----------------------------------------------------------- > > Key: DRILL-6032 > URL: https://issues.apache.org/jira/browse/DRILL-6032 > Project: Apache Drill > Issue Type: Improvement > Reporter: Timothy Farkas > Assignee: Timothy Farkas > Priority: Major > Fix For: 1.13.0 > > > We need to use the RecordBatchSize to estimate the size of columns in the > Partition batches created by HashAgg. -- This message was sent by Atlassian JIRA (v7.6.3#76005)