[ https://issues.apache.org/jira/browse/DRILL-6123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16408548#comment-16408548 ]
Bridget Bevens commented on DRILL-6123: --------------------------------------- Added content here: [https://drill.apache.org/docs/configuring-drill-memory/#modifying-memory-allocated-to-queries] Removed the doc-impacting label. Please add the label back if doc coverage is not sufficient. Thanks, Bridget > Limit batch size for Merge Join based on memory > ----------------------------------------------- > > Key: DRILL-6123 > URL: https://issues.apache.org/jira/browse/DRILL-6123 > Project: Apache Drill > Issue Type: Improvement > Components: Execution - Flow > Affects Versions: 1.12.0 > Reporter: Padma Penumarthy > Assignee: Padma Penumarthy > Priority: Major > Labels: ready-to-commit > Fix For: 1.13.0 > > > Merge join limits output batch size to 32K rows irrespective of row size. > This can create very large or very small batches (in terms of memory), > depending upon average row width. Change this to figure out output row count > based on memory specified with the new outputBatchSize option and average row > width of incoming left and right batches. Output row count will be minimum of > 1 and max of 64k. -- This message was sent by Atlassian JIRA (v7.6.3#76005)