[ https://issues.apache.org/jira/browse/DRILL-5146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16067434#comment-16067434 ]
Paul Rogers commented on DRILL-5146: ------------------------------------ This case works with the code in DRILL-5601. {code} Results: 5,001 records, 2 batches, 104,034 ms {code} > Unnecessary spilling to disk by sort when we only have 5000 rows with one > column > -------------------------------------------------------------------------------- > > Key: DRILL-5146 > URL: https://issues.apache.org/jira/browse/DRILL-5146 > Project: Apache Drill > Issue Type: Bug > Components: Execution - Relational Operators > Reporter: Rahul Challapalli > Assignee: Paul Rogers > Attachments: 27a52efb-0ce6-f2ad-7216-aef007926649.sys.drill, > data.tgz, spill.log > > > git.commit.id.abbrev=cf2b7c7 > The below query spills to disk for the sort. The dataset contains 5000 files > and each file contains a single record. > {code} > select * from dfs.`/drill/testdata/resource-manager/5000files/text` order by > columns[1]; > {code} > Enviironment : > {code} > DRILL_MAX_DIRECT_MEMORY="16G" > DRILL_MAX_HEAP="4G" > {code} > I attached the dataset, logs and the profile -- This message was sent by Atlassian JIRA (v6.4.14#64029)