2010YOUY01 commented on code in PR #23181:
URL: https://github.com/apache/datafusion/pull/23181#discussion_r3504375797


##########
datafusion/core/tests/fuzz_cases/aggregate_fuzz.rs:
##########
@@ -373,9 +378,6 @@ async fn run_aggregate_test(input1: Vec<RecordBatch>, 
group_by_columns: Vec<&str
         .await
         .unwrap();
     assert!(collected_running.len() > 2);
-    // Running should produce more chunk than the usual AggregateExec.
-    // Otherwise it means that we cannot generate result in running mode.
-    assert!(collected_running.len() > collected_usual.len());

Review Comment:
   This makes sense with the current implementation.
   
   However, in the future, we might want to accumulate `batch_size` groups 
before emitting, so downstream operators can be better vectorized. In that 
case, this assertion would no longer hold.
   
   So we would need a different assertion to check whether the ordered 
aggregation variant is used, likely via `EXPLAIN`. However, let's delay that 
change until this idea is actually implemented.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to