xudong963 commented on code in PR #15473:
URL: https://github.com/apache/datafusion/pull/15473#discussion_r2024400775
##########
datafusion/datasource/src/mod.rs:
##########
@@ -313,6 +314,78 @@ async fn find_first_newline(
Ok(index)
}
+/// Generates test files with min-max statistics in different overlap patterns
+/// Used by tests and benchmarks
+pub fn generate_test_files(num_files: usize, overlap_factor: f64) -> Vec<FileGroup> {
Review Comment:
Also, based on my recent experience working on statistics, this function
will be very helpful for testing statistics functionality later.
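
To illustrate the kind of overlap patterns such a helper can produce, here is a minimal, self-contained sketch. It is not the implementation in this PR: `FileRange` and `generate_ranges` are hypothetical names, and the meaning given to `overlap_factor` (0.0 = adjacent ranges, 1.0 = identical ranges) is an assumption made for the example.

```rust
/// Hypothetical stand-in for the min/max statistics of one file on one column.
/// (The real code in the PR works with `FileGroup`; this type is illustrative only.)
#[derive(Debug, Clone, PartialEq)]
struct FileRange {
    min: f64,
    max: f64,
}

/// Builds `num_files` ranges of equal width.
///
/// Assumed semantics: consecutive ranges are shifted by
/// `WIDTH * (1.0 - overlap_factor)`, so `0.0` yields adjacent ranges that
/// share only an endpoint, and `1.0` yields identical ranges.
fn generate_ranges(num_files: usize, overlap_factor: f64) -> Vec<FileRange> {
    const WIDTH: f64 = 100.0;
    // Keep the factor inside [0, 1] so the step is never negative.
    let overlap = overlap_factor.clamp(0.0, 1.0);
    let step = WIDTH * (1.0 - overlap);
    (0..num_files)
        .map(|i| {
            let min = i as f64 * step;
            FileRange { min, max: min + WIDTH }
        })
        .collect()
}
```

Under these assumptions, `generate_ranges(3, 0.5)` gives the ranges 0..100, 50..150, and 100..200, which is the sort of input a statistics-based pruning or ordering test could use.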
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]