alamb commented on issue #10336: URL: https://github.com/apache/datafusion/issues/10336#issuecomment-2094127979
THank you @yyy1000 🙏 I think a good place to start would be to write some sqllogic level tests to cover the important cases Perhaos for the first test: 1. Create files: file1.parquet, file2.parquet both sorted on `a` but file 1 has the columns in the order `a, b, c` and file has the columns in the order `c, b, a`. The keyranges of values of a should be non overlapping 2. Create an external table `a, b, c` with explicit order by `a,` and then query `SELECT ... ORDER BY a` and make sure the output plan doesn't use sort preserving merge I think we could extend datafusion/sqllogictest/test_files/parquet_sorted_statistics.slt -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org