Shekharrajak commented on issue #3371: URL: https://github.com/apache/datafusion-comet/issues/3371#issuecomment-3901273572
Actually we can use the TPCDS and create fragmented tables - insert in batches, such that we have enough number of rows, number of files created to analyse the benchmark. Last few days I am playing with default spark vs comet native compaction using https://github.com/apache/iceberg-rust/pull/2106 branch and I will share what I found after refactoring. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
