pmcgleenon commented on issue #17721:
URL: https://github.com/apache/datafusion/issues/17721#issuecomment-3499427159

   did you select Metric = Hot Run?   The default for metric is Combined, which 
I think is a mixture of Hot run + cold run.  
   
   For partitioned parquet files on `c6a.4xlarge` the overall result looks the 
same as the previous version
   
   Q23 is faster in v50, along with Q24 and Q26
   
   <img width="1343" height="1208" alt="Image" 
src="https://github.com/user-attachments/assets/2021e43b-c3b8-4318-a782-c910d69cbf09";
 />
   
   I ran Q23 a few times on a `c6a.4xlarge` instance and got these results 
   
   ```
   # datafusion-cli -f create.sql q23.sql 2>^1 | grep "Elapsed" |sed -n 2p | 
awk '{ print $2 }'
   9.190
   # datafusion-cli -f create.sql q23.sql 2>^1 | grep "Elapsed" |sed -n 2p | 
awk '{ print $2 }'
   9.106
   # datafusion-cli -f create.sql q23.sql 2>^1 | grep "Elapsed" |sed -n 2p | 
awk '{ print $2 }'
   9.310
   # datafusion-cli -f create.sql q23.sql 2>^1 | grep "Elapsed" |sed -n 2p | 
awk '{ print $2 }'
   9.076
   ```
   
   The [`grep 
Elapsed`](https://github.com/ClickHouse/ClickBench/blob/main/datafusion/run.sh#L18)
 is taking the two Elapsed times from the output and `sed -n 2p` takes the 
second from the CLI output
   
   
   ```
   datafusion-cli -f create.sql q23.sql |grep Elapsed
   Elapsed 0.042 seconds.
   Elapsed 9.186 seconds.
   ```
   
   If you are seeing different results on the ClickBench AWS instance types I'm 
happy to re-run any of these


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to