2010YOUY01 commented on issue #17698:
URL: https://github.com/apache/datafusion/issues/17698#issuecomment-3322309747

   Tested it on `TPCH` dataset (code at 
https://github.com/2010YOUY01/datafusion-sqlstorm)
   
   It has found 2500/17000 failures on valid queries (likely many duplicates)
   
   Here is the speed comparison with DuckDB. There are many queries 
(represented by the dots in the top-right corner of the figure) that are 100× 
slower than DuckDB. There may be even worse cases, since there is a 10-second 
timeout and timed-out queries are not counted.
   
   
![figure](https://github.com/2010YOUY01/datafusion-sqlstorm/blob/main/benchmark_plot_log.png?raw=true)
   
   There is still a lot of work to do to triage bugs and investigate 
significantly slower queries.
   (Is there any convenient way to filter out slow queries caused by bad join 
orders?)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to