Jörn Horstmann created ARROW-10240:
--------------------------------------

             Summary: [Rust] [Datafusion] Optionally load tpch data into memory 
before running benchmark query
                 Key: ARROW-10240
                 URL: https://issues.apache.org/jira/browse/ARROW-10240
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Rust, Rust - DataFusion
            Reporter: Jörn Horstmann


The tpch benchmark runtime seems to be dominated by csv parsing code and it is 
really difficult to see any performance hotspots related to actual query 
execution in a flamegraph.

With the date in memory and more iterations it should be easier to profile and 
find bottlenecks.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to