vegarsti commented on issue #17789:
URL: https://github.com/apache/datafusion/issues/17789#issuecomment-3351356260

   Here's a Samply profile as well:
   
   <img width="3040" height="1710" alt="Image" 
src="https://github.com/user-attachments/assets/84f9b2be-957e-4279-896a-a2424abef1bc";
 />
   
   From the root of the datafusion repo, I
   - Installed samply `cargo install --locked samply`
   - Installed tpchgen-cli `RUSTFLAGS='-C target-cpu=native' cargo install 
tpchgen-cli`
   - Ran `tpchgen-cli -s 0.1 --format=parquet --output-dir tpch-data`
   - Created `string-agg.sql`, see below
   - Compiled the CLI with the profiling profile `cargo build --profile 
profiling --bin datafusion-cli`
   - Ran `samply record -- ./target/profiling/datafusion-cli -f string-agg.sql`
   
   `string-agg.sql`:
   ```
   CREATE EXTERNAL TABLE partsupp
   STORED AS PARQUET
   LOCATION 'tpch-data/partsupp.parquet';
   
   select ps_partkey, string_agg(ps_comment, ';')
   from partsupp
   group by ps_partkey;
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to