zhuqi-lucas commented on PR #16604:
URL: https://github.com/apache/datafusion/pull/16604#issuecomment-3015305319

   Testing result, it looks good:
   
   ```rust
   cargo run --profile release-nonlto   --target aarch64-apple-darwin --bin 
dfbench -- clickbench --queries-path  ./benchmarks/queries/clickbench/queries/ 
-p ./benchmarks/data/hits_partitioned/ -q 21 -i 1  --debug
       Finished `release-nonlto` profile [optimized] target(s) in 0.28s
        Running `target/aarch64-apple-darwin/release-nonlto/dfbench clickbench 
--queries-path ./benchmarks/queries/clickbench/queries/ -p 
./benchmarks/data/hits_partitioned/ -q 21 -i 1 --debug`
   Running benchmarks with the following options: RunOpt { query: Some(21), 
common: CommonOpt { iterations: 1, partitions: None, batch_size: None, 
mem_pool_type: "fair", memory_limit: None, sort_spill_reservation_bytes: None, 
debug: true }, path: "./benchmarks/data/hits_partitioned/", queries_path: 
"./benchmarks/queries/clickbench/queries/", output_path: None }
   Q21: SELECT "SearchPhrase", MIN("URL"), COUNT(*) AS c FROM hits WHERE "URL" 
LIKE '%google%' AND "SearchPhrase" <> '' GROUP BY "SearchPhrase" ORDER BY c 
DESC LIMIT 10;
   
   Query 21 iteration 0 took 1162.6 ms and returned 10 rows
   +---------------+-------------------------------+
   | plan_type     | plan                          |
   +---------------+-------------------------------+
   | physical_plan | ┌───────────────────────────┐ |
   |               | │  SortPreservingMergeExec  │ |
   |               | │    --------------------   │ |
   |               | │      c DESClimit: 10      │ |
   |               | └─────────────┬─────────────┘ |
   |               | ┌─────────────┴─────────────┐ |
   |               | │       SortExec(TopK)      │ |
   |               | │    --------------------   │ |
   |               | │          c@2 DESC         │ |
   |               | │                           │ |
   |               | │         limit: 10         │ |
   |               | └─────────────┬─────────────┘ |
   |               | ┌─────────────┴─────────────┐ |
   |               | │       ProjectionExec      │ |
   |               | │    --------------------   │ |
   |               | │       SearchPhrase:       │ |
   |               | │        SearchPhrase       │ |
   |               | │                           │ |
   |               | │     c: count(Int64(1))    │ |
   |               | │                           │ |
   |               | │       min(hits.URL):      │ |
   |               | │       min(hits.URL)       │ |
   |               | └─────────────┬─────────────┘ |
   |               | ┌─────────────┴─────────────┐ |
   |               | │       AggregateExec       │ |
   |               | │    --------------------   │ |
   |               | │           aggr:           │ |
   |               | │  min(hits.URL), count(1)  │ |
   |               | │                           │ |
   |               | │         group_by:         │ |
   |               | │        SearchPhrase       │ |
   |               | │                           │ |
   |               | │           mode:           │ |
   |               | │      FinalPartitioned     │ |
   |               | └─────────────┬─────────────┘ |
   |               | ┌─────────────┴─────────────┐ |
   |               | │    CoalesceBatchesExec    │ |
   |               | │    --------------------   │ |
   |               | │     target_batch_size:    │ |
   |               | │            8192           │ |
   |               | └─────────────┬─────────────┘ |
   |               | ┌─────────────┴─────────────┐ |
   |               | │      RepartitionExec      │ |
   |               | │    --------------------   │ |
   |               | │ partition_count(in->out): │ |
   |               | │          14 -> 14         │ |
   |               | │                           │ |
   |               | │    partitioning_scheme:   │ |
   |               | │ Hash([SearchPhrase@0], 14)│ |
   |               | └─────────────┬─────────────┘ |
   |               | ┌─────────────┴─────────────┐ |
   |               | │       AggregateExec       │ |
   |               | │    --------------------   │ |
   |               | │           aggr:           │ |
   |               | │  min(hits.URL), count(1)  │ |
   |               | │                           │ |
   |               | │         group_by:         │ |
   |               | │        SearchPhrase       │ |
   |               | │                           │ |
   |               | │       mode: Partial       │ |
   |               | └─────────────┬─────────────┘ |
   |               | ┌─────────────┴─────────────┐ |
   |               | │    CoalesceBatchesExec    │ |
   |               | │    --------------------   │ |
   |               | │     target_batch_size:    │ |
   |               | │            8192           │ |
   |               | └─────────────┬─────────────┘ |
   |               | ┌─────────────┴─────────────┐ |
   |               | │         FilterExec        │ |
   |               | │    --------------------   │ |
   |               | │         predicate:        │ |
   |               | │   URL LIKE %google% AND   │ |
   |               | │       SearchPhrase !=     │ |
   |               | └─────────────┬─────────────┘ |
   |               | ┌─────────────┴─────────────┐ |
   |               | │       DataSourceExec      │ |
   |               | │    --------------------   │ |
   |               | │         files: 113        │ |
   |               | │      format: parquet      │ |
   |               | │                           │ |
   |               | │         predicate:        │ |
   |               | │   URL LIKE %google% AND   │ |
   |               | │       SearchPhrase !=     │ |
   |               | └───────────────────────────┘ |
   |               |                               |
   +---------------+-------------------------------+
   Query 21 avg time: 1162.59 ms
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to