yaooqinn commented on PR #45453: URL: https://github.com/apache/spark/pull/45453#issuecomment-2082071977
Hey guys, I am currently regenerating the complete benchmark result with 20 jobs running simultaneously. Each job usually takes around 10 to 30 minutes to complete. However, the job that includes the new benchmark is taking an exceptionally long time, having been running for 2.5 hours as of 2024-04-29 15:38:56 (Shanghai time) and is still not finished. I am wondering if we should reduce the cardinality to potentially speed up the process. https://github.com/yaooqinn/spark/actions/runs/8872875176/job/24357926521 ```log Running org.apache.spark.sql.execution.benchmark.CollationBenchmark: Running benchmark: collation unit benchmarks - equalsFunction Running case: UTF8_BINARY_LCASE Stopped after 2 iterations, 13865 ms Running case: UNICODE Stopped after 2 iterations, 8754 ms Running case: UTF8_BINARY Stopped after 2 iterations, 8723 ms Running case: UNICODE_CI Stopped after 2 iterations, 92410 ms OpenJDK 64-Bit Server VM 21.0.3+9-LTS on Linux 6.5.0-1018-azure AMD EPYC 7763 64-Core Processor collation unit benchmarks - equalsFunction: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative -------------------------------------------------------------------------------------------------------------------------- UTF8_BINARY_LCASE 6931 6933 2 0.0 69310.8 1.0X UNICODE 4355 4377 32 0.0 43547.4 1.6X UTF8_BINARY 4359 4362 3 0.0 43592.6 1.6X UNICODE_CI 46188 46205 24 0.0 461878.5 0.2X Running benchmark: collation unit benchmarks - compareFunction Running case: UTF8_BINARY_LCASE Stopped after 2 iterations, 13838 ms Running case: UNICODE Stopped after 2 iterations, 92542 ms Running case: UTF8_BINARY Stopped after 2 iterations, 16151 ms Running case: UNICODE_CI Stopped after 2 iterations, 97297 ms OpenJDK 64-Bit Server VM 21.0.3+9-LTS on Linux 6.5.0-1018-azure AMD EPYC 7763 64-Core Processor collation unit benchmarks - compareFunction: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative --------------------------------------------------------------------------------------------------------------------------- UTF8_BINARY_LCASE 6912 6919 10 0.0 69119.6 1.0X UNICODE 46242 46271 42 0.0 462416.1 0.1X UTF8_BINARY 8071 8076 6 0.0 80713.3 0.9X UNICODE_CI 48626 48649 32 0.0 486262.5 0.1X Running benchmark: collation unit benchmarks - hashFunction Running case: UTF8_BINARY_LCASE Stopped after 2 iterations, 23280 ms Running case: UNICODE Stopped after 2 iterations, 373689 ms Running case: UTF8_BINARY Stopped after 2 iterations, 19852 ms Running case: UNICODE_CI Stopped after 2 iterations, 318401 ms OpenJDK 64-Bit Server VM 21.0.3+9-LTS on Linux 6.5.0-1018-azure AMD EPYC 7763 64-Core Processor collation unit benchmarks - hashFunction: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative ------------------------------------------------------------------------------------------------------------------------ UTF8_BINARY_LCASE 11611 11640 41 0.0 116108.1 1.0X UNICODE 186807 186845 53 0.0 1868069.7 0.1X UTF8_BINARY 9896 9926 43 0.0 98959.6 1.2X UNICODE_CI 159154 159201 66 0.0 1591543.6 0.1X Running benchmark: collation unit benchmarks - contains Running case: UTF8_BINARY_LCASE Stopped after 2 iterations, 66470 ms Running case: UNICODE Stopped after 2 iterations, 36300 ms Running case: UTF8_BINARY Stopped after 2 iterations, 40600 ms Running case: UNICODE_CI Stopped after 2 iterations, 1821495 ms OpenJDK 64-Bit Server VM 21.0.3+9-LTS on Linux 6.5.0-1018-azure AMD EPYC 7763 64-Core Processor collation unit benchmarks - contains: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative ------------------------------------------------------------------------------------------------------------------------ UTF8_BINARY_LCASE 33224 33235 17 0.0 332236.8 1.0X UNICODE 18132 18150 26 0.0 181316.5 1.8X UTF8_BINARY 20296 20300 6 0.0 202959.6 1.6X UNICODE_CI 905591 910748 7293 0.0 9055905.0 0.0X Running benchmark: collation unit benchmarks - startsWith Running case: UTF8_BINARY_LCASE Stopped after 2 iterations, 65119 ms Running case: UNICODE Stopped after 2 iterations, [3488](https://github.com/yaooqinn/spark/actions/runs/8872875176/job/24357926521#step:7:3488)9 ms Running case: UTF8_BINARY Stopped after 2 iterations, 39591 ms Running case: UNICODE_CI Stopped after 2 iterations, 1777884 ms OpenJDK 64-Bit Server VM 21.0.3+9-LTS on Linux 6.5.0-1018-azure AMD EPYC 7763 64-Core Processor collation unit benchmarks - startsWith: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative ------------------------------------------------------------------------------------------------------------------------ UTF8_BINARY_LCASE 32524 32560 51 0.0 325239.2 1.0X UNICODE 17439 17445 7 0.0 174393.7 1.9X UTF8_BINARY 19756 19796 57 0.0 197556.5 1.6X UNICODE_CI 888925 888942 24 0.0 8889250.7 0.0X Running benchmark: collation unit benchmarks - endsWith Running case: UTF8_BINARY_LCASE Stopped after 2 iterations, 65636 ms Running case: UNICODE Stopped after 2 iterations, 35204 ms Running case: UTF8_BINARY Stopped after 2 iterations, 39829 ms Running case: UNICODE_CI ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org