sunchao commented on code in PR #46447: URL: https://github.com/apache/spark/pull/46447#discussion_r1663221944
########## sql/core/benchmarks/DataSourceReadBenchmark-results.txt: ########## @@ -1,431 +1,431 @@ -================================================================================================ +DataSourceReadBenchmark-jdk21-results.txt================================================================================================ SQL Single Numeric Column Scan ================================================================================================ -OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure +OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1022-azure AMD EPYC 7763 64-Core Processor SQL Single BOOLEAN Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative ------------------------------------------------------------------------------------------------------------------------ -SQL CSV 12156 12215 83 1.3 772.8 1.0X -SQL Json 8033 8276 343 2.0 510.7 1.5X -SQL Parquet Vectorized: DataPageV1 104 119 8 151.4 6.6 117.0X -SQL Parquet Vectorized: DataPageV2 104 117 25 151.2 6.6 116.8X -SQL Parquet MR: DataPageV1 1795 1811 22 8.8 114.2 6.8X -SQL Parquet MR: DataPageV2 1728 1739 15 9.1 109.9 7.0X -SQL ORC Vectorized 140 147 5 112.1 8.9 86.6X -SQL ORC MR 1617 1617 0 9.7 102.8 7.5X - -OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure +SQL CSV 10363 10364 2 1.5 658.9 1.0X +SQL Json 8667 8699 46 1.8 551.0 1.2X +SQL Parquet Vectorized: DataPageV1 103 114 8 153.3 6.5 101.0X +SQL Parquet Vectorized: DataPageV2 101 111 6 155.4 6.4 102.4X +SQL Parquet MR: DataPageV1 1809 1813 6 8.7 115.0 5.7X +SQL Parquet MR: DataPageV2 1715 1720 8 9.2 109.0 6.0X +SQL ORC Vectorized 139 146 5 113.1 8.8 74.5X +SQL ORC MR 1508 1511 5 10.4 95.8 6.9X + +OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1022-azure AMD EPYC 7763 64-Core Processor Parquet Reader Single BOOLEAN Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative --------------------------------------------------------------------------------------------------------------------------- -ParquetReader Vectorized: DataPageV1 94 95 1 166.6 6.0 1.0X -ParquetReader Vectorized: DataPageV2 101 103 2 155.3 6.4 0.9X -ParquetReader Vectorized -> Row: DataPageV1 74 75 2 213.8 4.7 1.3X -ParquetReader Vectorized -> Row: DataPageV2 81 83 1 193.2 5.2 1.2X +ParquetReader Vectorized: DataPageV1 88 90 2 178.9 5.6 1.0X +ParquetReader Vectorized: DataPageV2 95 96 1 166.2 6.0 0.9X +ParquetReader Vectorized -> Row: DataPageV1 73 74 1 215.3 4.6 1.2X +ParquetReader Vectorized -> Row: DataPageV2 81 83 1 193.1 5.2 1.1X -OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure +OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1022-azure AMD EPYC 7763 64-Core Processor SQL Single TINYINT Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative ------------------------------------------------------------------------------------------------------------------------ -SQL CSV 11407 11426 26 1.4 725.3 1.0X -SQL Json 9654 9660 7 1.6 613.8 1.2X -SQL Parquet Vectorized: DataPageV1 91 100 7 172.0 5.8 124.7X -SQL Parquet Vectorized: DataPageV2 88 97 8 178.6 5.6 129.5X -SQL Parquet MR: DataPageV1 1913 1916 5 8.2 121.6 6.0X -SQL Parquet MR: DataPageV2 1868 1871 5 8.4 118.7 6.1X -SQL ORC Vectorized 115 124 6 136.7 7.3 99.1X -SQL ORC MR 1519 1523 6 10.4 96.6 7.5X - -OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure +SQL CSV 11538 11589 73 1.4 733.5 1.0X +SQL Json 9586 9596 14 1.6 609.5 1.2X +SQL Parquet Vectorized: DataPageV1 109 116 6 144.8 6.9 106.2X +SQL Parquet Vectorized: DataPageV2 110 118 8 142.6 7.0 104.6X +SQL Parquet MR: DataPageV1 1901 1953 74 8.3 120.9 6.1X +SQL Parquet MR: DataPageV2 1817 1832 22 8.7 115.5 6.4X +SQL ORC Vectorized 118 126 7 133.6 7.5 98.0X +SQL ORC MR 1505 1535 43 10.5 95.7 7.7X + +OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1022-azure AMD EPYC 7763 64-Core Processor Parquet Reader Single TINYINT Column Scan: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative --------------------------------------------------------------------------------------------------------------------------- -ParquetReader Vectorized: DataPageV1 69 71 1 227.3 4.4 1.0X -ParquetReader Vectorized: DataPageV2 69 71 2 228.4 4.4 1.0X -ParquetReader Vectorized -> Row: DataPageV1 47 48 1 332.4 3.0 1.5X -ParquetReader Vectorized -> Row: DataPageV2 47 48 1 334.0 3.0 1.5X +ParquetReader Vectorized: DataPageV1 93 94 1 169.9 5.9 1.0X Review Comment: I guess this is probably due to other factors. It's fine as long as the relative stay the same? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org