sunchao commented on code in PR #46447:
URL: https://github.com/apache/spark/pull/46447#discussion_r1663221944


##########
sql/core/benchmarks/DataSourceReadBenchmark-results.txt:
##########
@@ -1,431 +1,431 @@
-================================================================================================
+DataSourceReadBenchmark-jdk21-results.txt================================================================================================
 SQL Single Numeric Column Scan
 ================================================================================================
 
-OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
+OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1022-azure
 AMD EPYC 7763 64-Core Processor
 SQL Single BOOLEAN Column Scan:           Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
 ------------------------------------------------------------------------------------------------------------------------
-SQL CSV                                           12156          12215          83          1.3         772.8       1.0X
-SQL Json                                           8033           8276         343          2.0         510.7       1.5X
-SQL Parquet Vectorized: DataPageV1                  104            119           8        151.4           6.6     117.0X
-SQL Parquet Vectorized: DataPageV2                  104            117          25        151.2           6.6     116.8X
-SQL Parquet MR: DataPageV1                         1795           1811          22          8.8         114.2       6.8X
-SQL Parquet MR: DataPageV2                         1728           1739          15          9.1         109.9       7.0X
-SQL ORC Vectorized                                  140            147           5        112.1           8.9      86.6X
-SQL ORC MR                                         1617           1617           0          9.7         102.8       7.5X
-
-OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
+SQL CSV                                           10363          10364           2          1.5         658.9       1.0X
+SQL Json                                           8667           8699          46          1.8         551.0       1.2X
+SQL Parquet Vectorized: DataPageV1                  103            114           8        153.3           6.5     101.0X
+SQL Parquet Vectorized: DataPageV2                  101            111           6        155.4           6.4     102.4X
+SQL Parquet MR: DataPageV1                         1809           1813           6          8.7         115.0       5.7X
+SQL Parquet MR: DataPageV2                         1715           1720           8          9.2         109.0       6.0X
+SQL ORC Vectorized                                  139            146           5        113.1           8.8      74.5X
+SQL ORC MR                                         1508           1511           5         10.4          95.8       6.9X
+
+OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1022-azure
 AMD EPYC 7763 64-Core Processor
 Parquet Reader Single BOOLEAN Column Scan:   Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
 ---------------------------------------------------------------------------------------------------------------------------
-ParquetReader Vectorized: DataPageV1                    94             95           1        166.6           6.0       1.0X
-ParquetReader Vectorized: DataPageV2                   101            103           2        155.3           6.4       0.9X
-ParquetReader Vectorized -> Row: DataPageV1             74             75           2        213.8           4.7       1.3X
-ParquetReader Vectorized -> Row: DataPageV2             81             83           1        193.2           5.2       1.2X
+ParquetReader Vectorized: DataPageV1                    88             90           2        178.9           5.6       1.0X
+ParquetReader Vectorized: DataPageV2                    95             96           1        166.2           6.0       0.9X
+ParquetReader Vectorized -> Row: DataPageV1             73             74           1        215.3           4.6       1.2X
+ParquetReader Vectorized -> Row: DataPageV2             81             83           1        193.1           5.2       1.1X
 
-OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
+OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1022-azure
 AMD EPYC 7763 64-Core Processor
 SQL Single TINYINT Column Scan:           Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
 ------------------------------------------------------------------------------------------------------------------------
-SQL CSV                                           11407          11426          26          1.4         725.3       1.0X
-SQL Json                                           9654           9660           7          1.6         613.8       1.2X
-SQL Parquet Vectorized: DataPageV1                   91            100           7        172.0           5.8     124.7X
-SQL Parquet Vectorized: DataPageV2                   88             97           8        178.6           5.6     129.5X
-SQL Parquet MR: DataPageV1                         1913           1916           5          8.2         121.6       6.0X
-SQL Parquet MR: DataPageV2                         1868           1871           5          8.4         118.7       6.1X
-SQL ORC Vectorized                                  115            124           6        136.7           7.3      99.1X
-SQL ORC MR                                         1519           1523           6         10.4          96.6       7.5X
-
-OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1018-azure
+SQL CSV                                           11538          11589          73          1.4         733.5       1.0X
+SQL Json                                           9586           9596          14          1.6         609.5       1.2X
+SQL Parquet Vectorized: DataPageV1                  109            116           6        144.8           6.9     106.2X
+SQL Parquet Vectorized: DataPageV2                  110            118           8        142.6           7.0     104.6X
+SQL Parquet MR: DataPageV1                         1901           1953          74          8.3         120.9       6.1X
+SQL Parquet MR: DataPageV2                         1817           1832          22          8.7         115.5       6.4X
+SQL ORC Vectorized                                  118            126           7        133.6           7.5      98.0X
+SQL ORC MR                                         1505           1535          43         10.5          95.7       7.7X
+
+OpenJDK 64-Bit Server VM 17.0.11+9-LTS on Linux 6.5.0-1022-azure
 AMD EPYC 7763 64-Core Processor
 Parquet Reader Single TINYINT Column Scan:   Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
 ---------------------------------------------------------------------------------------------------------------------------
-ParquetReader Vectorized: DataPageV1                    69             71           1        227.3           4.4       1.0X
-ParquetReader Vectorized: DataPageV2                    69             71           2        228.4           4.4       1.0X
-ParquetReader Vectorized -> Row: DataPageV1             47             48           1        332.4           3.0       1.5X
-ParquetReader Vectorized -> Row: DataPageV2             47             48           1        334.0           3.0       1.5X
+ParquetReader Vectorized: DataPageV1                    93             94           1        169.9           5.9       1.0X

Review Comment:
   I guess this slowdown in absolute time is probably due to other factors. 
It's fine as long as the relative numbers stay the same?
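
   A minimal sketch of that check, assuming the `Relative` column is the first row's best time divided by each row's best time (an assumption that reproduces the BOOLEAN Parquet Reader numbers quoted above, although the benchmark itself presumably works from unrounded timings). The helper name `relative` is hypothetical.

```python
def relative(best_ms):
    """Speedup of each case against the first (baseline) case, to one decimal."""
    baseline = best_ms[0]
    return [round(baseline / t, 1) for t in best_ms]


# Best Time(ms) for "Parquet Reader Single BOOLEAN Column Scan", copied from the diff.
old_best = [94, 101, 74, 81]   # removed (-) rows
new_best = [88, 95, 73, 81]    # added (+) rows

old_rel = relative(old_best)
new_rel = relative(new_best)

# Print the two columns side by side so any drift is easy to spot.
for old, new in zip(old_rel, new_rel):
    print(f"{old:.1f}X -> {new:.1f}X")
```

   Comparing the two lists row by row shows whether the ratios moved, independent of how fast the runner happened to be on a given day.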



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

