steveloughran opened a new pull request, #3452: URL: https://github.com/apache/parquet-java/pull/3452
### Rationale for this change There's no benchmark for variant IO and so there's no knowledge of any problems which exist now, or any way to detect regressions. ### What changes are included in this PR? * adds parquet-variant to parquet-benchmark dependencies * new JMH benchmark `VariantBenchmark` ### Are these changes tested? Manually, initial PR doesn't fork the JVM for each option. ``` Benchmark (depth) (fieldCount) Mode Cnt Score Error Units VariantBenchmark.benchmarkBuildVariant Shallow 1000 ss 100 2007.783 ± 244.861 us/op VariantBenchmark.benchmarkBuildVariant Shallow 10000 ss 100 16214.358 ± 1048.142 us/op VariantBenchmark.benchmarkBuildVariant Nested 1000 ss 100 1544.472 ± 91.232 us/op VariantBenchmark.benchmarkBuildVariant Nested 10000 ss 100 15312.341 ± 226.414 us/op VariantBenchmark.benchmarkDeserialize Shallow 1000 ss 100 893.913 ± 36.284 us/op VariantBenchmark.benchmarkDeserialize Shallow 10000 ss 100 8499.729 ± 197.003 us/op VariantBenchmark.benchmarkDeserialize Nested 1000 ss 100 907.712 ± 80.187 us/op VariantBenchmark.benchmarkDeserialize Nested 10000 ss 100 8450.447 ± 163.247 us/op VariantBenchmark.benchmarkSerialize Shallow 1000 ss 100 16.095 ± 29.358 us/op VariantBenchmark.benchmarkSerialize Shallow 10000 ss 100 6.416 ± 7.178 us/op VariantBenchmark.benchmarkSerialize Nested 1000 ss 100 3.777 ± 0.528 us/op VariantBenchmark.benchmarkSerialize Nested 10000 ss 100 3.956 ± 0.536 us/op VariantBenchmark.writeShredded Shallow 1000 ss 100 1943.923 ± 103.121 us/op VariantBenchmark.writeShredded Shallow 10000 ss 100 20139.185 ± 341.913 us/op VariantBenchmark.writeShredded Nested 1000 ss 100 1920.326 ± 42.812 us/op VariantBenchmark.writeShredded Nested 10000 ss 100 20980.458 ± 539.303 us/op VariantBenchmark.writeUnshredded Shallow 1000 ss 100 29.876 ± 44.216 us/op VariantBenchmark.writeUnshredded Shallow 10000 ss 100 17.380 ± 39.148 us/op VariantBenchmark.writeUnshredded Nested 1000 ss 100 3.254 ± 1.061 us/op VariantBenchmark.writeUnshredded Nested 10000 ss 100 16.602 ± 33.320 us/op ``` there's 100 iterations per benchmark because some of the unshredded/small object operations are so fast that clock granuarity becomes an issue. ### Are there any user-facing changes? No Closes #3451 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
