alamb commented on PR #7911: URL: https://github.com/apache/arrow-rs/pull/7911#issuecomment-3070001048
I added some benchmarks, and my local results suggest that avoiding the allocations makes parsing small repeated JSON objects about 10% faster.

I think once we stop copying stuff around in the sub builders, the other benchmarks will be quite a bit faster too.

```
Gnuplot not found, using plotters backend
small_repeated_json 8k string
                        time:   [5.3628 ms 5.3743 ms 5.3862 ms]
                        change: [−11.062% −10.867% −10.654%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 2 outliers among 100 measurements (2.00%)
  2 (2.00%) high mild

Benchmarking small_random_json 8k string: Warming up for 3.0000 s
Warning: Unable to complete 100 samples in 5.0s. You may wish to increase target time to 7.2s, or reduce sample count to 60.
small_random_json 8k string
                        time:   [71.132 ms 71.245 ms 71.364 ms]
                        change: [−1.7432% −1.4915% −1.2496%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  9 (9.00%) high mild
  1 (1.00%) high severe
```
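For readers who have not looked at the diff, here is a generic sketch of the kind of change that "avoiding the allocations" usually refers to: reusing one scratch buffer across iterations instead of creating a fresh one per item. This is not the code from the PR; the function names `encode_all_allocating` and `encode_all_reusing` and the newline-delimited encoding are hypothetical, chosen only to keep the example small and standard-library only.

```rust
/// Hypothetical baseline: builds each item in a fresh `Vec`,
/// so every iteration allocates and frees a heap buffer.
fn encode_all_allocating(values: &[&str], out: &mut Vec<u8>) {
    for v in values {
        let mut tmp: Vec<u8> = Vec::new();
        tmp.extend_from_slice(v.as_bytes());
        tmp.push(b'\n');
        out.extend_from_slice(&tmp);
    }
}

/// Hypothetical variant: produces the same bytes, but reuses one
/// scratch buffer for all items.
fn encode_all_reusing(values: &[&str], out: &mut Vec<u8>) {
    let mut scratch: Vec<u8> = Vec::new();
    for v in values {
        // `clear` drops the contents but keeps the capacity, so the
        // buffer only reallocates when an item is larger than any
        // item seen so far.
        scratch.clear();
        scratch.extend_from_slice(v.as_bytes());
        scratch.push(b'\n');
        out.extend_from_slice(&scratch);
    }
}
```

Both functions write identical output. The second tends to help most when there are many small items, which would be consistent with the repeated-object benchmark improving more than the random one, though the actual cause in the PR may differ.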