gf2121 commented on PR #14361:
URL: https://github.com/apache/lucene/pull/14361#issuecomment-2729280384
OK, I get the expected result that flooring to a multiple of 16 is faster than
flooring to a multiple of 8 when I force `-XX:UseAVX=3`. It looks like AVX3 is
slower on this chip, which may be why Java disables it by default.
```
Benchmark                                                 Mode  Cnt   Score   Error   Units
Decode21Benchmark.decode21Scalar                         thrpt    5  28.375 ± 0.064  ops/ms
Decode21Benchmark.decode21Scalar:asm                     thrpt          NaN             ---
Decode21Benchmark.decode21Vector                         thrpt    5  41.844 ± 0.182  ops/ms
Decode21Benchmark.decode21Vector:asm                     thrpt          NaN             ---
Decode21Benchmark.decode21VectorFloorToMultipleOf16      thrpt    5  64.471 ± 0.218  ops/ms
Decode21Benchmark.decode21VectorFloorToMultipleOf16:asm  thrpt          NaN             ---
Decode21Benchmark.decode21VectorFloorToMultipleOf8       thrpt    5  39.665 ± 0.120  ops/ms
Decode21Benchmark.decode21VectorFloorToMultipleOf8:asm   thrpt          NaN             ---
```
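
For readers unfamiliar with the benchmark names: "floor to a multiple of N"
presumably means rounding the loop bound down to a multiple of N before the
vectorized loop runs. A minimal sketch of that rounding, assuming N is a power
of two (16 ints fill a 512-bit register, 8 ints fill a 256-bit one). The class
and method names below are hypothetical and are not taken from the PR.

```java
public final class FloorToMultipleSketch {

  // Hypothetical helper, not from the PR: rounds count down to the nearest
  // multiple of lanes. lanes must be a positive power of two so that the
  // rounding can be done with a single bit mask.
  static int floorToMultipleOf(int count, int lanes) {
    if (count < 0 || lanes <= 0 || (lanes & (lanes - 1)) != 0) {
      throw new IllegalArgumentException("count must be >= 0 and lanes a power of two");
    }
    // -lanes has every bit set except the low log2(lanes) bits,
    // so the AND clears the remainder.
    return count & -lanes;
  }

  public static void main(String[] args) {
    int count = 37; // arbitrary example size
    for (int lanes : new int[] {8, 16}) {
      int bound = floorToMultipleOf(count, lanes);
      System.out.println("lanes=" + lanes + " vectorBound=" + bound
          + " scalarTail=" + (count - bound));
    }
  }
}
```

Elements between the rounded bound and the original count would still need a
scalar tail loop.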
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]