Re: [I] For 4-bit quantized vectors, should we change the scheme of unpacking nibbles? [lucene]

via GitHub Wed, 11 Feb 2026 13:39:52 -0800


kaivalnp commented on issue #15697:
URL: https://github.com/apache/lucene/issues/15697#issuecomment-3887367850


   Also pasting a sample JMH benchmark (from #15698) run using:
   
   ```
   java --module-path lucene/benchmark-jmh/build/benchmarks --module 
org.apache.lucene.benchmark.jmh 
"VectorUtilBenchmark.binaryHalfByte.*SinglePacked.*" -p size=1024
   ```
   
   ```
   Benchmark                                                       (size)   
Mode  Cnt  Score   Error   Units
   VectorUtilBenchmark.binaryHalfByteDotProductSinglePackedScalar    1024  
thrpt   15  2.443 ± 0.016  ops/us
   VectorUtilBenchmark.binaryHalfByteDotProductSinglePackedVector    1024  
thrpt   15  2.605 ± 0.010  ops/us
   VectorUtilBenchmark.binaryHalfByteSquareSinglePackedScalar        1024  
thrpt   15  2.020 ± 0.013  ops/us
   VectorUtilBenchmark.binaryHalfByteSquareSinglePackedVector        1024  
thrpt   15  2.587 ± 0.025  ops/us
   ```
   
   On my machine (Graviton3) -- the vectorized implementations do not give a 
large performance improvement over the scalar ones.
   
   Individual JMH iterations started strong, but performance dropped after some 
iterations. Example for `binaryHalfByteDotProductSinglePackedVector`:
   
   ```
   # Warmup Iteration   1: 11.063 ops/us
   # Warmup Iteration   2: 15.398 ops/us
   # Warmup Iteration   3: 2.589 ops/us
   # Warmup Iteration   4: 2.586 ops/us
   Iteration   1: 2.598 ops/us
   Iteration   2: 2.606 ops/us
   Iteration   3: 2.596 ops/us
   Iteration   4: 2.602 ops/us
   Iteration   5: 2.606 ops/us
   ```
   
   I'm not sure if this is just a JMH thing, or would happen in general too.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [I] For 4-bit quantized vectors, should we change the scheme of unpacking nibbles? [lucene]

Reply via email to