Re: [PR] Create vectorized versions of ScalarQuantizer.quantize and recalculateCorrectiveOffset [lucene]

via GitHub Tue, 04 Mar 2025 07:12:19 -0800


benwtrent commented on PR #14304:
URL: https://github.com/apache/lucene/pull/14304#issuecomment-2697969649


   I compared this branch with main. There are measurable improvements, but the 
quantization step isn't the main bottleneck. Vector comparisons still dominate 
the costs. Still, it's a nice bump, I would say.
   
   candidate:
   ```
   recall  latency (ms)    nDoc  topK  fanout  maxConn  beamWidth  quantized  index s  index docs/s  force merge s  num segments  index size (MB)  vec disk (MB)  vec RAM (MB)
    0.826         2.340  500000   100      50       32        100     7 bits    86.54       5777.61         337.47             1          1859.34       1831.055       366.211
   ```
   
   baseline:
   ```
   recall  latency (ms)    nDoc  topK  fanout  maxConn  beamWidth  quantized  index s  index docs/s  force merge s  num segments  index size (MB)  vec disk (MB)  vec RAM (MB)
    0.828         2.680  500000   100      50       32        100     7 bits    88.48       5650.74         357.45             1          1859.57       1831.055       366.211
   ```
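
To put numbers on the "nice bump", the relative differences between the two runs can be worked out directly from the tables above. This is only a sketch: the dictionary keys and the `percent_reduction` helper are names made up for illustration, and the figures are copied from the single candidate and baseline runs shown here.

```python
# Figures copied from the candidate and baseline runs quoted above.
# The key names are illustrative, not luceneutil column identifiers.
baseline = {"latency_ms": 2.680, "index_s": 88.48, "force_merge_s": 357.45}
candidate = {"latency_ms": 2.340, "index_s": 86.54, "force_merge_s": 337.47}


def percent_reduction(before: float, after: float) -> float:
    """Return how much smaller `after` is than `before`, as a percentage."""
    return (before - after) / before * 100.0


# Relative reduction for each timing column (positive means the candidate is faster).
reductions = {
    name: percent_reduction(baseline[name], candidate[name]) for name in baseline
}

for name, pct in reductions.items():
    print(f"{name}: {pct:.1f}% lower on the candidate branch")
```

On these figures, query latency drops by roughly 12.7%, force-merge time by about 5.6%, and indexing time by about 2.2%, while recall is essentially unchanged (0.826 vs 0.828). Since each configuration was run once, the smaller differences may be within run-to-run noise.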


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


