Re: [PR] Create vectorized versions of ScalarQuantizer.quantize and recalculateCorrectiveOffset [lucene]

via GitHub Tue, 11 Mar 2025 06:31:51 -0700


benwtrent commented on PR #14304:
URL: https://github.com/apache/lucene/pull/14304#issuecomment-2713553719


   On GCP, there isn't much difference. I wouldn't expect there to be a huge 
amount of difference as the dominate cost is the vector comparisons not the 
quantization.
   
   I haven't tested with "flat" yet.
   
   BASELINE
   ```
   recall  latency (ms)    nDoc  topK  fanout  maxConn  beamWidth  quantized  
visited  index s  index docs/s  force merge s  num segments  index size (MB)  
vec disk (MB)  vec RAM (MB)
    0.961         2.910  200000   100      50       64        250     7 bits    
 6677   111.44       1794.70          79.03             1           997.58      
  976.563       195.313
   ```
   
   CANDIDATE
   ```
   recall  latency (ms)    nDoc  topK  fanout  maxConn  beamWidth  quantized  
visited  index s  index docs/s  force merge s  num segments  index size (MB)  
vec disk (MB)  vec RAM (MB)
    0.960         2.460  200000   100      50       64        250     7 bits    
 6527   110.99       1801.98          76.68             1           997.55      
  976.563       195.313
   ```
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [PR] Create vectorized versions of ScalarQuantizer.quantize and recalculateCorrectiveOffset [lucene]

Reply via email to