benwtrent commented on PR #14304:
URL: https://github.com/apache/lucene/pull/14304#issuecomment-2697969649
I compared this branch with main. There are measurable improvements, but the
quantization step isn't the main bottleneck; vector comparisons still dominate
the cost. Still, it's a nice bump, I would say.
candidate:
```
recall  latency (ms)  nDoc    topK  fanout  maxConn  beamWidth  quantized  index s  index docs/s  force merge s  num segments  index size (MB)  vec disk (MB)  vec RAM (MB)
0.826   2.340         500000  100   50      32       100        7 bits     86.54    5777.61       337.47         1             1859.34          1831.055       366.211
```
baseline:
```
recall  latency (ms)  nDoc    topK  fanout  maxConn  beamWidth  quantized  index s  index docs/s  force merge s  num segments  index size (MB)  vec disk (MB)  vec RAM (MB)
0.828   2.680         500000  100   50      32       100        7 bits     88.48    5650.74       357.45         1             1859.57          1831.055       366.211
```
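To put numbers on the "nice bump", here is a minimal sketch that computes the relative change of the candidate against the baseline for the metrics that differ between the two tables above. The dictionary keys and the `pct_change` helper are names made up for this illustration; they are not part of luceneutil or Lucene. Only the values are taken from the tables.

```python
# Values copied from the candidate and baseline result tables above.
# The key names are illustrative, not luceneutil column identifiers.
candidate = {
    "recall": 0.826,
    "latency_ms": 2.340,
    "index_s": 86.54,
    "index_docs_per_s": 5777.61,
    "force_merge_s": 337.47,
}
baseline = {
    "recall": 0.828,
    "latency_ms": 2.680,
    "index_s": 88.48,
    "index_docs_per_s": 5650.74,
    "force_merge_s": 357.45,
}


def pct_change(base: float, cand: float) -> float:
    """Relative change of cand versus base, in percent (negative means lower)."""
    return (cand - base) / base * 100.0


# Percent change per metric, candidate relative to baseline.
changes = {name: pct_change(baseline[name], candidate[name]) for name in baseline}

for name, change in changes.items():
    print(f"{name:>18}: {change:+.2f}%")
```

On these single runs that works out to roughly 13% lower query latency, about 2% faster indexing, and about 6% faster force merge, with recall essentially unchanged. This is one run per branch, so small differences may be within noise.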
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]