shubhamvishu commented on PR #14963:
URL: https://github.com/apache/lucene/pull/14963#issuecomment-3242215130
OK, I ran the `luceneutil` benchmarks and I see a huge improvement in the
indexing throughput with this PR compared to baseline(without this change). I
see an almost **`4x`** improvement in the indexing time here.
*Note : The improvement in the latency or CPUTime seems to be driven by the
less no. of segments which also very slightly impact the recall as we know.*
CC - @benwtrent @msokolov @jpountz @vigyasharma
#### With `HNSW_GRAPH_THRESHOLD` = 10
**Baseline**
```
Results:
recall latency(ms) netCPU avgCpuCount nDoc topK fanout maxConn
beamWidth quantized index(s) index_docs/s num_segments index_size(MB)
vec_disk(MB) vec_RAM(MB) indexType
0.515 11.084 11.071 0.999 500000 100 50 64
250 4 bits 120.51 4149.14 7 1690.12
1649.857 185.013 HNSW
0.875 9.965 9.946 0.998 500000 100 50 64
250 7 bits 119.18 4195.23 3 1871.00
1832.962 368.118 HNSW
0.978 19.637 19.621 0.999 500000 100 50 64
250 no 127.63 3917.51 8 1501.98
1464.844 1464.844 HNSW
```
**Candidate**
```
Results:
recall latency(ms) netCPU avgCpuCount nDoc topK fanout maxConn
beamWidth quantized index(s) index_docs/s num_segments index_size(MB)
vec_disk(MB) vec_RAM(MB) indexType
0.517 5.933 5.914 0.997 500000 100 50 64
250 4 bits 30.34 16482.07 2 1694.34
1649.857 185.013 HNSW
0.871 9.655 9.635 0.998 500000 100 50 64
250 7 bits 27.86 17945.59 3 1869.52
1832.962 368.118 HNSW
0.961 11.280 11.269 0.999 500000 100 50 64
250 no 31.16 16046.73 3 1503.46
1464.844 1464.844 HNSW
```
--------------------------
#### With `HNSW_GRAPH_THRESHOLD` = 100
**Baseline**
```
Results:
recall latency(ms) netCPU avgCpuCount nDoc topK fanout maxConn
beamWidth quantized index(s) index_docs/s num_segments index_size(MB)
vec_disk(MB) vec_RAM(MB) indexType
0.515 11.101 11.074 0.998 500000 100 50 64
250 4 bits 118.33 4225.40 7 1690.02
1649.857 185.013 HNSW
0.874 10.199 10.176 0.998 500000 100 50 64
250 7 bits 118.18 4230.83 3 1871.16
1832.962 368.118 HNSW
0.977 19.990 19.979 0.999 500000 100 50 64
250 no 126.92 3939.61 8 1501.95
1464.844 1464.844 HNSW
```
**Candidate**
```
Results:
recall latency(ms) netCPU avgCpuCount nDoc topK fanout maxConn
beamWidth quantized index(s) index_docs/s num_segments index_size(MB)
vec_disk(MB) vec_RAM(MB) indexType
0.517 5.900 5.882 0.997 500000 100 50 64
250 4 bits 30.13 16596.96 2 1694.14
1649.857 185.013 HNSW
0.872 9.680 9.660 0.998 500000 100 50 64
250 7 bits 28.58 17495.98 3 1869.05
1832.962 368.118 HNSW
0.964 11.640 11.619 0.998 500000 100 50 64
250 no 27.25 18349.97 3 1502.64
1464.844 1464.844 HNSW
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]