mccullocht opened a new issue, #16029: URL: https://github.com/apache/lucene/issues/16029
### Description Add options for data-blind scalar quantization. Data blind quantization will allow users to drop the raw float vectors which will be a significant reduction in storage in all cases (~4x or more improvement) but limits flexibility (no re-quantization of input). There are 3 parts to this change: - [ ] Allow users to disable centering and drop the raw float vectors. This is usable at high bit rates (4-8 bits/dim) where users may not be interested in re-ranking the results. - [ ] Implement random rotation of vectors and queries. This generates a more favorable value distribution for quantization and should help reclaim some of the accuracy losses in the first step. - [ ] Implement residual vector quantization. After quantizing each vector `v`, quantize the residual `v - dequantize(quantize(v))` and write it to separate storage. This vector can be used to boost accuracy during reranking/rescoring or vector decoding while still allowing fast/cheap representations for graph traversal. This makes data blind quantization more palatable at low bit rates (1-2 bits/dim). -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
