mccullocht opened a new issue, #16029:
URL: https://github.com/apache/lucene/issues/16029

   ### Description
   
   Add options for data-blind scalar quantization. Data blind quantization will 
allow users to drop the raw float vectors which will be a significant reduction 
in storage in all cases (~4x or more improvement) but limits flexibility (no 
re-quantization of input).
   
   There are 3 parts to this change:
   - [ ] Allow users to disable centering and drop the raw float vectors. This 
is usable at high bit rates (4-8 bits/dim) where users may not be interested in 
re-ranking the results.
   - [ ] Implement random rotation of vectors and queries. This generates a 
more favorable value distribution for quantization and should help reclaim some 
of the accuracy losses in the first step.
   - [ ] Implement residual vector quantization. After quantizing each vector 
`v`, quantize the residual `v - dequantize(quantize(v))` and write it to 
separate storage. This vector can be used to boost accuracy during 
reranking/rescoring or vector decoding while still allowing fast/cheap 
representations for graph traversal. This makes data blind quantization more 
palatable at low bit rates (1-2 bits/dim).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to