jtibshirani commented on code in PR #1076:
URL: https://github.com/apache/lucene/pull/1076#discussion_r950623197


##########
lucene/core/src/java/org/apache/lucene/index/VectorEncoding.java:
##########
@@ -21,12 +21,8 @@
 public enum VectorEncoding {
 
   /**
-   * Encodes vector using 8 bits of precision per sample. Use only with 
DOT_PRODUCT similarity.
-   * NOTE: this can enable significant storage savings and faster searches, at 
the cost of some
-   * possible loss of precision. In order to use it, all vectors must be of 
the same norm, as
-   * measured by the sum of the squares of the scalar values, and those values 
must be in the range
-   * [-128, 127]. This applies to both document and query vectors. Using 
nonconforming vectors can
-   * result in errors or poor search results.
+   * Encodes vector using 8 bits of precision per sample. NOTE: this can 
enable significant storage

Review Comment:
   Maybe we could keep the part about how query vectors need to have all values 
within [-128, 127]? Because if they don't, we'll just cast to a byte and the 
results could be really surprising.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to