[GitHub] [lucene-jira-archive] mocobeta commented on issue #7: Make a detailed migration plan

2022-08-10 Thread GitBox
mocobeta commented on issue #7: URL: https://github.com/apache/lucene-jira-archive/issues/7#issuecomment-1211611734 As a note, here is the INFRA issue. https://issues.apache.org/jira/browse/INFRA-23563 -- This is an automated message from the Apache Git Service. To respond to the m

[jira] [Updated] (LUCENE-10577) Enable quantization of HNSW vectors to 8 bits

2022-08-10 Thread Michael Sokolov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Sokolov updated LUCENE-10577: - Summary: Enable quantization of HNSW vectors to 8 bits (was: Quantize vector values)

[GitHub] [lucene] rmuir commented on pull request #1057: LUCENE-10670: Add a codec class to track merge time of each index part

2022-08-10 Thread GitBox
rmuir commented on PR #1057: URL: https://github.com/apache/lucene/pull/1057#issuecomment-1211278416 this will actually slow down the merge heavily, by preventing things like optimized bulk merges of stored fields. I really don't think we should be doing this with a codec-wrapper. you

[jira] [Commented] (LUCENE-10577) Quantize vector values

2022-08-10 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17578150#comment-17578150 ] ASF subversion and git services commented on LUCENE-10577: -- Co

[GitHub] [lucene] msokolov merged pull request #1054: LUCENE-10577: enable quantization of HNSW vectors to 8 bits

2022-08-10 Thread GitBox
msokolov merged PR #1054: URL: https://github.com/apache/lucene/pull/1054 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apac

[jira] [Commented] (LUCENE-10678) computing the partition point on a BKD tree merge can overflow

2022-08-10 Thread Ignacio Vera (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17578098#comment-17578098 ] Ignacio Vera commented on LUCENE-10678: --- The error from this bug looks like: {cod

[GitHub] [lucene] iverase opened a new pull request, #1065: LUCENE-10678: Fix possible overflow when computing the partition point on the BKD tree

2022-08-10 Thread GitBox
iverase opened a new pull request, #1065: URL: https://github.com/apache/lucene/pull/1065 We currently compute the partition point for a set of points by multiplying the number of nodes that needs to be on the left of the BKD tree by the maxPointsInLeafNode. This multiplication is done on t

[jira] [Created] (LUCENE-10678) computing the partition point on a BKD tree merge can overflow

2022-08-10 Thread Ignacio Vera (Jira)
Ignacio Vera created LUCENE-10678: - Summary: computing the partition point on a BKD tree merge can overflow Key: LUCENE-10678 URL: https://issues.apache.org/jira/browse/LUCENE-10678 Project: Lucene -

[jira] [Commented] (LUCENE-10471) Increase the number of dims for KNN vectors to 2048

2022-08-10 Thread Michael Sokolov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17577970#comment-17577970 ] Michael Sokolov commented on LUCENE-10471: -- > Maybe I do not understand the co

[jira] [Comment Edited] (LUCENE-10677) Duplicate strings in FieldInfo#attributes contribute significantly to heap usage at scale

2022-08-10 Thread David Turner (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17577853#comment-17577853 ] David Turner edited comment on LUCENE-10677 at 8/10/22 9:42 AM: -

[jira] [Commented] (LUCENE-10677) Duplicate strings in FieldInfo#attributes contribute significantly to heap usage at scale

2022-08-10 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17577859#comment-17577859 ] Dawid Weiss commented on LUCENE-10677: -- > It looks like we might be able to interc

[jira] [Commented] (LUCENE-10677) Duplicate strings in FieldInfo#attributes contribute significantly to heap usage at scale

2022-08-10 Thread David Turner (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17577853#comment-17577853 ] David Turner commented on LUCENE-10677: --- > I'm opposed to the use of string.inter

[jira] [Commented] (LUCENE-10471) Increase the number of dims for KNN vectors to 2048

2022-08-10 Thread Marcus Eagan (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17577849#comment-17577849 ] Marcus Eagan commented on LUCENE-10471: --- [~michi] You are free to increase the di