Re: [PR] build(deps): bump holidays from 0.75 to 0.77 in /dev-tools/scripts [lucene]

2025-08-01 Thread via GitHub
rmuir merged PR #15029: URL: https://github.com/apache/lucene/pull/15029 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apach

Re: [PR] build(deps): bump ruff from 0.12.1 to 0.12.7 in /dev-tools/scripts [lucene]

2025-08-01 Thread via GitHub
rmuir merged PR #15028: URL: https://github.com/apache/lucene/pull/15028

Re: [PR] build(deps): bump pygithub from 2.6.1 to 2.7.0 in /dev-tools/scripts [lucene]

2025-08-01 Thread via GitHub
rmuir merged PR #15031: URL: https://github.com/apache/lucene/pull/15031

Re: [PR] build(deps): bump pygithub from 2.6.1 to 2.7.0 in /dev-tools/scripts [lucene]

2025-08-01 Thread via GitHub
rmuir commented on PR #15031: URL: https://github.com/apache/lucene/pull/15031#issuecomment-3146183887 Since we don't use the problematic `get_rate_limit` i think the risk is minimal.

Re: [PR] build(deps): bump pygithub from 2.6.1 to 2.7.0 in /dev-tools/scripts [lucene]

2025-08-01 Thread via GitHub
rmuir commented on PR #15031: URL: https://github.com/apache/lucene/pull/15031#issuecomment-3146146200 @dependabot rebase

Re: [PR] build(deps): bump holidays from 0.75 to 0.77 in /dev-tools/scripts [lucene]

2025-08-01 Thread via GitHub
rmuir commented on PR #15029: URL: https://github.com/apache/lucene/pull/15029#issuecomment-3146146836 @dependabot rebase

Re: [PR] build(deps): bump basedpyright from 1.29.5 to 1.31.0 in /dev-tools/scripts [lucene]

2025-08-01 Thread via GitHub
rmuir merged PR #15030: URL: https://github.com/apache/lucene/pull/15030

Re: [PR] build(deps): bump basedpyright from 1.29.5 to 1.31.0 in /dev-tools/scripts [lucene]

2025-08-01 Thread via GitHub
rmuir commented on PR #15030: URL: https://github.com/apache/lucene/pull/15030#issuecomment-3146075126 This one is always the first to merge for the python dependency updates. The `ruff` linter only looks at individual `.py` files in isolation, so it won't detect any integration issues. Thi

Re: [PR] MultiIndexMergeScheduler: a production multi-tenant merge scheduler [lucene]

2025-08-01 Thread via GitHub
corecursion commented on code in PR #15015: URL: https://github.com/apache/lucene/pull/15015#discussion_r2249017404 ## lucene/core/src/java/org/apache/lucene/index/MultiIndexMergeScheduler.java: ## @@ -0,0 +1,203 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

Re: [PR] MultiIndexMergeScheduler: a production multi-tenant merge scheduler [lucene]

2025-08-01 Thread via GitHub
corecursion commented on code in PR #15015: URL: https://github.com/apache/lucene/pull/15015#discussion_r2249017250 ## lucene/core/src/java/org/apache/lucene/index/MultiIndexMergeScheduler.java: ## @@ -0,0 +1,203 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

[PR] build(deps): bump ruff from 0.12.1 to 0.12.7 in /dev-tools/scripts [lucene]

2025-08-01 Thread via GitHub
dependabot[bot] opened a new pull request, #15028: URL: https://github.com/apache/lucene/pull/15028 Bumps [ruff](https://github.com/astral-sh/ruff) from 0.12.1 to 0.12.7. Release notes Sourced from ruff's releases (https://github.com/astral-sh/ruff/releases). 0.12.7 Releas

[PR] build(deps): bump pygithub from 2.6.1 to 2.7.0 in /dev-tools/scripts [lucene]

2025-08-01 Thread via GitHub
dependabot[bot] opened a new pull request, #15031: URL: https://github.com/apache/lucene/pull/15031 Bumps [pygithub](https://github.com/pygithub/pygithub) from 2.6.1 to 2.7.0. Release notes Sourced from pygithub's releases (https://github.com/pygithub/pygithub/releases). v2.

[PR] build(deps): bump basedpyright from 1.29.5 to 1.31.0 in /dev-tools/scripts [lucene]

2025-08-01 Thread via GitHub
dependabot[bot] opened a new pull request, #15030: URL: https://github.com/apache/lucene/pull/15030 Bumps [basedpyright](https://github.com/detachhead/basedpyright) from 1.29.5 to 1.31.0. Commits https://github.com/DetachHead/basedpyright/commit/f9d048fb2c37aca36df6f4d2e1887825

[PR] build(deps): bump holidays from 0.75 to 0.77 in /dev-tools/scripts [lucene]

2025-08-01 Thread via GitHub
dependabot[bot] opened a new pull request, #15029: URL: https://github.com/apache/lucene/pull/15029 Bumps [holidays](https://github.com/vacanza/holidays) from 0.75 to 0.77. Release notes Sourced from holidays's releases (https://github.com/vacanza/holidays/releases). Version

Re: [PR] MultiIndexMergeScheduler: a production multi-tenant merge scheduler [lucene]

2025-08-01 Thread via GitHub
corecursion commented on code in PR #15015: URL: https://github.com/apache/lucene/pull/15015#discussion_r2248998062 ## lucene/core/src/java/org/apache/lucene/index/MultiIndexMergeScheduler.java: ## @@ -0,0 +1,203 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

Re: [PR] GroupVarInt Encoding Implementation for HNSW Graphs [lucene]

2025-08-01 Thread via GitHub
kaivalnp commented on code in PR #14932: URL: https://github.com/apache/lucene/pull/14932#discussion_r2248977825 ## lucene/core/src/java/org/apache/lucene/codecs/lucene99/Lucene99HnswVectorsWriter.java: ## @@ -342,8 +356,12 @@ private void reconstructAndWriteNeighbours( }

Re: [PR] GroupVarInt Encoding Implementation for HNSW Graphs [lucene]

2025-08-01 Thread via GitHub
github-actions[bot] commented on PR #14932: URL: https://github.com/apache/lucene/pull/14932#issuecomment-3145928338 This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog label to it and you will stop

Re: [PR] MultiIndexMergeScheduler: a production multi-tenant merge scheduler [lucene]

2025-08-01 Thread via GitHub
corecursion commented on code in PR #15015: URL: https://github.com/apache/lucene/pull/15015#discussion_r2248960412 ## lucene/core/src/java/org/apache/lucene/index/MultiIndexMergeScheduler.java: ## @@ -0,0 +1,203 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

Re: [PR] GroupVarInt Encoding Implementation for HNSW Graphs [lucene]

2025-08-01 Thread via GitHub
aylonsk commented on code in PR #14932: URL: https://github.com/apache/lucene/pull/14932#discussion_r2248958499 ## lucene/core/src/java/org/apache/lucene/codecs/lucene99/Lucene99HnswVectorsReader.java: ## @@ -76,6 +79,7 @@ public final class Lucene99HnswVectorsReader extends Kn

Re: [PR] MultiIndexMergeScheduler: a production multi-tenant merge scheduler [lucene]

2025-08-01 Thread via GitHub
corecursion commented on code in PR #15015: URL: https://github.com/apache/lucene/pull/15015#discussion_r2248957089 ## lucene/core/src/java/org/apache/lucene/index/MultiIndexMergeScheduler.java: ## @@ -0,0 +1,203 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

Re: [PR] GroupVarInt Encoding Implementation for HNSW Graphs [lucene]

2025-08-01 Thread via GitHub
aylonsk commented on code in PR #14932: URL: https://github.com/apache/lucene/pull/14932#discussion_r2248956741 ## lucene/backward-codecs/src/test/org/apache/lucene/backward_codecs/lucene99/Lucene99HnswVectorsFormatV0.java: ## @@ -0,0 +1,233 @@ +/* Review Comment: I added a

Re: [PR] GroupVarInt Encoding Implementation for HNSW Graphs [lucene]

2025-08-01 Thread via GitHub
kaivalnp commented on code in PR #14932: URL: https://github.com/apache/lucene/pull/14932#discussion_r2248954947 ## lucene/core/src/java/org/apache/lucene/codecs/lucene99/Lucene99HnswVectorsReader.java: ## @@ -76,6 +79,7 @@ public final class Lucene99HnswVectorsReader extends K

Re: [PR] GroupVarInt Encoding Implementation for HNSW Graphs [lucene]

2025-08-01 Thread via GitHub
github-actions[bot] commented on PR #14932: URL: https://github.com/apache/lucene/pull/14932#issuecomment-3145902714 This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog label to it and you will stop

Re: [PR] Optimize prefix sum computation in Lucene99HnswVectorsReader; fixes #15024 [lucene]

2025-08-01 Thread via GitHub
yossev commented on PR #15027: URL: https://github.com/apache/lucene/pull/15027#issuecomment-3145884514 Perfect. I will be waiting for updates on that matter. Thank you!

Re: [PR] Optimize prefix sum computation in Lucene99HnswVectorsReader; fixes #15024 [lucene]

2025-08-01 Thread via GitHub
jpountz commented on PR #15027: URL: https://github.com/apache/lucene/pull/15027#issuecomment-3145871053 The only missing thing is an entry in lucene/CHANGES.txt but we can deal with it later.

Re: [PR] Optimize prefix sum computation in Lucene99HnswVectorsReader; fixes #15024 [lucene]

2025-08-01 Thread via GitHub
yossev commented on PR #15027: URL: https://github.com/apache/lucene/pull/15027#issuecomment-3145860795 > Thank you, this looks good. If you have cycles to run benchmarks, this would be appreciated, you can check out this: https://github.com/mikemccand/luceneutil/blob/main/README.md#running

Re: [PR] MultiIndexMergeScheduler: a production multi-tenant merge scheduler [lucene]

2025-08-01 Thread via GitHub
vigyasharma commented on code in PR #15015: URL: https://github.com/apache/lucene/pull/15015#discussion_r2248874888 ## lucene/core/src/java/org/apache/lucene/index/MultiIndexMergeScheduler.java: ## @@ -0,0 +1,203 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

Re: [PR] Only score doc first when minCompetitiveScore is zero [lucene]

2025-08-01 Thread via GitHub
jpountz commented on PR #14856: URL: https://github.com/apache/lucene/pull/14856#issuecomment-3145773701 @gf2121 Let's merge this PR?

Re: [PR] Bypass HNSW graph building for tiny segments [lucene]

2025-08-01 Thread via GitHub
jpountz commented on code in PR #14963: URL: https://github.com/apache/lucene/pull/14963#discussion_r2248864216 ## lucene/core/src/java/org/apache/lucene/codecs/lucene99/Lucene99HnswVectorsFormat.java: ## @@ -137,9 +144,16 @@ public final class Lucene99HnswVectorsFormat extends

Re: [PR] Bypass HNSW graph building for tiny segments [lucene]

2025-08-01 Thread via GitHub
jpountz commented on code in PR #14963: URL: https://github.com/apache/lucene/pull/14963#discussion_r2248862786 ## lucene/core/src/java/org/apache/lucene/codecs/lucene99/Lucene99HnswVectorsFormat.java: ## @@ -115,6 +115,13 @@ public final class Lucene99HnswVectorsFormat extends

Re: [PR] Bypass HNSW graph building for tiny segments [lucene]

2025-08-01 Thread via GitHub
jpountz commented on code in PR #14963: URL: https://github.com/apache/lucene/pull/14963#discussion_r2248839514 ## lucene/core/src/java/org/apache/lucene/codecs/lucene99/Lucene99HnswVectorsFormat.java: ## @@ -137,9 +144,16 @@ public final class Lucene99HnswVectorsFormat extends

Re: [PR] Optimize prefix sum computation in Lucene99HnswVectorsReader; fixes #15024 [lucene]

2025-08-01 Thread via GitHub
jpountz commented on PR #15027: URL: https://github.com/apache/lucene/pull/15027#issuecomment-3145718862 Thank you, this looks good. If you have cycles to run benchmarks, this would be appreciated, you can check out this: https://github.com/mikemccand/luceneutil/blob/main/README.md#running-

Re: [PR] Bypass HNSW graph building for tiny segments [lucene]

2025-08-01 Thread via GitHub
msokolov commented on code in PR #14963: URL: https://github.com/apache/lucene/pull/14963#discussion_r2248750458 ## lucene/core/src/java/org/apache/lucene/codecs/lucene99/Lucene99HnswVectorsFormat.java: ## @@ -137,9 +144,16 @@ public final class Lucene99HnswVectorsFormat extends

Re: [PR] Avoid reconstructing HNSW graph during singleton merges [lucene]

2025-08-01 Thread via GitHub
msokolov commented on code in PR #15003: URL: https://github.com/apache/lucene/pull/15003#discussion_r2248685396 ## lucene/core/src/java/org/apache/lucene/codecs/lucene99/Lucene99HnswVectorsWriter.java: ## @@ -347,12 +350,195 @@ private void reconstructAndWriteNeighbours( }

Re: [I] Improve prefix sum in Lucene99HnswVectorsReader [lucene]

2025-08-01 Thread via GitHub
yossev commented on issue #15024: URL: https://github.com/apache/lucene/issues/15024#issuecomment-3145499229 I've submitted a PR with the proposed optimization: #15027 It replaces the prefix sum logic with the faster variant as discussed. Feedback welcome!

Re: [PR] Optimize prefix sum computation in Lucene99HnswVectorsReader; fixes #15024 [lucene]

2025-08-01 Thread via GitHub
github-actions[bot] commented on PR #15027: URL: https://github.com/apache/lucene/pull/15027#issuecomment-3145494478 This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog label to it and you will stop

[PR] Optimize prefix sum computation in Lucene99HnswVectorsReader; fixes #15024 [lucene]

2025-08-01 Thread via GitHub
yossev opened a new pull request, #15027: URL: https://github.com/apache/lucene/pull/15027 Replaced the two-step prefix sum loop in `Lucene99HnswVectorsReader` with a single-loop variant that avoids redundant memory access and improves performance. Previous approach: - Rea
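
For readers skimming this thread, the difference between a two-step and a single-loop prefix sum can be sketched roughly as follows. This is only an illustration under stated assumptions: the snippet above is truncated, so the actual Lucene code is not reproduced here. `PrefixSumSketch`, `twoStep`, and `singleLoop` are hypothetical names, and a plain `int[]` of deltas stands in for the values the reader decodes from its input.

```java
import java.util.Arrays;

// Hypothetical illustration only; this is not the code from Lucene99HnswVectorsReader.
class PrefixSumSketch {

  // Two-step variant: first copy the deltas into the output buffer,
  // then accumulate in place, re-reading the element written on the
  // previous iteration.
  static int[] twoStep(int[] deltas) {
    int[] out = Arrays.copyOf(deltas, deltas.length);
    for (int i = 1; i < out.length; i++) {
      out[i] += out[i - 1];
    }
    return out;
  }

  // Single-loop variant: keep the running sum in a local variable so
  // each iteration only writes to the buffer and never reads it back.
  static int[] singleLoop(int[] deltas) {
    int[] out = new int[deltas.length];
    int sum = 0;
    for (int i = 0; i < deltas.length; i++) {
      sum += deltas[i];
      out[i] = sum;
    }
    return out;
  }

  public static void main(String[] args) {
    int[] deltas = {3, 2, 5, 1};
    System.out.println(Arrays.toString(twoStep(deltas)));    // [3, 5, 10, 11]
    System.out.println(Arrays.toString(singleLoop(deltas))); // [3, 5, 10, 11]
  }
}
```

Both variants produce the same output; whether the single-loop form is measurably faster in the real reader is the kind of question the benchmarks requested elsewhere in this thread would answer.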

Re: [PR] Avoid reconstructing HNSW graph during singleton merges [lucene]

2025-08-01 Thread via GitHub
jpountz commented on PR #15003: URL: https://github.com/apache/lucene/pull/15003#issuecomment-3145430620 I don't feel qualified to do the review, but I agree with the motivation. I wonder if this optimization could be applied when there are more than 1 segment to merge by first applying del

Re: [I] Improve prefix sum in Lucene99HnswVectorsReader [lucene]

2025-08-01 Thread via GitHub
jpountz commented on issue #15024: URL: https://github.com/apache/lucene/issues/15024#issuecomment-3145419003 Historically, several contributors claimed issues and never actually worked on them so I'm not too keen on assigning it. Nobody else is working on it. Given your comment, other cont

Re: [I] Index level caching policy is thrashed by segment-specific query rewrites [lucene]

2025-08-01 Thread via GitHub
jpountz commented on issue #14986: URL: https://github.com/apache/lucene/issues/14986#issuecomment-3145402961 I was wondering about a variant of your idea that would consist of not reusing the provided `IndexSearcher` but creating a private one that doesn't cache (`IndexSearcher#setQueryCac

Re: [PR] GroupVarInt Encoding Implementation for HNSW Graphs [lucene]

2025-08-01 Thread via GitHub
jpountz commented on code in PR #14932: URL: https://github.com/apache/lucene/pull/14932#discussion_r2248579183 ## lucene/core/src/java/org/apache/lucene/codecs/lucene99/Lucene99HnswVectorsReader.java: ## @@ -76,6 +79,7 @@ public final class Lucene99HnswVectorsReader extends Kn

Re: [PR] GroupVarInt Encoding Implementation for HNSW Graphs [lucene]

2025-08-01 Thread via GitHub
kaivalnp commented on code in PR #14932: URL: https://github.com/apache/lucene/pull/14932#discussion_r2248561198 ## lucene/core/src/java/org/apache/lucene/codecs/lucene99/Lucene99HnswVectorsReader.java: ## @@ -76,6 +79,7 @@ public final class Lucene99HnswVectorsReader extends K

Re: [PR] Remove unused `nonZeroLevelSize` [lucene]

2025-08-01 Thread via GitHub
msokolov merged PR #15022: URL: https://github.com/apache/lucene/pull/15022

Re: [PR] Remove unused `nonZeroLevelSize` [lucene]

2025-08-01 Thread via GitHub
msokolov commented on PR #15022: URL: https://github.com/apache/lucene/pull/15022#issuecomment-3145252602 thanks!

Re: [I] Index level caching policy is thrashed by segment-specific query rewrites [lucene]

2025-08-01 Thread via GitHub
GovindBalaji-S-Glean commented on issue #14986: URL: https://github.com/apache/lucene/issues/14986#issuecomment-3145194691 One hack I can think of is that instead of `searcher` here, we send in a decorator of `searcher` with `IndexSearcher#createWeight(query)` just doing `query.createWeight

Re: [I] Faceting + Data Sketches [lucene]

2025-08-01 Thread via GitHub
msfroh commented on issue #15017: URL: https://github.com/apache/lucene/issues/15017#issuecomment-3145183135 > Is there any prior work in this space within Lucene or search engines in general that anyone is aware of? I haven't seen anything myself, but maybe there's something else to draw o

[I] RamUsageEstimator.sizeOf(Query, long) does not match RamUsageTester.ramUsed(obj) [lucene]

2025-08-01 Thread via GitHub
smuching202 opened a new issue, #15026: URL: https://github.com/apache/lucene/issues/15026 # Context While implementing `Accountable.ramBytesUsed()`, I noticed a discrepancy between the values returned by `RamUsageEstimator.sizeOf(Query, long)` and `RamUsageTester.ramUsed(obj)` in Lucene

Re: [PR] flush segments in addIndexes, not in addIndexesReaderMerge; fixes #14994 [lucene]

2025-08-01 Thread via GitHub
github-actions[bot] commented on PR #15025: URL: https://github.com/apache/lucene/pull/15025#issuecomment-3144975495 This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog label to it and you will stop

[PR] flush segments in addIndexes, not in addIndexesReaderMerge; fixes #14994 [lucene]

2025-08-01 Thread via GitHub
msokolov opened a new pull request, #15025: URL: https://github.com/apache/lucene/pull/15025 Just monkey-coding based on @mikemccand 's comment in the issue. It makes sense to me and seems to pass unit tests. Restores this logic to the way it was before 698f40ad51af0c42b0a4a8321ab899

Re: [I] Improve prefix sum in Lucene99HnswVectorsReader [lucene]

2025-08-01 Thread via GitHub
yossev commented on issue #15024: URL: https://github.com/apache/lucene/issues/15024#issuecomment-3144904919 Hi @jpountz, I’d like to work on this issue if it’s still available. Could you please assign it to me or let me know if someone else is already working on it? Thanks!

Re: [PR] GroupVarInt Encoding Implementation for HNSW Graphs [lucene]

2025-08-01 Thread via GitHub
aylonsk commented on code in PR #14932: URL: https://github.com/apache/lucene/pull/14932#discussion_r2248181313 ## lucene/core/src/java/org/apache/lucene/codecs/lucene99/Lucene99HnswVectorsFormat.java: ## @@ -212,7 +213,7 @@ public KnnVectorsReader fieldsReader(SegmentReadState

Re: [PR] Don't revisit knn query results unless optimistic collection was used… [lucene]

2025-08-01 Thread via GitHub
msokolov merged PR #15019: URL: https://github.com/apache/lucene/pull/15019

Re: [I] Optimistic knn query breaks nested vector search [lucene]

2025-08-01 Thread via GitHub
msokolov closed issue #15005: Optimistic knn query breaks nested vector search URL: https://github.com/apache/lucene/issues/15005

Re: [PR] Don't revisit knn query results unless optimistic collection was used… [lucene]

2025-08-01 Thread via GitHub
msokolov commented on PR #15019: URL: https://github.com/apache/lucene/pull/15019#issuecomment-3144815083 thanks!

Re: [PR] Add ImpactRangeQuery for Impact-Based Document Range Prioritization [lucene]

2025-08-01 Thread via GitHub
jpountz commented on PR #15023: URL: https://github.com/apache/lucene/pull/15023#issuecomment-3144623188 Very cool. Have you been able to measure any speedup with this approach? FYI, this breaks some API contracts, e.g. a `BulkScorer` is expected to score ranges of doc IDs in doc ID o

[I] Improve prefix sum in Lucene99HnswVectorsReader [lucene]

2025-08-01 Thread via GitHub
jpountz opened a new issue, #15024: URL: https://github.com/apache/lucene/issues/15024 `Lucene99HnswVectorsReader` computes prefix sums by doing: ```java currentNeighborsBuffer[0] = dataIn.readVInt(); for (int i = 1; i < arcCount; i++) { currentNeighb

Re: [PR] GroupVarInt Encoding Implementation for HNSW Graphs [lucene]

2025-08-01 Thread via GitHub
jpountz commented on code in PR #14932: URL: https://github.com/apache/lucene/pull/14932#discussion_r2247938079 ## lucene/core/src/java/org/apache/lucene/codecs/lucene99/Lucene99HnswVectorsReader.java: ## @@ -76,6 +79,7 @@ public final class Lucene99HnswVectorsReader extends Kn

Re: [I] Try encoding very frequent terms using a dense bitmap [lucene]

2025-08-01 Thread via GitHub
jpountz commented on issue #13147: URL: https://github.com/apache/lucene/issues/13147#issuecomment-3144492229 This has been implemented in #14133.

Re: [I] Try encoding very frequent terms using a dense bitmap [lucene]

2025-08-01 Thread via GitHub
jpountz closed issue #13147: Try encoding very frequent terms using a dense bitmap URL: https://github.com/apache/lucene/issues/13147

Re: [I] Lucene 9.11: Make AbstractKnnVectorQuery.searchLeaf() overridable to enable custom KNN instrumentation [lucene]

2025-08-01 Thread via GitHub
jpountz commented on issue #15020: URL: https://github.com/apache/lucene/issues/15020#issuecomment-3144489128 For visibility, we are moving the decision between approximate and exact from the query to `KnnVectorsFormat`: https://github.com/apache/lucene/pull/14963. This will allow Lucene t

Re: [PR] Add ImpactRangeQuery for Impact-Based Document Range Prioritization [lucene]

2025-08-01 Thread via GitHub
github-actions[bot] commented on PR #15023: URL: https://github.com/apache/lucene/pull/15023#issuecomment-3143954289 This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog label to it and you will stop

Re: [PR] Add ImpactRangeQuery for Impact-Based Document Range Prioritization [lucene]

2025-08-01 Thread via GitHub
atris commented on PR #15023: URL: https://github.com/apache/lucene/pull/15023#issuecomment-3143952348 @jpountz Please review

Re: [PR] Add ImpactRangeQuery for Impact-Based Document Range Prioritization [lucene]

2025-08-01 Thread via GitHub
github-actions[bot] commented on PR #15023: URL: https://github.com/apache/lucene/pull/15023#issuecomment-3143919539 This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog label to it and you will stop

[PR] Add ImpactRangeQuery for Impact-Based Document Range Prioritization [lucene]

2025-08-01 Thread via GitHub
atris opened a new pull request, #15023: URL: https://github.com/apache/lucene/pull/15023 Implements a query wrapper that prioritizes document ranges based on their scoring potential using Lucene's impact information. The implementation divides the document space into ranges and evaluat

Re: [PR] Remove unused `nonZeroLevelSize` [lucene]

2025-08-01 Thread via GitHub
github-actions[bot] commented on PR #15022: URL: https://github.com/apache/lucene/pull/15022#issuecomment-3143835245 This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog label to it and you will stop

[PR] Remove unused `nonZeroLevelSize` [lucene]

2025-08-01 Thread via GitHub
viliam-durina opened a new pull request, #15022: URL: https://github.com/apache/lucene/pull/15022 A little optimization.

Re: [I] BytesRefIntMap's INIT_RAM_BYTES uses shallowSizeof() instead of shallowSizeOfInstance() [lucene]

2025-08-01 Thread via GitHub
gf2121 commented on issue #15018: URL: https://github.com/apache/lucene/issues/15018#issuecomment-3143231272 Hi, i think this is a bug (my bad), thanks for catching! Would you like to open a PR to fix?