[jira] [Commented] (LUCENE-9335) Add a bulk scorer for disjunctions that does dynamic pruning

2021-04-06 Thread Zach Chen (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17316042#comment-17316042 ] Zach Chen commented on LUCENE-9335: --- Just curious, I see code inĀ  [https://github.com/

[GitHub] [lucene] rmuir opened a new pull request #70: LUCENE-9911: enable ecjLint unusedExceptionParameter

2021-04-06 Thread GitBox
rmuir opened a new pull request #70: URL: https://github.com/apache/lucene/pull/70 unusedExceptionParameter is a very useful check, as it detects if you catch an exception and do nothing with it at all (swallowed). As a library, its important to preserve exceptions (e.g. chain the ro

[jira] [Created] (LUCENE-9911) enable ecjLint unusedExceptionParameter

2021-04-06 Thread Robert Muir (Jira)
Robert Muir created LUCENE-9911: --- Summary: enable ecjLint unusedExceptionParameter Key: LUCENE-9911 URL: https://issues.apache.org/jira/browse/LUCENE-9911 Project: Lucene - Core Issue Type: Tas

[jira] [Commented] (LUCENE-9855) Reconsider names for ANN related format and APIs

2021-04-06 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315897#comment-17315897 ] Tomoko Uchida commented on LUCENE-9855: --- I didn't use the word "strategy" with any

[jira] [Assigned] (LUCENE-9855) Reconsider names for ANN related format and APIs

2021-04-06 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tomoko Uchida reassigned LUCENE-9855: - Assignee: (was: Tomoko Uchida) > Reconsider names for ANN related format and APIs >

[jira] [Updated] (LUCENE-9855) Reconsider names for ANN related format and APIs

2021-04-06 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tomoko Uchida updated LUCENE-9855: -- Summary: Reconsider names for ANN related format and APIs (was: Reconsider codec VectorFormat

[jira] [Commented] (LUCENE-9850) Explore PFOR for Doc ID delta encoding (instead of FOR)

2021-04-06 Thread Greg Miller (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315895#comment-17315895 ] Greg Miller commented on LUCENE-9850: - Thanks! I've just submitted a PR for this wor

[GitHub] [lucene] gsmiller opened a new pull request #69: LUCENE-9850: Use PFOR encoding for doc IDs (instead of FOR)

2021-04-06 Thread GitBox
gsmiller opened a new pull request #69: URL: https://github.com/apache/lucene/pull/69 # Description Switch over to PFOR encoding for doc IDs (instead of FOR) to achieve better index compression. # Solution Details are in the Jira issue, but I explored the index size vs.

[jira] [Updated] (LUCENE-9855) Reconsider codec VectorFormat and ANN related API names

2021-04-06 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tomoko Uchida updated LUCENE-9855: -- Summary: Reconsider codec VectorFormat and ANN related API names (was: Reconsider codec name

[jira] [Updated] (LUCENE-9855) Reconsider codec name VectorFormat and ANN related APIs

2021-04-06 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tomoko Uchida updated LUCENE-9855: -- Summary: Reconsider codec name VectorFormat and ANN related APIs (was: Reconsider codec name

[GitHub] [lucene] rmuir commented on a change in pull request #68: LUCENE-9910: maximize javac lint

2021-04-06 Thread GitBox
rmuir commented on a change in pull request #68: URL: https://github.com/apache/lucene/pull/68#discussion_r608213508 ## File path: gradle/java/javac.gradle ## @@ -33,27 +33,51 @@ allprojects { options.compilerArgs += [ "-Xlint:-deprecation", "-Xlint:-se

[GitHub] [lucene] rmuir opened a new pull request #68: LUCENE-9910: maximize javac lint

2021-04-06 Thread GitBox
rmuir opened a new pull request #68: URL: https://github.com/apache/lucene/pull/68 This enables quite a few javac warnings from java11+ that weren't enabled for some reason. None of them fail, so lock them in. Additionally some newer checks are only recognized for newer JDK vers

[jira] [Created] (LUCENE-9910) maximize javac lint

2021-04-06 Thread Robert Muir (Jira)
Robert Muir created LUCENE-9910: --- Summary: maximize javac lint Key: LUCENE-9910 URL: https://issues.apache.org/jira/browse/LUCENE-9910 Project: Lucene - Core Issue Type: Task Report

[jira] [Commented] (LUCENE-9904) Port GenerateJflexTLDMacros.java regeneration to gradle

2021-04-06 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315830#comment-17315830 ] Dawid Weiss commented on LUCENE-9904: - That hardcoded list of tld suffixes in testEx

[jira] [Commented] (LUCENE-9904) Port GenerateJflexTLDMacros.java regeneration to gradle

2021-04-06 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315814#comment-17315814 ] Dawid Weiss commented on LUCENE-9904: - This one entails regeneration of top-level do

[jira] [Commented] (LUCENE-9855) Reconsider codec name VectorFormat

2021-04-06 Thread Robert Muir (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315813#comment-17315813 ] Robert Muir commented on LUCENE-9855: - {quote} For postings and doc values, we make

[jira] [Updated] (LUCENE-9905) Revise approach to specifying NN algorithm

2021-04-06 Thread Julie Tibshirani (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julie Tibshirani updated LUCENE-9905: - Affects Version/s: main (9.0) > Revise approach to specifying NN algorithm > ---

[jira] [Updated] (LUCENE-9905) Revise approach to specifying NN algorithm

2021-04-06 Thread Julie Tibshirani (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julie Tibshirani updated LUCENE-9905: - Priority: Blocker (was: Major) > Revise approach to specifying NN algorithm > -

[jira] [Updated] (LUCENE-9908) Move VectorValues#search to VectorReader and LeafReader

2021-04-06 Thread Julie Tibshirani (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julie Tibshirani updated LUCENE-9908: - Affects Version/s: main (9.0) > Move VectorValues#search to VectorReader and LeafReader

[jira] [Created] (LUCENE-9909) Some jflex regeneration tasks should also check the checksums of %include-d resources

2021-04-06 Thread Dawid Weiss (Jira)
Dawid Weiss created LUCENE-9909: --- Summary: Some jflex regeneration tasks should also check the checksums of %include-d resources Key: LUCENE-9909 URL: https://issues.apache.org/jira/browse/LUCENE-9909 P

[GitHub] [lucene] dweiss opened a new pull request #67: LUCENE-9904: Port GenerateJflexTLDMacros.java regeneration to gradle

2021-04-06 Thread GitBox
dweiss opened a new pull request #67: URL: https://github.com/apache/lucene/pull/67 This does contain the regenerated tld file... which should entail regeneration of the jflex parser monster. I'll follow-up. -- This is an automated message from the Apache Git Service. To respond to the m

[GitHub] [lucene] gautamworah96 commented on a change in pull request #62: LUCENE-9902 Minor fixes to the faceting API

2021-04-06 Thread GitBox
gautamworah96 commented on a change in pull request #62: URL: https://github.com/apache/lucene/pull/62#discussion_r608103838 ## File path: lucene/CHANGES.txt ## @@ -287,7 +287,10 @@ Other API Changes - -(No changes) + +* LUCENE-9902: Change the getValue

[jira] [Commented] (LUCENE-9902) Update faceting API to use modern Java features

2021-04-06 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315780#comment-17315780 ] ASF subversion and git services commented on LUCENE-9902: - Commi

[GitHub] [lucene] mikemccand merged pull request #62: LUCENE-9902 Minor fixes to the faceting API

2021-04-06 Thread GitBox
mikemccand merged pull request #62: URL: https://github.com/apache/lucene/pull/62 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please c

[GitHub] [lucene] mikemccand commented on a change in pull request #62: LUCENE-9902 Minor fixes to the faceting API

2021-04-06 Thread GitBox
mikemccand commented on a change in pull request #62: URL: https://github.com/apache/lucene/pull/62#discussion_r608099720 ## File path: lucene/CHANGES.txt ## @@ -287,7 +287,10 @@ Other API Changes - -(No changes) + +* LUCENE-9902: Change the getValue met

[jira] [Commented] (LUCENE-9850) Explore PFOR for Doc ID delta encoding (instead of FOR)

2021-04-06 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315773#comment-17315773 ] Michael McCandless commented on LUCENE-9850: {quote}I might except the oppos

[jira] [Commented] (LUCENE-9827) Small segments are slower to merge due to stored fields since 8.7

2021-04-06 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315763#comment-17315763 ] ASF subversion and git services commented on LUCENE-9827: - Commi

[GitHub] [lucene] rmuir merged pull request #28: LUCENE-9827: avoid wasteful recompression for small segments

2021-04-06 Thread GitBox
rmuir merged pull request #28: URL: https://github.com/apache/lucene/pull/28 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contac

[jira] [Commented] (LUCENE-9850) Explore PFOR for Doc ID delta encoding (instead of FOR)

2021-04-06 Thread Greg Miller (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315700#comment-17315700 ] Greg Miller commented on LUCENE-9850: - Thanks [~jpountz]. I'll cleanup my code a bit

[GitHub] [lucene] mayya-sharipova commented on a change in pull request #11: LUCENE-9334 Consistency of field data structures

2021-04-06 Thread GitBox
mayya-sharipova commented on a change in pull request #11: URL: https://github.com/apache/lucene/pull/11#discussion_r607913700 ## File path: lucene/core/src/test/org/apache/lucene/document/TestPerFieldConsistency.java ## @@ -0,0 +1,202 @@ +/* + * Licensed to the Apache Softwar

[GitHub] [lucene] mayya-sharipova commented on a change in pull request #11: LUCENE-9334 Consistency of field data structures

2021-04-06 Thread GitBox
mayya-sharipova commented on a change in pull request #11: URL: https://github.com/apache/lucene/pull/11#discussion_r607913534 ## File path: lucene/core/src/test/org/apache/lucene/document/TestPerFieldConsistency.java ## @@ -0,0 +1,202 @@ +/* + * Licensed to the Apache Softwar

[GitHub] [lucene] mayya-sharipova commented on a change in pull request #11: LUCENE-9334 Consistency of field data structures

2021-04-06 Thread GitBox
mayya-sharipova commented on a change in pull request #11: URL: https://github.com/apache/lucene/pull/11#discussion_r607913125 ## File path: lucene/MIGRATE.md ## @@ -358,11 +358,21 @@ Rather, an IllegalArgumentException shall be thrown. This is introduced for bett defence and

[GitHub] [lucene] jpountz merged pull request #50: Add an example to the CacheHelper docs.

2021-04-06 Thread GitBox
jpountz merged pull request #50: URL: https://github.com/apache/lucene/pull/50 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please cont

[GitHub] [lucene] jpountz commented on pull request #50: Add an example to the CacheHelper docs.

2021-04-06 Thread GitBox
jpountz commented on pull request #50: URL: https://github.com/apache/lucene/pull/50#issuecomment-814164029 Thanks for looking. If I had to actually build such a cache, I'd probably cache per-segment instead, but that would have made the example more complex while the goal is more to show

[jira] [Commented] (LUCENE-9850) Explore PFOR for Doc ID delta encoding (instead of FOR)

2021-04-06 Thread Adrien Grand (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315606#comment-17315606 ] Adrien Grand commented on LUCENE-9850: -- bq. it seems like the index size reduction

[GitHub] [lucene] jpountz commented on a change in pull request #64: LUCENE-9907: Remove packedInts dependency on StoredFieldsFormat

2021-04-06 Thread GitBox
jpountz commented on a change in pull request #64: URL: https://github.com/apache/lucene/pull/64#discussion_r607865192 ## File path: lucene/core/src/java/org/apache/lucene/codecs/lucene90/compressing/Lucene90CompressingStoredFieldsReader.java ## @@ -374,8 +371,8 @@ private Ser

[jira] [Comment Edited] (LUCENE-9888) Re-instate CheckIndex's attempts to confirm index sort is consistent across all segments

2021-04-06 Thread Adrien Grand (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315503#comment-17315503 ] Adrien Grand edited comment on LUCENE-9888 at 4/6/21, 1:01 PM: ---

[jira] [Commented] (LUCENE-9906) TestIndexSorting.testAddIndexesWithDeletionsAndDirectory can throw error

2021-04-06 Thread Adrien Grand (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315519#comment-17315519 ] Adrien Grand commented on LUCENE-9906: -- This is due to LUCENE-9888, I left a commen

[jira] [Commented] (LUCENE-9855) Reconsider codec name VectorFormat

2021-04-06 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315504#comment-17315504 ] Tomoko Uchida commented on LUCENE-9855: --- I opened two PRs just to get the picture

[jira] [Commented] (LUCENE-9888) Re-instate CheckIndex's attempts to confirm index sort is consistent across all segments

2021-04-06 Thread Adrien Grand (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315503#comment-17315503 ] Adrien Grand commented on LUCENE-9888: -- There are some test failures due to this ch

[GitHub] [lucene] nitirajrathore commented on a change in pull request #55: LUCENE-9798 : Fix looping bug and made Full Knn calculation parallelizable

2021-04-06 Thread GitBox
nitirajrathore commented on a change in pull request #55: URL: https://github.com/apache/lucene/pull/55#discussion_r607813173 ## File path: lucene/test-framework/src/java/org/apache/lucene/util/FullKnn.java ## @@ -0,0 +1,156 @@ +package org.apache.lucene.util; + +import org.apa

[GitHub] [lucene] nitirajrathore commented on a change in pull request #55: LUCENE-9798 : Fix looping bug and made Full Knn calculation parallelizable

2021-04-06 Thread GitBox
nitirajrathore commented on a change in pull request #55: URL: https://github.com/apache/lucene/pull/55#discussion_r607812871 ## File path: lucene/test-framework/src/java/org/apache/lucene/util/FullKnn.java ## @@ -0,0 +1,156 @@ +package org.apache.lucene.util; + +import org.apa

[jira] [Created] (LUCENE-9908) Move VectorValues#search to VectorReader and LeafReader

2021-04-06 Thread Adrien Grand (Jira)
Adrien Grand created LUCENE-9908: Summary: Move VectorValues#search to VectorReader and LeafReader Key: LUCENE-9908 URL: https://issues.apache.org/jira/browse/LUCENE-9908 Project: Lucene - Core

[jira] [Commented] (LUCENE-9855) Reconsider codec name VectorFormat

2021-04-06 Thread Adrien Grand (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315486#comment-17315486 ] Adrien Grand commented on LUCENE-9855: -- bq. NeighborsFormat also doesn't feel preci

[GitHub] [lucene] mocobeta opened a new pull request #66: LUCENE-9855: Rename VectorFormat to HnswVectorsFormat

2021-04-06 Thread GitBox
mocobeta opened a new pull request #66: URL: https://github.com/apache/lucene/pull/66 https://issues.apache.org/jira/browse/LUCENE-9855 This renames two interfaces and its subclasses. - o.a.l.c.VectorFormat to HnswVectorsFormat - o.a.l.i.VectorValues to NumericVectors

[GitHub] [lucene] mocobeta opened a new pull request #65: LUCENE-9855: Rename VectorFormat to NumericVectorsFormat and VectorValuesFormat

2021-04-06 Thread GitBox
mocobeta opened a new pull request #65: URL: https://github.com/apache/lucene/pull/65 See https://issues.apache.org/jira/browse/LUCENE-9855 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the sp

[GitHub] [lucene] iverase opened a new pull request #64: LUCENE-9907: Remove packedInts dependency on StoredFieldsFormat

2021-04-06 Thread GitBox
iverase opened a new pull request #64: URL: https://github.com/apache/lucene/pull/64 This PR removes the dependency on PackedInts to the StoredFieldsFormat in favour of directReader / DirectWriter. -- This is an automated message from the Apache Git Service. To respond to the message, p

[jira] [Created] (LUCENE-9907) Remove dependency on PackedInts in all current codecs

2021-04-06 Thread Ignacio Vera (Jira)
Ignacio Vera created LUCENE-9907: Summary: Remove dependency on PackedInts in all current codecs Key: LUCENE-9907 URL: https://issues.apache.org/jira/browse/LUCENE-9907 Project: Lucene - Core

[jira] [Commented] (LUCENE-9905) Revise approach to specifying NN algorithm

2021-04-06 Thread Adrien Grand (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315359#comment-17315359 ] Adrien Grand commented on LUCENE-9905: -- +1 > Revise approach to specifying NN alg

[GitHub] [lucene] jpountz commented on pull request #11: LUCENE-9334 Consistency of field data structures

2021-04-06 Thread GitBox
jpountz commented on pull request #11: URL: https://github.com/apache/lucene/pull/11#issuecomment-813948862 > I'm also curious if we plan to enforce value consistency in a follow-up? It would be great if we could enforce value consistency too. It feels more challenging because `Index

[jira] [Comment Edited] (LUCENE-9855) Reconsider codec name VectorFormat

2021-04-06 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315328#comment-17315328 ] Tomoko Uchida edited comment on LUCENE-9855 at 4/6/21, 8:01 AM: --

[jira] [Commented] (LUCENE-9855) Reconsider codec name VectorFormat

2021-04-06 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315328#comment-17315328 ] Tomoko Uchida commented on LUCENE-9855: --- I set the Fix version of the issue to 9.0