[GitHub] [lucene] zacharymorn commented on a change in pull request #240: LUCENE-10002: Deprecate IndexSearch#search(Query, Collector) in favor of IndexSearcher#search(Query, CollectorManager)

2021-08-20 Thread GitBox
zacharymorn commented on a change in pull request #240: URL: https://github.com/apache/lucene/pull/240#discussion_r692722475 ## File path: lucene/benchmark/src/java/org/apache/lucene/benchmark/byTask/tasks/ReadTask.java ## @@ -180,6 +185,7 @@ protected int withTopDocs(IndexSea

[GitHub] [lucene] zacharymorn commented on a change in pull request #240: LUCENE-10002: Deprecate IndexSearch#search(Query, Collector) in favor of IndexSearcher#search(Query, CollectorManager)

2021-08-20 Thread GitBox
zacharymorn commented on a change in pull request #240: URL: https://github.com/apache/lucene/pull/240#discussion_r692722544 ## File path: lucene/core/src/java/org/apache/lucene/search/IndexSearcher.java ## @@ -527,7 +503,10 @@ public TopDocs search(Query query, int n) throws

[GitHub] [lucene] zacharymorn commented on pull request #240: LUCENE-10002: Deprecate IndexSearch#search(Query, Collector) in favor of IndexSearcher#search(Query, CollectorManager)

2021-08-20 Thread GitBox
zacharymorn commented on pull request #240: URL: https://github.com/apache/lucene/pull/240#issuecomment-902489306 > Left a few comments. Looks good overall. Lots of little changes! Those test cases... :) Thanks @gsmiller for the review! Yes there are indeed many small changes, and m

[GitHub] [lucene] mikemccand commented on pull request #254: LUCENE-10059: Fix AssertionError in JapaneseTokenizer backtrace

2021-08-20 Thread GitBox
mikemccand commented on pull request #254: URL: https://github.com/apache/lucene/pull/254#issuecomment-902651596 In the mean time, @dungba88 could you please also open a backport PR for Lucene/Solr 8.x? It is a different git repository (https://github.com/apache/lucene-solr/tree/branch_8x

[GitHub] [lucene] mikemccand merged pull request #254: LUCENE-10059: Fix AssertionError in JapaneseTokenizer backtrace

2021-08-20 Thread GitBox
mikemccand merged pull request #254: URL: https://github.com/apache/lucene/pull/254 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubsc

[jira] [Commented] (LUCENE-10059) Assertion error in JapaneseTokenizer backtrace

2021-08-20 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17402181#comment-17402181 ] ASF subversion and git services commented on LUCENE-10059: -- Co

[GitHub] [lucene] mikemccand commented on a change in pull request #179: LUCENE-9476: Add getBulkPath API to DirectoryTaxonomyReader

2021-08-20 Thread GitBox
mikemccand commented on a change in pull request #179: URL: https://github.com/apache/lucene/pull/179#discussion_r69227 ## File path: lucene/CHANGES.txt ## @@ -137,6 +137,9 @@ API Changes Improvements +* LUCENE-9476: Add new getBulkPath API to DirectoryTaxonomyReader t

[jira] [Commented] (LUCENE-9918) Can PForUtil be further auto-vectorized?

2021-08-20 Thread Greg Miller (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17402203#comment-17402203 ] Greg Miller commented on LUCENE-9918: - [~gworah] could be worth trying. I wasn't awa

[jira] [Commented] (LUCENE-9969) DirectoryTaxonomyReader.taxoArray占用内存较大导致系统OOM宕机

2021-08-20 Thread Greg Miller (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17402212#comment-17402212 ] Greg Miller commented on LUCENE-9969: - [~chengfengfeng] did you see [~gworah]'s sugg

[jira] [Assigned] (LUCENE-5309) when using SortedSetDV faceting, specialize the case when all docs are single-valued

2021-08-20 Thread Greg Miller (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Miller reassigned LUCENE-5309: --- Assignee: Greg Miller > when using SortedSetDV faceting, specialize the case when all docs

[jira] [Commented] (LUCENE-5309) when using SortedSetDV faceting, specialize the case when all docs are single-valued

2021-08-20 Thread Greg Miller (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17402235#comment-17402235 ] Greg Miller commented on LUCENE-5309: - This is a small change and I'll give it a try

[GitHub] [lucene] dungba88 commented on pull request #254: LUCENE-10059: Fix AssertionError in JapaneseTokenizer backtrace

2021-08-20 Thread GitBox
dungba88 commented on pull request #254: URL: https://github.com/apache/lucene/pull/254#issuecomment-902731868 @mikemccand Thanks for reviewing and merging! I'll prepare a backport PR for version 8.x -- This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [lucene-solr] dungba88 opened a new pull request #2557: LUCENE-10059: Fix AssertionError in JapaneseTokenizer backtrace

2021-08-20 Thread GitBox
dungba88 opened a new pull request #2557: URL: https://github.com/apache/lucene-solr/pull/2557 # Description This is a backport of https://github.com/apache/lucene/pull/254 There is an issue which causes an `AssertionError` in the backtrace step of `JapaneseTokenizer`. If ther

[GitHub] [lucene] dungba88 commented on pull request #254: LUCENE-10059: Fix AssertionError in JapaneseTokenizer backtrace

2021-08-20 Thread GitBox
dungba88 commented on pull request #254: URL: https://github.com/apache/lucene/pull/254#issuecomment-902748267 I've opened a backport PR for 8.x: https://github.com/apache/lucene-solr/pull/2557 -- This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [lucene] msokolov commented on pull request #251: LUCENE-10040: Temporarily disable test assertion

2021-08-20 Thread GitBox
msokolov commented on pull request #251: URL: https://github.com/apache/lucene/pull/251#issuecomment-902778915 OK, so it's a real weird behavior that manifests for very large K or very small N, weird cases. I agree, let's relax the assertion. I am not sure this will be fixed by a multi-lev

[GitHub] [lucene] msokolov commented on a change in pull request #251: LUCENE-10040: Temporarily disable test assertion

2021-08-20 Thread GitBox
msokolov commented on a change in pull request #251: URL: https://github.com/apache/lucene/pull/251#discussion_r693039774 ## File path: lucene/core/src/test/org/apache/lucene/search/TestKnnVectorQuery.java ## @@ -350,7 +350,9 @@ public void testDeletes() throws IOException {

[GitHub] [lucene] msokolov commented on a change in pull request #250: LUCENE-10054 Make HnswGraph hierarchical

2021-08-20 Thread GitBox
msokolov commented on a change in pull request #250: URL: https://github.com/apache/lucene/pull/250#discussion_r693041906 ## File path: lucene/core/src/java/org/apache/lucene/util/hnsw/HnswGraph.java ## @@ -56,22 +56,28 @@ public final class HnswGraph extends KnnGraphValues {

[jira] [Commented] (LUCENE-10054) Handle hierarchy in HNSW graph

2021-08-20 Thread Michael Sokolov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17402289#comment-17402289 ] Michael Sokolov commented on LUCENE-10054: -- Thanks for looking into this! cou

[jira] [Commented] (LUCENE-10033) Encode doc values in smaller blocks of values, like postings

2021-08-20 Thread Greg Miller (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17402311#comment-17402311 ] Greg Miller commented on LUCENE-10033: -- HI [~weizijun], just my opinion, but I thi

[jira] [Commented] (LUCENE-5309) when using SortedSetDV faceting, specialize the case when all docs are single-valued

2021-08-20 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17402385#comment-17402385 ] Michael McCandless commented on LUCENE-5309: Woot!  Thanks [~gsmiller]! > w

[jira] [Commented] (LUCENE-5309) when using SortedSetDV faceting, specialize the case when all docs are single-valued

2021-08-20 Thread Greg Miller (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17402386#comment-17402386 ] Greg Miller commented on LUCENE-5309: - Hmm, well it looks like there might be a nice

[GitHub] [lucene] gsmiller opened a new pull request #255: Lucene-5309: Optimize facet counting for single-valued SSDV / StringValueFacetCounts

2021-08-20 Thread GitBox
gsmiller opened a new pull request #255: URL: https://github.com/apache/lucene/pull/255 # Description This change introduces special-case logic when facet counting on a `SortedDocValues` field. Instead of using the more-general logic that supports multi-valued fields (i.e., `SortedS

[GitHub] [lucene] gsmiller commented on a change in pull request #255: Lucene-5309: Optimize facet counting for single-valued SSDV / StringValueFacetCounts

2021-08-20 Thread GitBox
gsmiller commented on a change in pull request #255: URL: https://github.com/apache/lucene/pull/255#discussion_r693177246 ## File path: lucene/core/src/java/org/apache/lucene/index/MultiDocValues.java ## @@ -82,7 +82,6 @@ public int nextDoc() throws IOException { i

[GitHub] [lucene] gsmiller commented on a change in pull request #255: Lucene-5309: Optimize facet counting for single-valued SSDV / StringValueFacetCounts

2021-08-20 Thread GitBox
gsmiller commented on a change in pull request #255: URL: https://github.com/apache/lucene/pull/255#discussion_r693177372 ## File path: lucene/core/src/java/org/apache/lucene/index/MultiDocValues.java ## @@ -452,7 +449,6 @@ public static SortedNumericDocValues getSortedNumericV

[GitHub] [lucene] gsmiller commented on a change in pull request #255: Lucene-5309: Optimize facet counting for single-valued SSDV / StringValueFacetCounts

2021-08-20 Thread GitBox
gsmiller commented on a change in pull request #255: URL: https://github.com/apache/lucene/pull/255#discussion_r693177498 ## File path: lucene/core/src/java/org/apache/lucene/index/MultiDocValues.java ## @@ -680,9 +673,9 @@ public static SortedSetDocValues getSortedSetValues(fi

[jira] [Comment Edited] (LUCENE-10054) Handle hierarchy in HNSW graph

2021-08-20 Thread Mayya Sharipova (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17401172#comment-17401172 ] Mayya Sharipova edited comment on LUCENE-10054 at 8/20/21, 7:39 PM: -

[jira] [Comment Edited] (LUCENE-10054) Handle hierarchy in HNSW graph

2021-08-20 Thread Mayya Sharipova (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17401172#comment-17401172 ] Mayya Sharipova edited comment on LUCENE-10054 at 8/20/21, 7:40 PM: -

[jira] [Commented] (LUCENE-10054) Handle hierarchy in HNSW graph

2021-08-20 Thread Mayya Sharipova (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17402399#comment-17402399 ] Mayya Sharipova commented on LUCENE-10054: -- [~sokolov] Thanks your for your fe

[GitHub] [lucene] gsmiller commented on a change in pull request #255: Lucene-5309: Optimize facet counting for single-valued SSDV / StringValueFacetCounts

2021-08-20 Thread GitBox
gsmiller commented on a change in pull request #255: URL: https://github.com/apache/lucene/pull/255#discussion_r693200369 ## File path: lucene/CHANGES.txt ## @@ -415,6 +415,8 @@ Improvements This prevents caching a query clause when it is much more expensive than running

[jira] [Commented] (LUCENE-5309) when using SortedSetDV faceting, specialize the case when all docs are single-valued

2021-08-20 Thread Greg Miller (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17402405#comment-17402405 ] Greg Miller commented on LUCENE-5309: - Went ahead and put a [PR|https://github.com/a

[jira] [Updated] (LUCENE-5309) when using SortedSetDV faceting, specialize the case when all docs are single-valued

2021-08-20 Thread Greg Miller (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Miller updated LUCENE-5309: Status: Patch Available (was: Open) > when using SortedSetDV faceting, specialize the case when a

[jira] [Commented] (LUCENE-10017) Do not check footers on Lucene7x segments when throwing IndexFormatTooOld

2021-08-20 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17402411#comment-17402411 ] ASF subversion and git services commented on LUCENE-10017: -- Co

[GitHub] [lucene] madrob merged pull request #200: LUCENE-10017 Less verbose exception on IndexFormatTooOld

2021-08-20 Thread GitBox
madrob merged pull request #200: URL: https://github.com/apache/lucene/pull/200 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...

[jira] [Resolved] (LUCENE-10017) Do not check footers on Lucene7x segments when throwing IndexFormatTooOld

2021-08-20 Thread Mike Drob (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mike Drob resolved LUCENE-10017. Fix Version/s: main (9.0) Assignee: Mike Drob Resolution: Fixed > Do not check fo

[jira] [Commented] (LUCENE-5309) when using SortedSetDV faceting, specialize the case when all docs are single-valued

2021-08-20 Thread Greg Miller (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17402468#comment-17402468 ] Greg Miller commented on LUCENE-5309: - I kicked off a run of {{wikimediumall}} as we