[jira] [Commented] (LUCENE-10054) Handle hierarchy in HNSW graph

2021-10-18 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17429958#comment-17429958 ] Michael McCandless commented on LUCENE-10054: - [~mayya] this looks like an awesome

[jira] [Commented] (LUCENE-10103) QueryCache not estimating query size properly

2021-10-13 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17428222#comment-17428222 ] Michael McCandless commented on LUCENE-10103: - Thanks [~zhai7631] – I just pushed the PR to

[jira] [Commented] (LUCENE-10093) TestTieredMergePolicy.testForcedMergesUseLeastNumberOfMerges test failure

2021-10-11 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17427279#comment-17427279 ] Michael McCandless commented on LUCENE-10093: - OK I created a starter PR – still some

[jira] [Commented] (LUCENE-10093) TestTieredMergePolicy.testForcedMergesUseLeastNumberOfMerges test failure

2021-10-11 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17427245#comment-17427245 ] Michael McCandless commented on LUCENE-10093: - Also, I don't quite understand why

[jira] [Commented] (LUCENE-10093) TestTieredMergePolicy.testForcedMergesUseLeastNumberOfMerges test failure

2021-10-11 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17427243#comment-17427243 ] Michael McCandless commented on LUCENE-10093: - {quote}If a user sets maxMergedSegmentMB to

[jira] [Commented] (LUCENE-10093) TestTieredMergePolicy.testForcedMergesUseLeastNumberOfMerges test failure

2021-10-11 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17427239#comment-17427239 ] Michael McCandless commented on LUCENE-10093: - Yeah I agree with [~jpountz]'s explanation. 

[jira] [Resolved] (LUCENE-10160) TestTieredMergePolicy reproducible failure

2021-10-11 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-10160. - Resolution: Duplicate Dup of LUCENE-10093. > TestTieredMergePolicy

[jira] [Commented] (LUCENE-10093) TestTieredMergePolicy.testForcedMergesUseLeastNumberOfMerges test failure

2021-10-11 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17427101#comment-17427101 ] Michael McCandless commented on LUCENE-10093: - Thanks [~jpountz] – I'll try to dig on this

[jira] [Commented] (LUCENE-10160) TestTieredMergePolicy reproducible failure

2021-10-11 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17427098#comment-17427098 ] Michael McCandless commented on LUCENE-10160: - Thanks [~rcmuir] – [~jpountz] dug a bit on

[jira] [Commented] (LUCENE-10148) Fix DataInput/Output javadocs, MIGRATE.txt to document endianness

2021-10-05 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17424510#comment-17424510 ] Michael McCandless commented on LUCENE-10148: - +1 > Fix DataInput/Output javadocs,

[jira] [Commented] (LUCENE-10143) RateLimitedIndexOutput should delegate writeShort/writeInt/writeLong

2021-10-02 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423561#comment-17423561 ] Michael McCandless commented on LUCENE-10143: - Whoa, great catch!  I wish we could

[jira] [Commented] (LUCENE-8739) ZSTD Compressor support in Lucene

2021-09-30 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-8739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17422815#comment-17422815 ] Michael McCandless commented on LUCENE-8739: Wow, these are compelling results! Can you try

[jira] [Commented] (LUCENE-9983) Stop sorting determinize powersets unnecessarily

2021-09-29 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17422088#comment-17422088 ] Michael McCandless commented on LUCENE-9983: [~zhai7631] I think this can be resolved now?

[jira] [Commented] (LUCENE-9969) DirectoryTaxonomyReader.taxoArray占用内存较大导致系统OOM宕机

2021-09-24 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17419737#comment-17419737 ] Michael McCandless commented on LUCENE-9969: Yeah big +1 to experimenting with that!  It's

[jira] [Commented] (LUCENE-10062) Explore using SORTED_NUMERIC doc values to encode taxonomy ordinals for faceting

2021-09-24 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17419732#comment-17419732 ] Michael McCandless commented on LUCENE-10062: - {quote} it seems like there's enough benefit

[jira] [Commented] (LUCENE-10117) Add a tool to make it easy to put together perfasm output with lucene-util benchmarks.

2021-09-22 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17418734#comment-17418734 ] Michael McCandless commented on LUCENE-10117: - +1! > Add a tool to make it easy to put

[jira] [Commented] (LUCENE-10033) Encode doc values in smaller blocks of values, like postings

2021-09-14 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17415059#comment-17415059 ] Michael McCandless commented on LUCENE-10033: - Thanks for trying on this [~jpountz]!  And

[jira] [Commented] (LUCENE-10088) Too many open files in TestIndexWriterMergePolicy.testStressUpdateSameDocumentWithMergeOnGetReader

2021-09-09 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17412688#comment-17412688 ] Michael McCandless commented on LUCENE-10088: - +1 to add that one-liner corner-case

[jira] [Created] (LUCENE-10093) TestTieredMergePolicy.testForcedMergesUseLeastNumberOfMerges test failure

2021-09-09 Thread Michael McCandless (Jira)
Michael McCandless created LUCENE-10093: --- Summary: TestTieredMergePolicy.testForcedMergesUseLeastNumberOfMerges test failure Key: LUCENE-10093 URL: https://issues.apache.org/jira/browse/LUCENE-10093

[jira] [Resolved] (LUCENE-10092) TestCheckIndex failure

2021-09-08 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-10092. - Fix Version/s: main (9.0) Resolution: Fixed Thanks [~jpountz]. We will

[jira] [Commented] (LUCENE-10092) TestCheckIndex failure

2021-09-08 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17412108#comment-17412108 ] Michael McCandless commented on LUCENE-10092: - No worries [~zacharymorn] – Lucene's

[jira] [Commented] (LUCENE-10088) Too many open files in TestIndexWriterMergePolicy.testStressUpdateSameDocumentWithMergeOnGetReader

2021-09-08 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411913#comment-17411913 ] Michael McCandless commented on LUCENE-10088: - OK the 8.x failure repros for me: {noformat}

[jira] [Commented] (LUCENE-10088) Too many open files in TestIndexWriterMergePolicy.testStressUpdateSameDocumentWithMergeOnGetReader

2021-09-08 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411911#comment-17411911 ] Michael McCandless commented on LUCENE-10088: - Yeah now that we understand the (exotic)

[jira] [Commented] (LUCENE-10092) TestCheckIndex failure

2021-09-08 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411909#comment-17411909 ] Michael McCandless commented on LUCENE-10092: - This seems to work: {noformat} diff --git

[jira] [Commented] (LUCENE-10092) TestCheckIndex failure

2021-09-08 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411908#comment-17411908 ] Michael McCandless commented on LUCENE-10092: - I added {{-Dtests.verbose=true}} and

[jira] [Commented] (LUCENE-10092) TestCheckIndex failure

2021-09-08 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411907#comment-17411907 ] Michael McCandless commented on LUCENE-10092: - Repros for me ... looks like a test bug. 

[jira] [Commented] (LUCENE-10010) Should we have a NFA Query?

2021-09-08 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411882#comment-17411882 ] Michael McCandless commented on LUCENE-10010: - Thanks for the update [~zhai7631]!  Indeed

[jira] [Commented] (LUCENE-10091) Fix some old errors in the benchmark module

2021-09-08 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411881#comment-17411881 ] Michael McCandless commented on LUCENE-10091: - Thanks [~xiaoshi_2021]!  You are right, the

[jira] [Commented] (LUCENE-9662) CheckIndex should be concurrent

2021-09-08 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411880#comment-17411880 ] Michael McCandless commented on LUCENE-9662: {quote}it seems like it's ok for us to just

[jira] [Commented] (LUCENE-10088) Too many open files in TestIndexWriterMergePolicy.testStressUpdateSameDocumentWithMergeOnGetReader

2021-09-07 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411339#comment-17411339 ] Michael McCandless commented on LUCENE-10088: - Ooooh I like that theory [~jpountz] – so IW

[jira] [Commented] (LUCENE-9620) Add Weight#count(LeafReaderContext)

2021-09-07 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411255#comment-17411255 ] Michael McCandless commented on LUCENE-9620: Can this one be resolved now?  Or we are

[jira] [Commented] (LUCENE-10088) Too many open files in TestIndexWriterMergePolicy.testStressUpdateSameDocumentWithMergeOnGetReader

2021-09-07 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411250#comment-17411250 ] Michael McCandless commented on LUCENE-10088: - Or, maybe, with this seed, this test is

[jira] [Commented] (LUCENE-10088) Too many open files in TestIndexWriterMergePolicy.testStressUpdateSameDocumentWithMergeOnGetReader

2021-09-07 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411249#comment-17411249 ] Michael McCandless commented on LUCENE-10088: - Hmm, I instrumented {{MockDirectoryWrapper}}

[jira] [Commented] (LUCENE-9662) CheckIndex should be concurrent

2021-09-07 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411208#comment-17411208 ] Michael McCandless commented on LUCENE-9662: {quote}What do you think? Would you recommend

[jira] [Commented] (LUCENE-9662) CheckIndex should be concurrent

2021-09-07 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411204#comment-17411204 ] Michael McCandless commented on LUCENE-9662: {quote}To increase its concurrency for nightly

[jira] [Commented] (LUCENE-10088) Too many open files in TestIndexWriterMergePolicy.testStressUpdateSameDocumentWithMergeOnGetReader

2021-09-07 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411194#comment-17411194 ] Michael McCandless commented on LUCENE-10088: - Hmm, another test, on 8.x branch, is also

[jira] [Commented] (LUCENE-10088) Too many open files in TestIndexWriterMergePolicy.testStressUpdateSameDocumentWithMergeOnGetReader

2021-09-07 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411183#comment-17411183 ] Michael McCandless commented on LUCENE-10088: - Ugh, adding {{-Dtests.verbose=true}} causes

[jira] [Commented] (LUCENE-10088) Too many open files in TestIndexWriterMergePolicy.testStressUpdateSameDocumentWithMergeOnGetReader

2021-09-07 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411178#comment-17411178 ] Michael McCandless commented on LUCENE-10088: - {noformat}    >    

[jira] [Created] (LUCENE-10088) Too many open files in TestIndexWriterMergePolicy.testStressUpdateSameDocumentWithMergeOnGetReader

2021-09-07 Thread Michael McCandless (Jira)
Michael McCandless created LUCENE-10088: --- Summary: Too many open files in TestIndexWriterMergePolicy.testStressUpdateSameDocumentWithMergeOnGetReader Key: LUCENE-10088 URL:

[jira] [Commented] (LUCENE-10081) KoreanTokenizer should check the max backtrace gap on whitespaces

2021-09-07 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411175#comment-17411175 ] Michael McCandless commented on LUCENE-10081: - Good catch!  Thanks [~jimczi]. Hmm, was

[jira] [Updated] (LUCENE-10035) Simple text codec add multi level skip list data

2021-09-03 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-10035: Fix Version/s: 8.10 > Simple text codec add multi level skip list data >

[jira] [Commented] (LUCENE-10035) Simple text codec add multi level skip list data

2021-09-03 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17409510#comment-17409510 ] Michael McCandless commented on LUCENE-10035: - I felt bad that we were not going to

[jira] [Commented] (LUCENE-9662) CheckIndex should be concurrent

2021-09-02 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17408766#comment-17408766 ] Michael McCandless commented on LUCENE-9662: Hmm, it looks like we didn't fix the {{Usage:

[jira] [Commented] (LUCENE-9662) CheckIndex should be concurrent

2021-09-02 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17408746#comment-17408746 ] Michael McCandless commented on LUCENE-9662: Whoa, look [how much faster {{CheckIndex}}

[jira] [Commented] (LUCENE-10079) DocValues new iterator API is missing in migration guide

2021-08-31 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17407411#comment-17407411 ] Michael McCandless commented on LUCENE-10079: - Hrmph, you are right!  We clearly should

[jira] [Commented] (LUCENE-9969) DirectoryTaxonomyReader.taxoArray占用内存较大导致系统OOM宕机

2021-08-31 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17407363#comment-17407363 ] Michael McCandless commented on LUCENE-9969: Imagine we had a {{NUMERIC}} doc values field,

[jira] [Created] (LUCENE-10080) Use a bit set to count long-tail of singleton FacetLabels?

2021-08-31 Thread Michael McCandless (Jira)
Michael McCandless created LUCENE-10080: --- Summary: Use a bit set to count long-tail of singleton FacetLabels? Key: LUCENE-10080 URL: https://issues.apache.org/jira/browse/LUCENE-10080 Project:

[jira] [Commented] (LUCENE-10035) Simple text codec add multi level skip list data

2021-08-30 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17406814#comment-17406814 ] Michael McCandless commented on LUCENE-10035: - OK fair enough :)  We should get 9.0 out

[jira] [Commented] (LUCENE-10078) Enable merge-on-refresh by default?

2021-08-30 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17406811#comment-17406811 ] Michael McCandless commented on LUCENE-10078: - {quote}Let's set a low-ish default value,

[jira] [Commented] (LUCENE-9969) DirectoryTaxonomyReader.taxoArray占用内存较大导致系统OOM宕机

2021-08-30 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17406807#comment-17406807 ] Michael McCandless commented on LUCENE-9969: Let's try to find a better data-structure to do

[jira] [Commented] (LUCENE-10035) Simple text codec add multi level skip list data

2021-08-30 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17406800#comment-17406800 ] Michael McCandless commented on LUCENE-10035: - Woot, thank you [~wuda0112] and [~jpountz]! 

[jira] [Resolved] (LUCENE-8723) Bad interaction bewteen WordDelimiterGraphFilter, StopFilter and FlattenGraphFilter

2021-08-30 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-8723. Fix Version/s: 8.10 main (9.0) Resolution: Fixed > Bad

[jira] [Commented] (LUCENE-8723) Bad interaction bewteen WordDelimiterGraphFilter, StopFilter and FlattenGraphFilter

2021-08-30 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-8723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17406798#comment-17406798 ] Michael McCandless commented on LUCENE-8723: Oh that's great news [~Geoffrey Lawson] – also

[jira] [Created] (LUCENE-10078) Enable merge-on-refresh by default?

2021-08-30 Thread Michael McCandless (Jira)
Michael McCandless created LUCENE-10078: --- Summary: Enable merge-on-refresh by default? Key: LUCENE-10078 URL: https://issues.apache.org/jira/browse/LUCENE-10078 Project: Lucene - Core

[jira] [Commented] (LUCENE-10073) Allow very small merges to merge more than segmentsPerTier segments?

2021-08-30 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17406796#comment-17406796 ] Michael McCandless commented on LUCENE-10073: - {quote}{quote}We might also enable

[jira] [Commented] (LUCENE-10068) Switch to a "double barrel" HPPC cache for the taxonomy LRU cache

2021-08-29 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17406403#comment-17406403 ] Michael McCandless commented on LUCENE-10068: - Yeah that's a good point [~rcmuir] – maybe

[jira] [Commented] (LUCENE-10068) Switch to a "double barrel" HPPC cache for the taxonomy LRU cache

2021-08-28 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17406249#comment-17406249 ] Michael McCandless commented on LUCENE-10068: - It's also possible HPPC has some collection

[jira] [Commented] (LUCENE-10070) "count all" faceting functionality counts deleted docs for multiple implementations

2021-08-28 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17406245#comment-17406245 ] Michael McCandless commented on LUCENE-10070: - Wow, good catch! > "count all" faceting

[jira] [Commented] (LUCENE-10071) Review and refactor synchronization handling between MockDirectoryWrapper and CheckIndex

2021-08-28 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17406244#comment-17406244 ] Michael McCandless commented on LUCENE-10071: - Ideally we would be able to sometimes use

[jira] [Updated] (LUCENE-10058) lucene main(9.0) run ./gradlew lucene:benchmark:run error

2021-08-28 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-10058: Fix Version/s: main (9.0) Resolution: Fixed Status: Resolved

[jira] [Resolved] (LUCENE-10051) lucene branch_8x run ant run-task error

2021-08-28 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-10051. - Fix Version/s: 8.10 Resolution: Fixed > lucene branch_8x run ant

[jira] [Commented] (LUCENE-10051) lucene branch_8x run ant run-task error

2021-08-28 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17406237#comment-17406237 ] Michael McCandless commented on LUCENE-10051: - Thanks [~xiaoshi_2021]! > lucene branch_8x

[jira] [Reopened] (LUCENE-9969) DirectoryTaxonomyReader.taxoArray占用内存较大导致系统OOM宕机

2021-08-28 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reopened LUCENE-9969: > DirectoryTaxonomyReader.taxoArray占用内存较大导致系统OOM宕机 >

[jira] [Commented] (LUCENE-9969) DirectoryTaxonomyReader.taxoArray占用内存较大导致系统OOM宕机

2021-08-28 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17406232#comment-17406232 ] Michael McCandless commented on LUCENE-9969: I think we should reopen this

[jira] [Commented] (LUCENE-10067) investigate 6/23/2021 -> 6/24/2021 drop in facets perf

2021-08-28 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17406228#comment-17406228 ] Michael McCandless commented on LUCENE-10067: - Yes, thank you [~jpountz]!  What an awesome

[jira] [Commented] (LUCENE-10059) Assertion error in JapaneseTokenizer backtrace

2021-08-28 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17406227#comment-17406227 ] Michael McCandless commented on LUCENE-10059: - Did we already backport this fix to 8.10 as

[jira] [Commented] (LUCENE-10074) Remove unneeded default value assignment

2021-08-27 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17405787#comment-17405787 ] Michael McCandless commented on LUCENE-10074: - This is maybe controversial, but it bugs me

[jira] [Commented] (LUCENE-10073) Allow very small merges to merge more than segmentsPerTier segments?

2021-08-27 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17405780#comment-17405780 ] Michael McCandless commented on LUCENE-10073: - +1, I think that makes sense. I wish we had

[jira] [Commented] (LUCENE-10067) investigate 6/23/2021 -> 6/24/2021 drop in facets perf

2021-08-25 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17404428#comment-17404428 ] Michael McCandless commented on LUCENE-10067: - {quote}It's a good thing they were running

[jira] [Commented] (LUCENE-10067) investigate 6/23/2021 -> 6/24/2021 drop in facets perf

2021-08-25 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17404405#comment-17404405 ] Michael McCandless commented on LUCENE-10067: - Thanks [~rcmuir] – I added a couple

[jira] [Commented] (LUCENE-10067) investigate 6/23/2021 -> 6/24/2021 drop in facets perf

2021-08-25 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17404406#comment-17404406 ] Michael McCandless commented on LUCENE-10067: - And thank you nightly benchmarks for

[jira] [Commented] (LUCENE-9613) Create blocks for ords when it helps in Lucene80DocValuesFormat

2021-08-25 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17404360#comment-17404360 ] Michael McCandless commented on LUCENE-9613: I am not certain, but this change was likely

[jira] [Commented] (LUCENE-10068) Switch to a "double barrel" HPPC cache for the taxonomy LRU cache

2021-08-24 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17404030#comment-17404030 ] Michael McCandless commented on LUCENE-10068: - Even though this "double" map caching

[jira] [Commented] (LUCENE-9972) Performance regression in NRTCachingDirectory

2021-08-24 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17403914#comment-17403914 ] Michael McCandless commented on LUCENE-9972: {quote}I guess we could start writing on the

[jira] [Commented] (LUCENE-5309) when using SortedSetDV faceting, specialize the case when all docs are single-valued

2021-08-24 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17403910#comment-17403910 ] Michael McCandless commented on LUCENE-5309: This caused a nice jump in SSDV facets in the

[jira] [Commented] (LUCENE-10062) Explore using SORTED_NUMERIC doc values to encode taxonomy ordinals for faceting

2021-08-24 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17403906#comment-17403906 ] Michael McCandless commented on LUCENE-10062: - +1, this is a great idea! > Explore using

[jira] [Commented] (LUCENE-9963) Flatten graph filter has errors when there are holes at beginning or end of alternate paths

2021-08-24 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17403811#comment-17403811 ] Michael McCandless commented on LUCENE-9963: [~Geoffrey Lawson] – I think this one can be

[jira] [Commented] (LUCENE-9972) Performance regression in NRTCachingDirectory

2021-08-24 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17403804#comment-17403804 ] Michael McCandless commented on LUCENE-9972: Hmm this issue remains open, and the workaround

[jira] [Commented] (LUCENE-5309) when using SortedSetDV faceting, specialize the case when all docs are single-valued

2021-08-23 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17403121#comment-17403121 ] Michael McCandless commented on LUCENE-5309: Wow, that is a surprisingly powerful

[jira] [Commented] (LUCENE-5309) when using SortedSetDV faceting, specialize the case when all docs are single-valued

2021-08-20 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17402385#comment-17402385 ] Michael McCandless commented on LUCENE-5309: Woot!  Thanks [~gsmiller]! > when using

[jira] [Commented] (LUCENE-10052) Add LuceneTestCase.newBytesRef methods

2021-08-17 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17400387#comment-17400387 ] Michael McCandless commented on LUCENE-10052: - I have a quick initial PR for this, adding

[jira] [Created] (LUCENE-10052) Add LuceneTestCase.newBytesRef methods

2021-08-17 Thread Michael McCandless (Jira)
Michael McCandless created LUCENE-10052: --- Summary: Add LuceneTestCase.newBytesRef methods Key: LUCENE-10052 URL: https://issues.apache.org/jira/browse/LUCENE-10052 Project: Lucene - Core

[jira] [Assigned] (LUCENE-10052) Add LuceneTestCase.newBytesRef methods

2021-08-17 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned LUCENE-10052: --- Assignee: Michael McCandless > Add LuceneTestCase.newBytesRef methods >

[jira] [Commented] (LUCENE-9802) Switch to new logo

2021-08-17 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17400384#comment-17400384 ] Michael McCandless commented on LUCENE-9802: Oooh yes thank you! > Switch to new logo >

[jira] [Updated] (LUCENE-10014) docvalue writeBlock gcd encode improve

2021-08-16 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-10014: Fix Version/s: 8.10 9.0 Resolution: Fixed

[jira] [Commented] (LUCENE-10014) docvalue writeBlock gcd encode improve

2021-08-16 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17399722#comment-17399722 ] Michael McCandless commented on LUCENE-10014: - OK patch looks good to me – thanks

[jira] [Commented] (LUCENE-10014) docvalue writeBlock gcd encode improve

2021-08-16 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17399712#comment-17399712 ] Michael McCandless commented on LUCENE-10014: - Thanks [~weizijun] – I'll have a look at the

[jira] [Commented] (LUCENE-10048) Bypass total frequency check if field uses custom term frequency

2021-08-15 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17399349#comment-17399349 ] Michael McCandless commented on LUCENE-10048: - {quote}On that note, maybe the right

[jira] [Commented] (LUCENE-10048) Bypass total frequency check if field uses custom term frequency

2021-08-13 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17398655#comment-17398655 ] Michael McCandless commented on LUCENE-10048: - Some brief historical context: LUCENE-8947

[jira] [Commented] (LUCENE-7020) TieredMergePolicy - cascade maxMergeAtOnce setting to maxMergeAtOnceExplicit

2021-07-29 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-7020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17390042#comment-17390042 ] Michael McCandless commented on LUCENE-7020: +1 to remove. My only worry is crazy abusive

[jira] [Commented] (LUCENE-10033) Encode doc values in smaller blocks of values, like postings

2021-07-29 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17389864#comment-17389864 ] Michael McCandless commented on LUCENE-10033: - {quote}But the former tend to be slower

[jira] [Commented] (LUCENE-10030) [DrillSidewaysScorer] redundant score() calculations in doQueryFirstScoring

2021-07-27 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17388092#comment-17388092 ] Michael McCandless commented on LUCENE-10030: - bq. Btw, should we have added changes to

[jira] [Commented] (LUCENE-9450) Taxonomy index should use DocValues not StoredFields

2021-07-22 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385612#comment-17385612 ] Michael McCandless commented on LUCENE-9450: OK, +1 for Scenario 2 (use index created

[jira] [Commented] (LUCENE-9450) Taxonomy index should use DocValues not StoredFields

2021-07-20 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17384433#comment-17384433 ] Michael McCandless commented on LUCENE-9450: Ahh that is a compelling option too [~jpountz].

[jira] [Commented] (LUCENE-9450) Taxonomy index should use DocValues not StoredFields

2021-07-20 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17384320#comment-17384320 ] Michael McCandless commented on LUCENE-9450: Catching up here... So yeah as things stand,

[jira] [Commented] (LUCENE-10010) Should we have a NFA Query?

2021-07-20 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17384259#comment-17384259 ] Michael McCandless commented on LUCENE-10010: - bq. While in the NFA query, ideally we don't

[jira] [Updated] (LUCENE-9619) Move Points from a visitor API to a cursor-style API?

2021-07-16 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-9619: --- Summary: Move Points from a visitor API to a cursor-style API? (was: Move Points

[jira] [Commented] (LUCENE-10021) Upgrade HPPC to 0.9.0

2021-07-08 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17377573#comment-17377573 ] Michael McCandless commented on LUCENE-10021: - I think it's fine to just do this upgrade

[jira] [Resolved] (LUCENE-10009) wrong message in Lucene 7.4.0

2021-06-25 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-10009. - Fix Version/s: 8.10 main (9.0) Resolution: Fixed

[jira] [Commented] (LUCENE-10009) wrong message in Lucene 7.4.0

2021-06-25 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17369501#comment-17369501 ] Michael McCandless commented on LUCENE-10009: - Whoa, good catch!  This bug is still present

<    1   2   3   4   5   6   7   >