[jira] [Resolved] (LUCENE-10485) Limit o.a.l.a.morph package visibility only to kuromoji and nori

2022-03-31 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tomoko Uchida resolved LUCENE-10485. Resolution: Won't Fix I'm closing this (at this moment). Instead of trying to hide the `

[GitHub] [lucene] zacharymorn commented on a change in pull request #767: LUCENE-10436: Deprecate DocValuesFieldExistsQuery, NormsFieldExistsQuery and KnnVectorFieldExistsQuery with FieldExistsQuery

2022-03-31 Thread GitBox
zacharymorn commented on a change in pull request #767: URL: https://github.com/apache/lucene/pull/767#discussion_r840250256 ## File path: lucene/core/src/java/org/apache/lucene/search/DocValuesFieldExistsQuery.java ## @@ -31,42 +28,21 @@ /** * A {@link Query} that matches

[GitHub] [lucene] zacharymorn commented on a change in pull request #767: LUCENE-10436: Deprecate DocValuesFieldExistsQuery, NormsFieldExistsQuery and KnnVectorFieldExistsQuery with FieldExistsQuery

2022-03-31 Thread GitBox
zacharymorn commented on a change in pull request #767: URL: https://github.com/apache/lucene/pull/767#discussion_r840249824 ## File path: lucene/core/src/java/org/apache/lucene/search/DocValuesFieldExistsQuery.java ## @@ -31,42 +28,21 @@ /** * A {@link Query} that matches

[jira] [Comment Edited] (LUCENE-10485) Limit o.a.l.a.morph package visibility only to kuromoji and nori

2022-03-31 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515701#comment-17515701 ] Tomoko Uchida edited comment on LUCENE-10485 at 4/1/22 5:22 AM: -

[jira] [Commented] (LUCENE-10485) Limit o.a.l.a.morph package visibility only to kuromoji and nori

2022-03-31 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515701#comment-17515701 ] Tomoko Uchida commented on LUCENE-10485: Note: Found some IDE issues with white

[GitHub] [lucene] nknize commented on a change in pull request #749: LUCENE-10466: IndexSortSortedNumericDocValuesRangeQuery unconditionally assumes the usage of the LONG-encoded SortField

2022-03-31 Thread GitBox
nknize commented on a change in pull request #749: URL: https://github.com/apache/lucene/pull/749#discussion_r840219946 ## File path: lucene/sandbox/src/test/org/apache/lucene/sandbox/search/TestIndexSortSortedNumericDocValuesRangeQuery.java ## @@ -155,15 +156,23 @@ public voi

[GitHub] [lucene] reta commented on a change in pull request #749: LUCENE-10466: IndexSortSortedNumericDocValuesRangeQuery unconditionally assumes the usage of the LONG-encoded SortField

2022-03-31 Thread GitBox
reta commented on a change in pull request #749: URL: https://github.com/apache/lucene/pull/749#discussion_r840135696 ## File path: lucene/sandbox/src/java/org/apache/lucene/sandbox/search/IndexSortSortedNumericDocValuesRangeQuery.java ## @@ -250,6 +255,10 @@ private BoundedDo

[GitHub] [lucene] reta commented on a change in pull request #749: LUCENE-10466: IndexSortSortedNumericDocValuesRangeQuery unconditionally assumes the usage of the LONG-encoded SortField

2022-03-31 Thread GitBox
reta commented on a change in pull request #749: URL: https://github.com/apache/lucene/pull/749#discussion_r840135295 ## File path: lucene/sandbox/src/test/org/apache/lucene/sandbox/search/TestIndexSortSortedNumericDocValuesRangeQuery.java ## @@ -155,15 +156,23 @@ public void

[GitHub] [lucene] jtibshirani commented on a change in pull request #749: LUCENE-10466: IndexSortSortedNumericDocValuesRangeQuery unconditionally assumes the usage of the LONG-encoded SortField

2022-03-31 Thread GitBox
jtibshirani commented on a change in pull request #749: URL: https://github.com/apache/lucene/pull/749#discussion_r840121029 ## File path: lucene/sandbox/src/java/org/apache/lucene/sandbox/search/IndexSortSortedNumericDocValuesRangeQuery.java ## @@ -70,6 +72,9 @@ * fie

[GitHub] [lucene] Yuti-G commented on pull request #779: LUCENE-10488: Optimize Facets#getTopDims in IntTaxonomyFacets

2022-03-31 Thread GitBox
Yuti-G commented on pull request #779: URL: https://github.com/apache/lucene/pull/779#issuecomment-1085135788 I ran the benchmark and no regression found, please see attached: TaskQPS baseline StdDevQPS candidate StdDev Pct diff p-

[jira] [Comment Edited] (LUCENE-10488) Optimize Facets#getTopDims across Facets implementations

2022-03-31 Thread Yuting Gan (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515594#comment-17515594 ] Yuting Gan edited comment on LUCENE-10488 at 3/31/22 9:18 PM: ---

[jira] [Comment Edited] (LUCENE-10488) Optimize Facets#getTopDims across Facets implementations

2022-03-31 Thread Yuting Gan (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515594#comment-17515594 ] Yuting Gan edited comment on LUCENE-10488 at 3/31/22 9:18 PM: ---

[jira] [Commented] (LUCENE-10488) Optimize Facets#getTopDims across Facets implementations

2022-03-31 Thread Yuting Gan (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515594#comment-17515594 ] Yuting Gan commented on LUCENE-10488: - Hi [~gsmiller] , I have brought the optimiz

[GitHub] [lucene] zhaih commented on a change in pull request #762: LUCENE-10482 Allow users to create their own DirectoryTaxonomyReaders with empty taxoArrays instead of letting the taxoEpoch decide

2022-03-31 Thread GitBox
zhaih commented on a change in pull request #762: URL: https://github.com/apache/lucene/pull/762#discussion_r840006141 ## File path: lucene/facet/src/java/org/apache/lucene/facet/taxonomy/directory/DirectoryTaxonomyReader.java ## @@ -78,10 +78,20 @@ private volatile Taxonom

[GitHub] [lucene] Yuti-G opened a new pull request #779: LUCENE-10488: Optimize Facets#getTopDims in IntTaxonomyFacets

2022-03-31 Thread GitBox
Yuti-G opened a new pull request #779: URL: https://github.com/apache/lucene/pull/779 # Description This change overrides and optimizes the default implementation of getTopDims in IntTaxonomyFacets which is extended by FastTaxonomyFacetCounts and TaxonomyFacetSumIntAssociations.

[jira] [Comment Edited] (LUCENE-10470) Unable to Tessellate polygon

2022-03-31 Thread Yixun Xu (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515567#comment-17515567 ] Yixun Xu edited comment on LUCENE-10470 at 3/31/22 8:07 PM:

[GitHub] [lucene] yixunx edited a comment on pull request #756: LUCENE-10470: [Tessellator] Prevent bridges that introduce collinear edges

2022-03-31 Thread GitBox
yixunx edited a comment on pull request #756: URL: https://github.com/apache/lucene/pull/756#issuecomment-1085046782 Thank you for fixing this @iverase. However I just ran into the same Tessellation error using this branch on a different polygon (repro [here](https://github.com/iverase/luc

[jira] [Updated] (LUCENE-10470) Unable to Tessellate polygon

2022-03-31 Thread Yixun Xu (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yixun Xu updated LUCENE-10470: -- Attachment: image-2022-03-31-16-06-33-051.png > Unable to Tessellate polygon > --

[jira] [Commented] (LUCENE-10470) Unable to Tessellate polygon

2022-03-31 Thread Yixun Xu (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515567#comment-17515567 ] Yixun Xu commented on LUCENE-10470: --- As mentioned in Github comment I ran into anothe

[jira] [Updated] (LUCENE-10470) Unable to Tessellate polygon

2022-03-31 Thread Yixun Xu (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yixun Xu updated LUCENE-10470: -- Attachment: polygon2.geojson > Unable to Tessellate polygon > > >

[GitHub] [lucene] yixunx commented on pull request #756: LUCENE-10470: [Tessellator] Prevent bridges that introduce collinear edges

2022-03-31 Thread GitBox
yixunx commented on pull request #756: URL: https://github.com/apache/lucene/pull/756#issuecomment-1085046782 Thank you for fixing this @iverase. However I just ran into the same Tessellation error using this branch on a different polygon (repro [here](https://github.com/iverase/lucene/com

[GitHub] [lucene] mcimadamore edited a comment on pull request #518: Initial rewrite of MMapDirectory for JDK-18 preview (incubating) Panama APIs (>= JDK-18-ea-b26)

2022-03-31 Thread GitBox
mcimadamore edited a comment on pull request #518: URL: https://github.com/apache/lucene/pull/518#issuecomment-1084961727 > Thanks for your great explanation. Makes a lot of sense. > > Do you know if the `MAP_SHARED` flag is set for mapped memory segments? I guess this means that eve

[GitHub] [lucene] mcimadamore commented on pull request #518: Initial rewrite of MMapDirectory for JDK-18 preview (incubating) Panama APIs (>= JDK-18-ea-b26)

2022-03-31 Thread GitBox
mcimadamore commented on pull request #518: URL: https://github.com/apache/lucene/pull/518#issuecomment-1084961727 > Thanks for your great explanation. Makes a lot of sense. > > Do you know if the `MAP_SHARED` flag is set for mapped memory segments? I guess this means that even if I'

[GitHub] [lucene] reta commented on a change in pull request #749: LUCENE-10466: IndexSortSortedNumericDocValuesRangeQuery unconditionally assumes the usage of the LONG-encoded SortField

2022-03-31 Thread GitBox
reta commented on a change in pull request #749: URL: https://github.com/apache/lucene/pull/749#discussion_r839872948 ## File path: lucene/sandbox/src/java/org/apache/lucene/sandbox/search/IndexSortSortedNumericDocValuesRangeQuery.java ## @@ -318,6 +326,17 @@ private static Va

[GitHub] [lucene] reta commented on pull request #749: LUCENE-10466: IndexSortSortedNumericDocValuesRangeQuery unconditionally assumes the usage of the LONG-encoded SortField

2022-03-31 Thread GitBox
reta commented on pull request #749: URL: https://github.com/apache/lucene/pull/749#issuecomment-1084917861 @jtibshirani thanks a lot for guiding me through, I believe all your comments have been addressed, thanks again! -- This is an automated message from the Apache Git Service. To res

[GitHub] [lucene] jtibshirani commented on a change in pull request #749: LUCENE-10466: IndexSortSortedNumericDocValuesRangeQuery unconditionally assumes the usage of the LONG-encoded SortField

2022-03-31 Thread GitBox
jtibshirani commented on a change in pull request #749: URL: https://github.com/apache/lucene/pull/749#discussion_r839809386 ## File path: lucene/core/src/java/org/apache/lucene/search/SortedNumericSortField.java ## @@ -62,7 +62,9 @@ public SortedNumericSortField(String field,

[GitHub] [lucene] JohannesLichtenberger commented on pull request #518: Initial rewrite of MMapDirectory for JDK-18 preview (incubating) Panama APIs (>= JDK-18-ea-b26)

2022-03-31 Thread GitBox
JohannesLichtenberger commented on pull request #518: URL: https://github.com/apache/lucene/pull/518#issuecomment-1084809944 Thanks for your great explanation. Makes a lot of sense. Do you know if the `MAP_SHARED` flag is set for mapped memory segments? I guess this means that even i

[GitHub] [lucene] uschindler edited a comment on pull request #518: Initial rewrite of MMapDirectory for JDK-18 preview (incubating) Panama APIs (>= JDK-18-ea-b26)

2022-03-31 Thread GitBox
uschindler edited a comment on pull request #518: URL: https://github.com/apache/lucene/pull/518#issuecomment-1084758218 Thanks @rmuir for the clirifaction. To add, because also @mcimadamore asked: We use shared segments, because we only allocate and map the segment once. It is then

[GitHub] [lucene] uschindler edited a comment on pull request #518: Initial rewrite of MMapDirectory for JDK-18 preview (incubating) Panama APIs (>= JDK-18-ea-b26)

2022-03-31 Thread GitBox
uschindler edited a comment on pull request #518: URL: https://github.com/apache/lucene/pull/518#issuecomment-1084758218 Thanks @rmuir for the clirifaction. To add, because also @mcimadamore asked: We use shared segments, because we only allocate and map the segment once. It is then

[GitHub] [lucene] uschindler commented on pull request #518: Initial rewrite of MMapDirectory for JDK-18 preview (incubating) Panama APIs (>= JDK-18-ea-b26)

2022-03-31 Thread GitBox
uschindler commented on pull request #518: URL: https://github.com/apache/lucene/pull/518#issuecomment-1084758218 Thanks @rmuir for the clirifaction. To add, because also @mcimadamore asked: We use shared segments, because we only allocate and map the segment once. It is then used by

[jira] [Updated] (LUCENE-10315) Speed up BKD leaf block ids codec by a 512 ints ForUtil

2022-03-31 Thread Ignacio Vera (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ignacio Vera updated LUCENE-10315: -- Attachment: cpu_profile_path.html cpu_profile_baseline.html Status: Re

[jira] [Commented] (LUCENE-10315) Speed up BKD leaf block ids codec by a 512 ints ForUtil

2022-03-31 Thread Ignacio Vera (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515397#comment-17515397 ] Ignacio Vera commented on LUCENE-10315: --- I have been looking into this issue and

[GitHub] [lucene] JohannesLichtenberger commented on pull request #518: Initial rewrite of MMapDirectory for JDK-18 preview (incubating) Panama APIs (>= JDK-18-ea-b26)

2022-03-31 Thread GitBox
JohannesLichtenberger commented on pull request #518: URL: https://github.com/apache/lucene/pull/518#issuecomment-1084634009 Oh right, thanks. That's the big difference. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

[GitHub] [lucene] rmuir commented on pull request #518: Initial rewrite of MMapDirectory for JDK-18 preview (incubating) Panama APIs (>= JDK-18-ea-b26)

2022-03-31 Thread GitBox
rmuir commented on pull request #518: URL: https://github.com/apache/lucene/pull/518#issuecomment-1084630849 quoting from Uwe: > All files are write-once so there are no updates to files which were written before. This is the key piece that I think you are missing. We write fi

[GitHub] [lucene] JohannesLichtenberger edited a comment on pull request #518: Initial rewrite of MMapDirectory for JDK-18 preview (incubating) Panama APIs (>= JDK-18-ea-b26)

2022-03-31 Thread GitBox
JohannesLichtenberger edited a comment on pull request #518: URL: https://github.com/apache/lucene/pull/518#issuecomment-1084586608 So, if the writer adds something to the lucene index (not via mmap) new index readers will create a new `IndexInput` with new mapped memory segments plus clon

[GitHub] [lucene] JohannesLichtenberger commented on pull request #518: Initial rewrite of MMapDirectory for JDK-18 preview (incubating) Panama APIs (>= JDK-18-ea-b26)

2022-03-31 Thread GitBox
JohannesLichtenberger commented on pull request #518: URL: https://github.com/apache/lucene/pull/518#issuecomment-1084586608 So, if the writer adds something to the lucene index (not via mmap) new index readers will create a new `IndexInput` with new mapped memory segments plus clones with

[GitHub] [lucene] uschindler edited a comment on pull request #518: Initial rewrite of MMapDirectory for JDK-18 preview (incubating) Panama APIs (>= JDK-18-ea-b26)

2022-03-31 Thread GitBox
uschindler edited a comment on pull request #518: URL: https://github.com/apache/lucene/pull/518#issuecomment-1084563803 > I'm working on a similar approach for my data store, but I'm currently not sure if it's a good idea for multiple readers plus a single reader/writer to map a segment f

[GitHub] [lucene] uschindler commented on pull request #518: Initial rewrite of MMapDirectory for JDK-18 preview (incubating) Panama APIs (>= JDK-18-ea-b26)

2022-03-31 Thread GitBox
uschindler commented on pull request #518: URL: https://github.com/apache/lucene/pull/518#issuecomment-1084563803 > I'm working on a similar approach for my data store, but I'm currently not sure if it's a good idea for multiple readers plus a single reader/writer to map a segment for each

[GitHub] [lucene] JohannesLichtenberger commented on pull request #518: Initial rewrite of MMapDirectory for JDK-18 preview (incubating) Panama APIs (>= JDK-18-ea-b26)

2022-03-31 Thread GitBox
JohannesLichtenberger commented on pull request #518: URL: https://github.com/apache/lucene/pull/518#issuecomment-1084507956 I'm working on a similar approach for my data store, but I'm currently not sure if it's a good idea for multiple readers plus a single reader/writer to map a segment

[GitHub] [lucene] wjp719 commented on pull request #731: LUCENE-10456: Implement Weight#count for MultiRangeQuery

2022-03-31 Thread GitBox
wjp719 commented on pull request #731: URL: https://github.com/apache/lucene/pull/731#issuecomment-1084419272 @jpountz I have added the CHANGES log, thanks -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [lucene] wjp719 commented on a change in pull request #731: LUCENE-10456: Implement Weight#count for MultiRangeQuery

2022-03-31 Thread GitBox
wjp719 commented on a change in pull request #731: URL: https://github.com/apache/lucene/pull/731#discussion_r839468461 ## File path: lucene/sandbox/src/java/org/apache/lucene/sandbox/search/MultiRangeQuery.java ## @@ -314,6 +388,36 @@ public Scorer scorer(LeafReaderContext co

[GitHub] [lucene] jpountz commented on pull request #731: LUCENE-10456: Implement Weight#count for MultiRangeQuery

2022-03-31 Thread GitBox
jpountz commented on pull request #731: URL: https://github.com/apache/lucene/pull/731#issuecomment-1084332206 Can you also add a CHANGES entry under version 9.2? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the UR

[GitHub] [lucene] jpountz commented on a change in pull request #731: LUCENE-10456: Implement Weight#count for MultiRangeQuery

2022-03-31 Thread GitBox
jpountz commented on a change in pull request #731: URL: https://github.com/apache/lucene/pull/731#discussion_r839388337 ## File path: lucene/sandbox/src/java/org/apache/lucene/sandbox/search/MultiRangeQuery.java ## @@ -314,6 +388,36 @@ public Scorer scorer(LeafReaderContext c

[GitHub] [lucene] jpountz commented on a change in pull request #767: LUCENE-10436: Deprecate DocValuesFieldExistsQuery, NormsFieldExistsQuery and KnnVectorFieldExistsQuery with FieldExistsQuery

2022-03-31 Thread GitBox
jpountz commented on a change in pull request #767: URL: https://github.com/apache/lucene/pull/767#discussion_r839380353 ## File path: lucene/core/src/java/org/apache/lucene/search/DocValuesFieldExistsQuery.java ## @@ -31,42 +28,21 @@ /** * A {@link Query} that matches docu

[jira] [Commented] (LUCENE-10485) Limit o.a.l.a.morph package visibility only to kuromoji and nori

2022-03-31 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515173#comment-17515173 ] Dawid Weiss commented on LUCENE-10485: -- I think they make a lot of sense and it'd

[jira] [Commented] (LUCENE-10485) Limit o.a.l.a.morph package visibility only to kuromoji and nori

2022-03-31 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515155#comment-17515155 ] Tomoko Uchida commented on LUCENE-10485: Thanks for the pointer. In practice, m

[jira] [Commented] (LUCENE-10485) Limit o.a.l.a.morph package visibility only to kuromoji and nori

2022-03-31 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515128#comment-17515128 ] Dawid Weiss commented on LUCENE-10485: -- I already explored that back here: https:

[jira] [Commented] (LUCENE-10485) Limit o.a.l.a.morph package visibility only to kuromoji and nori

2022-03-31 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515122#comment-17515122 ] Tomoko Uchida commented on LUCENE-10485: If my understanding is correct, we nee