Re: Bugfix release 9.11.1

2024-06-21 Thread Stefan Vodita
The fix is now in main, branch_9x, and branch_9_11. On Thu, 20 Jun 2024 at 14:17, Stefan Vodita wrote: > Thank you Ignacio for handling the release! > > I've just updated the PR with the fix [1]. > I can push it tomorrow. > > > Stefan > > [1] https://github.c

Re: Bugfix release 9.11.1

2024-06-20 Thread Stefan Vodita
Thank you Ignacio for handling the release! I've just updated the PR with the fix [1]. I can push it tomorrow. Stefan [1] https://github.com/apache/lucene/pull/13494 On Thu, 20 Jun 2024 at 08:36, Ignacio Vera wrote: > I am now preparing for a bugfix release from branch branch_9_11. I am >

Do we need a 9.11.1 release?

2024-06-18 Thread Stefan Vodita
Hi all, I wanted to bring to everyone's attention that we released a bug [1] in StringValueFacetCounts with 9.11. On an empty match-set, instead of returning empty facet results, we throw an NPE. Users can work around this, but obviously it's not ideal. I noticed there is one other issue around

Re: Intellij build/test times

2024-06-07 Thread Stefan Vodita
+1, I had the same problem and it seems better now. Thank you, Dawid! On Thu, 6 Jun 2024 at 12:20, Michael Sokolov wrote: > Oh! TIL! so much better, thanks. And now I have the "Repeat" option > back in the test runner > > On Thu, Jun 6, 2024 at 6:18 AM Dawid Weiss wrote: > > > > > > Don't know

Re: Lucene 9.11

2024-05-29 Thread Stefan Vodita
Ben, I just merged #13414, so it's not a blocker for the release. Thanks again for volunteering to be release manager! Stefan On Tue, 28 May 2024 at 14:58, Benjamin Trent wrote: > Hey y'all, > > I am planning on starting the release process

Re: Improve testing

2024-05-25 Thread Stefan Vodita
Some useful documentation on the gradlew commands: https://github.com/apache/lucene/blob/main/help/workflow.txt On Sat, 25 May 2024 at 19:38, Stefan Vodita wrote: > I'll add a step in between 1 and 2 that I often forget: ./gradlew tidy > This refactors your code to the style the projec

Re: Improve testing

2024-05-25 Thread Stefan Vodita
I'll add a step in between 1 and 2 that I often forget: ./gradlew tidy This refactors your code to the style the project uses, which we have checks for. On Sat, 25 May 2024 at 00:53, Michael Froh wrote: > Is your new test uncommitted? > > The Gradle check will fail if you have uncommitted

Re: Help running the demo program

2024-04-22 Thread Stefan Vodita
Hi Siddharth, If you happen to be using IntelliJ, you can run a demo class from the IDE. It probably works with other IDEs too, though I haven't tried it. Stefan On Sun, 21 Apr 2024 at 23:59, Siddharth Jain wrote: > Hello, > > I am a new user to Lucene. I checked out the Lucene repo >

Re: [Vote] Bump the Lucene main branch to Java 21

2024-02-23 Thread Stefan Vodita
+1 On Fri, 23 Feb 2024 at 11:24, Chris Hegarty wrote: > Hi, > > Since the discussion on bumping the Lucene main branch to Java 21 is > winding down, let's hold a vote on this important change. > > Once bumped, the next major release of Lucene (whenever that will be) will > require a version of

Re: Welcome Zhang Chao as Lucene committer

2024-02-21 Thread Stefan Vodita
Congratulations, Chao! On Tue, 20 Feb 2024 at 17:28, Adrien Grand wrote: > I'm pleased to announce that Zhang Chao has accepted the PMC's > invitation to become a committer. > > Chao, the tradition is that new committers introduce themselves with a > brief bio. > > Congratulations and welcome!

Re: Announcing githubsearch!

2024-02-20 Thread Stefan Vodita
Thank you Mike, I really like all the facets! One thing that bothered me about GitHub's own search was that it would return different results if I wasn't signed in. Maybe it does early stopping for non-authenticated users? In any case, this won't be a problem with githubsearch. Have you

Re: Request for dev list subscription

2024-02-15 Thread Stefan Vodita
Hi Sandeepan, You can find information about the mailing lists, including how to sign up, at: https://lucene.apache.org/core/discussion.html Let us know if that doesn't work for some reason. Stefan On Thu, 15 Feb 2024 at 20:13, Sandeepan Baidya wrote: > Hi there, > > I would like to be added

Re: [VOTE] Release Lucene 9.10.0 RC1

2024-02-14 Thread Stefan Vodita
+1 SUCCESS! [0:39:49.675498] On Wed, 14 Feb 2024 at 19:30, Adrien Grand wrote: > Please vote for release candidate 1 for Lucene 9.10.0 > > The artifacts can be downloaded from: > > https://dist.apache.org/repos/dist/dev/lucene/lucene-9.10.0-RC1-rev-695c0ac84508438302cd346a812cfa2fdc5a10df > >

Re: [VOTE] Release Lucene 9.9.2 RC1

2024-01-26 Thread Stefan Vodita
+1 SUCCESS! [2:34:29.551907] On Thu, 25 Jan 2024 at 11:57, Chris Hegarty wrote: > Please vote for release candidate 1 for Lucene 9.9.2 > > The artifacts can be downloaded from: > > https://dist.apache.org/repos/dist/dev/lucene/lucene-9.9.2-RC1-rev-a2939784c4ca60bc28bf488b5479c02fc2e5e22c > >

Re: Welcome Stefan Vodita as Lucene committer

2024-01-19 Thread Stefan Vodita
my inconsistent practice habits. Practice will have to wait until I finish up the next PR - looking forward to many more in the future! Cheers, Stefan On Thu, 18 Jan 2024 at 15:56, Michael McCandless wrote: > Hi Team, > > I'm pleased to announce that Stefan Vodita has accepted

Re: Heads up: upcoming GitHub action to mark stale Lucene PRs

2024-01-08 Thread Stefan Vodita
/apache/lucene/actions/runs/7447339092/job/20259452054 [2] https://github.com/apache/lucene/pulls?q=is%3Apr+label%3AStale+is%3Aclosed On Thu, 4 Jan 2024 at 15:47, Stefan Vodita wrote: > There's a flag for excluding draft PRs [1]. I'll add it to the workflow. > If we need more flexi

Re: Heads up: upcoming GitHub action to mark stale Lucene PRs

2024-01-04 Thread Stefan Vodita
). > > Uwe > Am 04.01.2024 um 14:04 schrieb Michael McCandless: > > Hi Team, > > Stefan Vodita made an awesome simple PR adding a GitHub action to remind / > nag us about stale PRs: https://github.com/apache/lucene/pull/12813 > > This happened after an in-person di

Re: Faceting Queries NON-Taxonomy-based

2023-11-16 Thread Stefan Vodita
use an interactive session > > Thanks! > > Tony > > > -----Original Message----- > From: Stefan Vodita > Sent: Tuesday, November 14, 2023 19:14 > To: t...@xfire.io.invalid > Cc: java-user@lucene.apache.org > Subject: Re: Faceting Queries NON-Taxonomy-based >

Re: Faceting Queries NON-Taxonomy-based

2023-11-14 Thread Stefan Vodita
Hi Tony, Have you looked at the faceting demo package [1]? The faceting documentation that lives there [2] was updated last year (thanks @epotyom!) and there haven't been major API changes since. The `SimpleSortedSetFacetsExample` [3] might be a good starting point for what you're trying to do.

Re: Healthy PR Approaches from Apache Beam

2023-11-11 Thread Stefan Vodita
; > Mike McCandless > > http://blog.mikemccandless.com > > > On Tue, Oct 31, 2023 at 5:39 AM Stefan Vodita > wrote: > >> Hi all, >> >> I recently learned a few interesting things that the Beam >> <https://github.com/apache/beam> project does to >

Healthy PR Approaches from Apache Beam

2023-10-31 Thread Stefan Vodita
Hi all, I recently learned a few interesting things that the Beam project does to promote and maintain good interactions on PRs. 1. Community metrics dashboard. The graphs are pretty and insightful.

Re: Computing multiple different aggregations over a match-set in one pass

2023-09-09 Thread Stefan Vodita
aising the idea! > > Cheers, > -Greg > > On Mon, Mar 6, 2023 at 7:21 AM Stefan Vodita > wrote: > > > Hi Greg, > > > > The PR looks great. I think it's a useful feature to have and it helps > > with the > > use-case we were discussing. I left a comment with so

Re: Reindexing leaving behind 0 live doc segments

2023-08-30 Thread Stefan Vodita
Hi Rahul, What you're describing sounds similar to index rearranging [1], although in that case the reindexing is done in a new index. The last commit in the IndexRearranger class added support for reading and reindexing deletes - maybe having a look at that and at the Javadoc would help?

Re: Index ordinal data in the taxonomy

2023-05-27 Thread Stefan Vodita
ork OK-ish?) > > Shai > > On Sat, May 13, 2023 at 6:45 PM Stefan Vodita wrote: >> >> Hello Shai, >> >> Thank you for the feedback! I'll try to answer each of the questions. >> >> > will it change the API in non-backward compatible way, or impa

Re: Index ordinal data in the taxonomy

2023-05-13 Thread Stefan Vodita
affect the taxonomy index. > > Generally though, I don't see a reason not to support it. > > Shai > > On Thu, May 11, 2023 at 1:03 PM Stefan Vodita wrote: >> >> Hi everyone, >> >> I work on the Lucene product search team at Amazon. We’ve been considering

Index ordinal data in the taxonomy

2023-05-11 Thread Stefan Vodita
Hi everyone, I work on the Lucene product search team at Amazon. We’ve been considering indexing scoring signals for ordinals into the taxonomy, which could reduce index size for some use-cases. Example Let's consider a library of research papers, where each paper is represented by a Lucene

Re: Computing multiple different aggregations over a match-set in one pass

2023-03-06 Thread Stefan Vodita
ally convinced I like it > (I think the expression syntax/API is a little wonky), but that's something > we could discuss in an issue. > > https://github.com/apache/lucene/pull/12184 > > Cheers, > -Greg > > On Fri, Feb 24, 2023 at 1:57 PM Stefan Vodita > wrote: > >

Re: Computing multiple different aggregations over a match-set in one pass

2023-02-24 Thread Stefan Vodita
API. Of course, there's > nothing preventing users from building this "multiple association" > functionality themselves. > > That's my take on this, but maybe I'm missing some other use-cases that > could justify adding this capability in a general way? What do you thi

Re: Computing multiple different aggregations over a match-set in one pass

2023-02-17 Thread Stefan Vodita
> Cheers, > -Greg > > On Thu, Feb 16, 2023 at 1:32 PM Stefan Vodita > wrote: > > > Hi Greg, > > > > To better understand how much work gets duplicated, I went ahead > > and modified FloatTaxonomyFacets as an example [1]. It doesn't look > > too p

Re: Computing multiple different aggregations over a match-set in one pass

2023-02-16 Thread Stefan Vodita
sers? But that's just a guess/assumption. > > Cheers, > -Greg > > On Tue, Feb 14, 2023 at 3:19 AM Stefan Vodita > wrote: > > > Hi Greg, > > > > I see now where my example didn’t give enough info. In my mind, `Genre / > > Author nationality / Author name` is s

Re: Computing multiple different aggregations over a match-set in one pass

2023-02-14 Thread Stefan Vodita
erating the hits. If the ordinals are > "packed" into the same field though (which is the default in Lucene if > you're using taxonomy faceting), then you should only need to do a single > iteration over that field. > > Cheers, > -Greg > > On Sat, Feb 11, 2023 at

Re: Computing multiple different aggregations over a match-set in one pass

2023-02-11 Thread Stefan Vodita
> over three different match sets (one constrained by "Mark Twain" as the > author, one constrained by "American authors" and one constrained by the > "sci-fi" genre). Is that correct? > > Cheers, > -Greg > > On Fri, Feb 10, 2023 at 11:33 AM Stefa

Computing multiple different aggregations over a match-set in one pass

2023-02-10 Thread Stefan Vodita
Hi all, Let’s say I have an index of books, similar to the example in the facet demo [1] with a hierarchical facet field encapsulating `Genre / Author’s nationality / Author’s name`. I might like to find the latest publish date of a book written by Mark Twain, the sum of the prices of books
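The idea in the question above can be sketched without any Lucene classes: iterate the match-set once and update several independent aggregations per hit, instead of running one pass per aggregation. This is only an illustration in plain Java. The `Book` type, the `aggregate` method, and the pairing of "sum of prices" with nationality and "count" with genre are assumptions made for the sketch, since the message is truncated; none of this is the Lucene facet API.

```java
import java.time.LocalDate;
import java.util.List;

/** Sketch only: plain Java, no Lucene classes. All names here are hypothetical. */
final class OnePassAggregations {

    /** Hypothetical book document mirroring `Genre / Author's nationality / Author's name`. */
    static final class Book {
        final String genre;
        final String nationality;
        final String author;
        final LocalDate published;
        final double price;

        Book(String genre, String nationality, String author, LocalDate published, double price) {
            this.genre = genre;
            this.nationality = nationality;
            this.author = author;
            this.published = published;
            this.price = price;
        }
    }

    /** Three different aggregations, each tied to a different level of the hierarchy. */
    static final class Result {
        LocalDate latestByAuthor;      // stays null when no hit matches the author
        double priceSumByNationality;  // assumed pairing: sum of prices per nationality
        int countByGenre;              // assumed pairing: hit count per genre
    }

    /** Visits every hit exactly once and updates all three aggregations as it goes. */
    static Result aggregate(List<Book> matchSet, String author, String nationality, String genre) {
        Result r = new Result();
        for (Book b : matchSet) {
            if (b.author.equals(author)
                    && (r.latestByAuthor == null || b.published.isAfter(r.latestByAuthor))) {
                r.latestByAuthor = b.published;
            }
            if (b.nationality.equals(nationality)) {
                r.priceSumByNationality += b.price;
            }
            if (b.genre.equals(genre)) {
                r.countByGenre++;
            }
        }
        return r;
    }
}
```

The cost is one iteration over the hits regardless of how many aggregations are requested, which is the property the thread title asks about; how this would map onto taxonomy ordinals is left to the discussion itself.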

Question about current situation of good first issues in GitHub

2023-01-10 Thread Stefan Vodita
Hello Shunya, As far as I know, GitHub issues are not marked for new developers yet. The project migrated a few months ago from Jira to GitHub issues, so you can still search the old labels in Jira. In particular, there is `newdev` for good starter issues [1]. Hope this helps, Stefan [1]

[jira] [Commented] (LUCENE-10603) Improve iteration of ords for SortedSetDocValues

2022-07-04 Thread Stefan Vodita (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17562230#comment-17562230 ] Stefan Vodita commented on LUCENE-10603: Hi Greg! I thought I'd help out. [Here|https

[jira] [Commented] (LUCENE-10629) Add fastMatchQuery param to MatchingFacetSetCounts

2022-07-01 Thread Stefan Vodita (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17561449#comment-17561449 ] Stefan Vodita commented on LUCENE-10629: This looked like an easy task to try my hand at, so I

[jira] [Updated] (LOG4J2-3452) Race condition in FileUtils.mkdir()

2022-03-29 Thread Stefan Vodita (Jira)
[ https://issues.apache.org/jira/browse/LOG4J2-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stefan Vodita updated LOG4J2-3452: -- Description: I noticed an error like: ERROR Unable to create file

[jira] [Updated] (LOG4J2-3452) Race condition in FileUtils.mkdir()

2022-03-29 Thread Stefan Vodita (Jira)
[ https://issues.apache.org/jira/browse/LOG4J2-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stefan Vodita updated LOG4J2-3452: -- Description: I noticed an error like: {{ERROR Unable to create file

[jira] [Updated] (LOG4J2-3452) Race condition in FileUtils.mkdir()

2022-03-29 Thread Stefan Vodita (Jira)
[ https://issues.apache.org/jira/browse/LOG4J2-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stefan Vodita updated LOG4J2-3452: -- Description: I noticed an error like: {{ERROR Unable to create file

[jira] [Created] (LOG4J2-3452) Race condition in FileUtils.mkdir()

2022-03-29 Thread Stefan Vodita (Jira)
Stefan Vodita created LOG4J2-3452: - Summary: Race condition in FileUtils.mkdir() Key: LOG4J2-3452 URL: https://issues.apache.org/jira/browse/LOG4J2-3452 Project: Log4j 2 Issue Type: Bug
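For readers unfamiliar with this class of bug, here is a minimal sketch of the general check-then-create race that the issue title suggests. It is not Log4j's actual `FileUtils.mkdir()` code, and the class and method names below are hypothetical. The sketch assumes the usual pattern: one thread sees that the directory is missing, another thread creates it in between, and the first thread's `File.mkdirs()` then returns `false` even though the directory now exists.

```java
import java.io.File;
import java.io.IOException;

/** Sketch only: illustrates the general race, not Log4j's implementation. */
final class SafeMkdir {

    /** Racy variant: treats a false return from mkdirs() as a failure. */
    static void mkdirRacy(File dir) throws IOException {
        if (!dir.exists()) {
            // Another thread may create dir right here, between the check and the call.
            if (!dir.mkdirs()) {
                throw new IOException("Could not create directory " + dir.getAbsolutePath());
            }
        }
    }

    /** Safer variant: a false return is only an error if the directory still does not exist. */
    static void mkdirSafe(File dir) throws IOException {
        if (!dir.mkdirs() && !dir.isDirectory()) {
            throw new IOException("Could not create directory " + dir.getAbsolutePath());
        }
    }
}
```

`File.mkdirs()` returns `true` only when this call created the directory, so the safer variant re-checks `isDirectory()` before reporting an error. `java.nio.file.Files.createDirectories` is another option, since it does not fail when the directory already exists.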

[jira] [Commented] (LUCENE-10182) TestRamUsageEstimator asserts trivial equality

2021-10-19 Thread Stefan Vodita (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17430430#comment-17430430 ] Stefan Vodita commented on LUCENE-10182: Might as well fix the assertion order, since it's

[jira] [Commented] (LUCENE-10182) TestRamUsageEstimator asserts trivial equality

2021-10-18 Thread Stefan Vodita (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17430216#comment-17430216 ] Stefan Vodita commented on LUCENE-10182: Oh, I get it now. Thanks for the explanation! I guess

[jira] [Commented] (LUCENE-10182) TestRamUsageEstimator asserts trivial equality

2021-10-18 Thread Stefan Vodita (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17430179#comment-17430179 ] Stefan Vodita commented on LUCENE-10182: I’d be happy to help with this. I saw mention at some

[jira] [Commented] (LUCENE-10182) TestRamUsageEstimator asserts trivial equality

2021-10-18 Thread Stefan Vodita (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17429927#comment-17429927 ] Stefan Vodita commented on LUCENE-10182: I pushed a change renaming {{RamUsageTester.sizeOf

[jira] [Commented] (LUCENE-10182) TestRamUsageEstimator asserts trivial equality

2021-10-15 Thread Stefan Vodita (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17429494#comment-17429494 ] Stefan Vodita commented on LUCENE-10182: [This PR|https://github.com/apache/lucene/pull/386

[jira] [Created] (LUCENE-10182) TestRamUsageEstimator asserts trivial equality

2021-10-15 Thread Stefan Vodita (Jira)
Stefan Vodita created LUCENE-10182: -- Summary: TestRamUsageEstimator asserts trivial equality Key: LUCENE-10182 URL: https://issues.apache.org/jira/browse/LUCENE-10182 Project: Lucene - Core

[jira] [Comment Edited] (LUCENE-10129) Add RamUsageEstimator shallowSizeOf(long[]) overload that just calls sizeOf(long[])?

2021-10-08 Thread Stefan Vodita (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17426103#comment-17426103 ] Stefan Vodita edited comment on LUCENE-10129 at 10/8/21, 10:33 AM

[jira] [Commented] (LUCENE-10129) Add RamUsageEstimator shallowSizeOf(long[]) overload that just calls sizeOf(long[])?

2021-10-08 Thread Stefan Vodita (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17426103#comment-17426103 ] Stefan Vodita commented on LUCENE-10129: Hello! I am new to Lucene, and this looked like