Re: Should IndexWriter.flush return seqNo?

2023-04-26 Thread Patrick Zhai
> Patrick maybe you had an interesting use case in mind?

I had one, but later on I found out that I don't necessarily use flush to
achieve that so it's not really a valid use case that definitely need
flush...

On Tue, Apr 25, 2023 at 7:26 PM Ishan Chattopadhyaya <
ichattopadhy...@gmail.com> wrote:

> I think Apache Solr could explore leveraging the returned sequence number
> for its transaction logs.
>
> On Tue, 25 Apr 2023 at 18:36, Michael McCandless <
> luc...@mikemccandless.com> wrote:
>
>> On Sun, Apr 23, 2023 at 6:19 AM Uwe Schindler  wrote:
>>
>> Having the sequence number public in API does not bring any benefit, as
>>> you cannot use it for anything.
>>>
>>
>> Actually there are some interesting use cases for sequence numbers:
>>
>> They enable the caller to know the effective order of operations of
>> concurrent indexing events.  This can be useful for applications that might
>> sometimes update the same document at the same time across threads to
>> implement optimistic concurrency to re-index the same document if the order
>> was not correct according to the applications external version tracking for
>> out-of-order updates.  OpenSearch has an array of locks to implement
>> pessimistic concurrency (ensuring the that same id is never updated
>> concurrently) but for cases where the conflicts are rare, the optimistic
>> implementation based on Lucene's sequence numbers is likely more efficient.
>>
>> Another use case is precise indexing operation replay (e.g. from a
>> Kinesis queue or transaction log or whatever) on recovering from a commit
>> point: upon commit, you know which precise indexing event was captured in
>> the commit, and on recovering you can resume indexing from precisely the
>> next indexing event.  This doesn't matter for idempotent updates, but, for
>> other cases like append only, it is useful and performant.
>>
>> I also don't see why flush should return a sequence number -- it is not
>> an externally visible event.  Patrick maybe you had an interesting use case
>> in mind?  Note that commit also writes (and fsyncs) the next segments_N
>> file, to light all the newly written/fsync'd segments for the next reader
>> to open.
>>
>> Mike McCandless
>>
>> http://blog.mikemccandless.com
>>
>>


New branch and feature freeze for Lucene 9.6.0

2023-04-26 Thread Alan Woodward
NOTICE:

Branch branch_9_6 has been cut and versions updated to 9.7 on stable branch.

Please observe the normal rules:

* No new features may be committed to the branch.
* Documentation patches, build patches and serious bug fixes may be
 committed to the branch. However, you should submit all patches you
 want to commit as pull requests first to give others the chance to review
 and possibly vote against them. Keep in mind that it is our
 main intention to keep the branch as stable as possible.
* All patches that are intended for the branch should first be committed
 to the unstable branch, merged into the stable branch, and then into
 the current release branch.
* Normal unstable and stable branch development may continue as usual.
 However, if you plan to commit a big change to the unstable branch
 while the branch feature freeze is in effect, think twice: can't the
 addition wait a couple more days? Merges of bug fixes into the branch
 may become more difficult.
* Only Github issues with Milestone 9.6
 and priority "Blocker" will delay a release candidate build.


I am struggling to find the lucene Jenkins jobs on the new apache build server 
at https://jenkins-ccos.apache.org/ - if anybody has any hints as to how to 
navigate the helpful new interface with a non-functional search box, I would be 
very grateful…

It’s a holiday weekend coming up in the UK, so my plan is to give Jenkins a few 
days to chew things over (once I actually get the jobs running) and then build 
a RC on Tuesday 2nd May.
-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org