I've uploaded a fix in https://github.com/apache/lucene/pull/11861 (thanks
Mike for the review!). If there are no objections, I plan to merge it
tomorrow and then get started on a 9.4.1 release candidate.

Julie

On Tue, Oct 18, 2022 at 2:52 PM Michael Sokolov <msoko...@gmail.com> wrote:

> Oh no! Very sorry -- thank you for volunteering to fix it (hangs head in
> shame). I guess I'll see where the bug is soon ...
>
> On Tue, Oct 18, 2022 at 2:50 PM Michael Wechner
> <michael.wech...@wyona.com> wrote:
> >
> > +1 :-)
> >
> > Thanks
> >
> > Michael
> >
> > On 18.10.22 at 19:52, Julie Tibshirani wrote:
> > > Hi everyone,
> > >
> > > We recently discovered a severe bug in the 9.4 release in the kNN
> > > vectors format: https://github.com/apache/lucene/issues/11858.
> > > To explain the problem: when ingesting a lot of data, or when
> > > performing a force merge, segments can grow large. The format
> > > validation code accidentally uses an int instead of a long to compute
> > > the data size, so it can fail on these large segments. When format
> > > validation fails, the segment is essentially lost and unusable. For
> > > some client systems like Elasticsearch, it can send the whole index
> > > into a "failed" state, blocking further writes or searches.
> > >
> > > I think this bug is sufficiently bad that we should perform a 9.4.1
> > > release as soon as possible. The fix is just an update to the
> > > read-side validation code; there won't be any effect on the data
> > > format. This means it is safe to merge the fix into the existing 9.4
> > > vectors format. The bug was introduced during the work to add
> > > quantization (https://github.com/apache/lucene/pull/1054) and does not
> > > affect versions before 9.4.
> > >
> > > Let me know what you think! I could serve as release manager. (We
> > > should also follow up with a plan to prevent this from happening in
> > > the future -- maybe we need to regularly run larger-scale benchmarks?)
> > >
> > > Julie
> >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
> > For additional commands, e-mail: dev-h...@lucene.apache.org
> >
>
>
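
For readers following the thread, the int-versus-long failure described in
the quoted announcement can be illustrated with a minimal Java sketch. This
is not the actual Lucene validation code: the class name, the method names,
and the vector count and dimension below are hypothetical, chosen only so
that the product exceeds Integer.MAX_VALUE.

```java
public class VectorDataSizeSketch {

    // Hypothetical buggy variant: every operand is an int, so the product
    // is computed in 32-bit arithmetic and silently wraps around before it
    // is widened to long for the return value.
    static long sizeWithIntMath(int numVectors, int dimension, int bytesPerValue) {
        return numVectors * dimension * bytesPerValue;
    }

    // Hypothetical fixed variant: casting the first operand to long forces
    // the whole product to be computed in 64-bit arithmetic.
    static long sizeWithLongMath(int numVectors, int dimension, int bytesPerValue) {
        return (long) numVectors * dimension * bytesPerValue;
    }

    public static void main(String[] args) {
        // Assumed example values, not taken from the bug report.
        int numVectors = 3_000_000;
        int dimension = 768;
        int bytesPerValue = Float.BYTES; // 4 bytes per float32 component

        long wrapped = sizeWithIntMath(numVectors, dimension, bytesPerValue);
        long correct = sizeWithLongMath(numVectors, dimension, bytesPerValue);

        System.out.println("int arithmetic:  " + wrapped);
        System.out.println("long arithmetic: " + correct);
        System.out.println("sizes agree:     " + (wrapped == correct));
    }
}
```

With these assumed inputs the two methods disagree, so a read-side check that
compares the int-computed size against the real length of the vector data
would reject a perfectly valid segment. Only the reader's arithmetic changes
between the two variants, which is consistent with the statement above that
the fix does not affect the data format.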
