Re: Question about the performance of Lucene99PostingsFormat

2024-09-16 Thread Rui Wu
Dear Adrien, We found that the regression of match-all is not caused by the PostingList format, and instead it's caused by MaxScoreBulkScorer class. Let me create a new email thread about it since the tile of this email thread is N/A anymore. On Wed, Sep 11, 2024 at 6:24 PM Rui Wu wrote: > Than

Re: Question about the performance of Lucene99PostingsFormat

2024-09-11 Thread Rui Wu
Thanks for your prompt reply! On Tue, Sep 10, 2024 at 1:38 PM Adrien Grand wrote: > Can you clarify what you refer to by match-all and match-many queries? > Lucene's MatchAllDocsQuery should not be impacted since it doesn't use > postings for evaluation. > match-all refers to a query that hits a

Re: Question about the performance of Lucene99PostingsFormat

2024-09-10 Thread Adrien Grand
Can you clarify what you refer to by match-all and match-many queries? Lucene's MatchAllDocsQuery should not be impacted since it doesn't use postings for evaluation. Since FOR is a bit less space-efficient than PFOR, I guess it could be a bit slower if your Directory abstraction was a bit slow at

Question about the performance of Lucene99PostingsFormat

2024-09-10 Thread Rui Wu
Dear experts, I have a question about the following change: The Lucene9.11 changed the Posting list format (Lucene GITHUB#12696 : Change Postings back to using FOR in Lucene99PostingsFormat. Freqs, positions and offset keep using PFOR) However, in our