mikemccand commented on PR #12526: URL: https://github.com/apache/lucene/pull/12526#issuecomment-1881088881
> > Maybe we should add OrHighVeryLow to nightly benchy too? > > @mikemccand I started looking into this, but my enwiki (`enwiki-20120502-lines-with-random-label.txt`) seems to have slightly different frequencies compared to frequencies reported in wikinightly.tasks, are nightly benchmarks using the same export or a different one? I think it could make sense to have two new tasks `OrHighLow110` where the low-frequency term always has a frequency of 110 >k and `OrHighLow90` where the low-frequency term always has a frequency of 90<k. These two cases are interesting because in one case it takes very long to collect `k` matches of the highest scoring clause, and in the other case this never happens. Very late answer (sorry!): hmm indeed the frequencies reported in those task files (as comments) are likely from a different (older?) enwiki snapshot. It looks like you muscled through this and added the new atsks to nightly tasks, thanks! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org