jpountz commented on PR #12526:
URL: https://github.com/apache/lucene/pull/12526#issuecomment-1699741992
For reference, here are a few tasks I added to see how this change plays
with disjunctions that have more terms or different document frequencies:
```
OrHighVeryLow: 2005 mousehole # freq=835460 freq=123
OrHighVeryLow: until motorboats # freq=425389 freq=128
OrHighVeryLow: made monceau # freq=742313 freq=126
OrHighVeryLow: do bush's # freq=511178 freq=2681
OrHighVeryLow: 10 mikup # freq=918339 freq=119
OrHighMedLow: international chris valois
OrHighMedLow: right million universalist
OrHighMedLow: known created forays
OrHighMedLow: its universal bush's
OrHighMedLow: 9 network racedetail.html
OrHighHighHigh: 2005 until made
OrHighHighHigh: do 10 international
OrHighHighHigh: right known its
OrHighHighHigh: until 10 known
OrHighHighHigh: made international its
OrHighMedMed: international chris million
OrHighMedMed: right million created
OrHighMedMed: known created universal
OrHighMedMed: its universal network
OrHighMedMed: 9 network chris
OrHighHighLow: several following valois
OrHighHighLow: publisher end universalist
OrHighHighLow: 2009 film forays
OrHighHighLow: http known bush's
OrHighHighLow: south county racedetail.html
OrHighHighMed: international right million
OrHighHighMed: right known created
OrHighMighMed: known its universal
OrHighHighMed: its 9 network
OrHighHighMed: 9 international chris
```
```
Task            QPS baseline  StdDev  QPS my_modified_version  StdDev  Pct diff                  p-value
OrHighMedMed          158.53  (3.6%)                   155.92  (4.4%)     -1.7% (  -9% -    6%)    0.193
OrHighHighHigh         53.97  (5.0%)                    53.13  (4.9%)     -1.6% ( -10% -    8%)    0.324
OrHighHighMed         106.81  (4.0%)                   105.37  (4.3%)     -1.3% (  -9% -    7%)    0.306
OrHighHigh             64.42  (5.6%)                    63.64  (4.0%)     -1.2% ( -10% -    8%)    0.433
OrHighMighMed         201.12  (3.7%)                   198.74  (3.5%)     -1.2% (  -8% -    6%)    0.298
OrHighMedLow          323.10  (3.7%)                   319.32  (4.2%)     -1.2% (  -8% -    6%)    0.349
OrHighMed             227.13  (3.9%)                   225.41  (3.0%)     -0.8% (  -7% -    6%)    0.487
HighTerm              652.70  (4.2%)                   659.51  (5.3%)      1.0% (  -8% -   11%)    0.491
PKLookup              248.57  (3.4%)                   251.38  (1.9%)      1.1% (  -4% -    6%)    0.198
MedTerm              1060.67  (4.5%)                  1076.33  (5.4%)      1.5% (  -8% -   11%)    0.350
LowTerm              1639.65  (7.0%)                  1667.48  (4.9%)      1.7% (  -9% -   14%)    0.377
OrHighVeryLow         172.35  (8.2%)                   196.54  (8.4%)     14.0% (  -2% -   33%)    0.000
OrHighHighLow         449.76  (3.0%)                   633.61  (3.5%)     40.9% (  33% -   48%)    0.000
OrHighLow             546.08  (5.4%)                  1187.98  (5.1%)    117.5% ( 101% -  135%)    0.000
```
While it tends to help queries that are already fast, it also helped
OrHighVeryLow above, which is not among the fastest. I also like that none of
the queries is getting a major slowdown.
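
For anyone double-checking the numbers, here is a minimal sketch of how the "Pct diff" column relates to the two QPS columns, assuming it is the relative change of the modified version over the baseline. The names `qps` and `pct_diff` are only illustrative and are not taken from the benchmark tooling; the QPS values are copied from the three rows with a p-value of 0.000.

```python
# QPS figures copied from the results table: (baseline, my_modified_version).
qps = {
    "OrHighVeryLow": (172.35, 196.54),
    "OrHighHighLow": (449.76, 633.61),
    "OrHighLow": (546.08, 1187.98),
}


def pct_diff(baseline: float, candidate: float) -> float:
    """Relative change of candidate over baseline, in percent (assumed definition)."""
    return 100.0 * (candidate - baseline) / baseline


for task, (base, cand) in qps.items():
    print(f"{task}: {pct_diff(base, cand):+.1f}%")

# OrHighVeryLow: +14.0%
# OrHighHighLow: +40.9%
# OrHighLow: +117.5%
```

These match the reported percentages for those three tasks. The sketch does not try to reproduce the ranges in parentheses or the p-values.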