romseygeek opened a new issue, #15884:
URL: https://github.com/apache/lucene/issues/15884

   ### Description
   
   When Lucene90DocValuesConsumer writes a numeric doc values field, it
iterates over the values multiple times: once to write skippers if they are
configured, once to get minimum and maximum values for block encoding, and once
to write the values. SortedNumeric repeats this, and then iterates again to
write addresses if there are documents with multiple values. Sorted and
SortedSet re-use the numeric values machinery to write their ords, and
SortedSet also adds an iteration over all values to check whether it can fall
back to just Sorted. BinaryDocValues iterates over everything to get minimum
and maximum lengths before writing values out.
   
   We should be able to use skipper metadata (where it exists) to avoid at 
least some of these multiple iterations, especially if we add more information 
such as maxValuesPerDoc.
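
   As a rough illustration of the idea (not Lucene code), the sketch below
gathers min, max, document count and maxValuesPerDoc in a single pass, then
answers the later questions from those statistics instead of re-iterating.
The class `DocValuesStatsSketch`, the nested `FieldStats` type and the
`collect` helper are all hypothetical names invented for this example, and the
input is a plain in-memory list rather than a real doc values iterator.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical sketch only; none of these names are Lucene APIs. */
final class DocValuesStatsSketch {

  /** Statistics that a skipper-like pass could record once per field (assumed shape). */
  static final class FieldStats {
    final long min;
    final long max;
    final int docCount;
    final long valueCount;
    final int maxValuesPerDoc;

    FieldStats(long min, long max, int docCount, long valueCount, int maxValuesPerDoc) {
      this.min = min;
      this.max = max;
      this.docCount = docCount;
      this.valueCount = valueCount;
      this.maxValuesPerDoc = maxValuesPerDoc;
    }

    /** A multi-valued field needs per-document addresses only if some doc has more than one value. */
    boolean needsAddresses() {
      return maxValuesPerDoc > 1;
    }

    /** A set-like field can be treated as single-valued when no doc has more than one value. */
    boolean singleValued() {
      return maxValuesPerDoc <= 1;
    }
  }

  /** One pass over all documents; each array holds the values of one document (empty = no value). */
  static FieldStats collect(List<long[]> valuesPerDoc) {
    long min = Long.MAX_VALUE;
    long max = Long.MIN_VALUE;
    int docCount = 0;
    long valueCount = 0;
    int maxValuesPerDoc = 0;
    for (long[] values : valuesPerDoc) {
      if (values.length == 0) {
        continue; // document has no value for this field
      }
      docCount++;
      valueCount += values.length;
      maxValuesPerDoc = Math.max(maxValuesPerDoc, values.length);
      for (long v : values) {
        min = Math.min(min, v);
        max = Math.max(max, v);
      }
    }
    return new FieldStats(min, max, docCount, valueCount, maxValuesPerDoc);
  }

  public static void main(String[] args) {
    List<long[]> docs = Arrays.asList(new long[] {5, 9}, new long[] {}, new long[] {-3});
    FieldStats stats = collect(docs);
    System.out.println("min=" + stats.min + " max=" + stats.max
        + " needsAddresses=" + stats.needsAddresses());
  }
}
```

   With statistics like these available up front, the min/max pass for block
encoding, the "are there multi-valued docs" pass and the SortedSet fallback
check could in principle be answered without touching the values again; how
much of this the real skipper metadata can cover is exactly what this issue
leaves open.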

