romseygeek opened a new issue, #15884: URL: https://github.com/apache/lucene/issues/15884
### Description When Lucene90DocValuesConsumer writes a numeric doc values field, it iterates over the values multiple times - once to write skippers if they are configured, once to get minimum and maximum values for block encoding, once to write the values. SortedNumeric repeats this, and then iterates again to write addresses if there are documents with multiple values. Sorted and SortedSet re-use the numeric values machinery to write their ords, and SortedSet also adds in an iteration over all values to check to see if it can fall back to just Sorted. BinaryDocValues iterates over everything to get minimum and maximum lengths before writing values out. We should be able to use skipper metadata (where it exists) to avoid at least some of these multiple iterations, especially if we add more information such as maxValuesPerDoc. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
