benwtrent commented on PR #15620:
URL: https://github.com/apache/lucene/pull/15620#issuecomment-3886870174
@churromorales AH, I better understand now. logbytesize doesn't:
- Handle the granular time patterns you care about as size doesn't directly
reflect temporal sizes (e.g. merge segments into chunks of 1h instead of merge
X sized segments).
- Segments being flushed next to each other might actually cross your
temporal thresholds (e.g. 1200UTC might span multiple segments, when optimally,
it would be a single segment).
This is indeed interesting.
I haven't reviewed, but I did read your summary. I noticed a lack of
information on handling "deletes and updates". I realize that this might
be...adverse to this type of policy, but it should be mentioned/handled in a
sane way.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]