nit: deliverables -> derivables. On Mon, May 4, 2026 at 2:33 PM Kamal Chandraprakash < [email protected]> wrote:
> Hi Jian, > > I went over the PR: https://github.com/apache/kafka/pull/20913 and > provided first pass comments. > Shall we avoid the config value deliverables? > > 1. deriving the value of `remote.copy.lag.ms` from `local.retention.ms` > and > 2. deriving the value of `remote.copy.lag.bytes` from > `local.retention.bytes`. > > The remote copy lag config can be configured at both broker/topic level > which should be sufficient. > Derivable config values are not clear for the user and add to the > operational complexity. > > Thanks, > Kamal > > On Wed, Mar 18, 2026 at 4:57 PM jian fu <[email protected]> wrote: > >> Hi Chia-Ping: >> >> Thanks for your review and comments. >> Q1: Is this broker-level configuration dynamic? >> The configures are broker-level configuration dynamic similar as others . >> I >> already updated the KIP content. Thanks for your reminder. >> Q2: should we add a metric to track the 'delayed size'? >> Currently, we do have a way to measure how much delay there is. Although >> it’s not very convenient, Thus I think in most cases there isn’t a need to >> continuously monitor this delay . In addition, we can leverage the API >> introduced in KIP-1187 (Support to retrieve remote log size via >> DescribeLogDirs RPC) to query it directly in future. So, given that this >> requirement is not particularly urgent, I think we can hold off on adding >> a >> metric for now. >> >> Thanks for your comments! >> >> Regards >> Jian >> >> >> Chia-Ping Tsai <[email protected]> 于2026年3月18日周三 18:43写道: >> >> > hi Jian >> > >> > sorry for late review. There are some questions below. >> > >> > Is this broker-level configuration dynamic? We should clarify this in >> the >> > KIP. Also, should we add a metric to track the 'delayed size'? >> > >> > Best, >> > Chia-Ping >> > >> > On 2025/11/19 13:29:11 jian fu wrote: >> > > Hi everyone, I'd like to start a discussion on KIP-1241, the goal is >> to >> > > reduce the remote storage. KIP: >> > > >> > >> https://cwiki.apache.org/confluence/display/KAFKA/KIP-1241%3A+Reduce+tiered+storage+redundancy+with+delayed+upload >> > > >> > > The Draft PR: https://github.com/apache/kafka/pull/20913 >> Problem: >> > > Currently, >> > > Kafka's tiered storage implementation uploads all non-active local log >> > > segments to remote storage immediately, even when they are still >> within >> > the >> > > local retention period. >> > > This results in redundant storage of the same data in both local and >> > remote >> > > tiers. >> > > >> > > When there is no requirement for real-time analytics or immediate >> > > consumption based on remote storage. It has the following drawbacks: >> > > >> > > 1. Wastes storage capacity and costs: The same data is stored twice >> > during >> > > the local retention window >> > > 2. Provides no immediate benefit: During the local retention period, >> > reads >> > > prioritize local data, making the remote copy unnecessary >> > > >> > > >> > > So. this KIP is to reduce tiered storage redundancy with delayed >> upload. >> > > You can check the test result example here directly: >> > > https://github.com/apache/kafka/pull/20913#issuecomment-3547156286 >> > > Looking forward to your feedback! Best regards, Jian >> > > >> > >> >
