Greg Harris created KAFKA-15202: ----------------------------------- Summary: MM2 OffsetSyncStore clears too many syncs when sync spacing is variable Key: KAFKA-15202 URL: https://issues.apache.org/jira/browse/KAFKA-15202 Project: Kafka Issue Type: Bug Components: mirrormaker Affects Versions: 3.4.1, 3.5.0, 3.3.3 Reporter: Greg Harris
The spacing between OffsetSyncs can vary significantly, due to conditions in the upstream topic and in the replication rate of the MirrorSourceTask. The OffsetSyncStore attempts to keep a maximal number of distinct syncs present, and for regularly spaced syncs it does not allow an incoming sync to expire more than one other unique sync. There are tests to enforce this property. For variable spaced syncs, there is no such guarantee, because multiple fine-grained syncs may need to be expired at the same time. However, instead of only those fine-grained syncs being expired, the store may also expire coarser-grained syncs. This causes a large decrease in the number of unique syncs. This is an extremely simple example: * Syncs: 0 (start), 1, 2, 4. The result: ``` TRACE New sync OffsetSync\{topicPartition=topic1-2, upstreamOffset=1, downstreamOffset=1} applied, new state is [1:1,0:0] (org.apache.kafka.connect.mirror.OffsetSyncStore:194) TRACE New sync OffsetSync\{topicPartition=topic1-2, upstreamOffset=2, downstreamOffset=2} applied, new state is [2:2,1:1,0:0] (org.apache.kafka.connect.mirror.OffsetSyncStore:194) TRACE New sync OffsetSync\{topicPartition=topic1-2, upstreamOffset=4, downstreamOffset=4} applied, new state is [4:4,0:0] (org.apache.kafka.connect.mirror.OffsetSyncStore:194) ``` Instead of being expired, the `2:2` sync should still be present in the final state, allowing the store to maintain 3 unique syncs. -- This message was sent by Atlassian Jira (v8.20.10#820010)