Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
kamalcph closed pull request #15634: KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO URL: https://github.com/apache/kafka/pull/15634
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
showuon commented on PR #15634: URL: https://github.com/apache/kafka/pull/15634#issuecomment-2133081717

@kamalcph , could we close this PR since https://github.com/apache/kafka/pull/15825 is merged?
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
junrao commented on code in PR #15634: URL: https://github.com/apache/kafka/pull/15634#discussion_r1586840549

## core/src/main/scala/kafka/log/UnifiedLog.scala:
@@ -282,15 +282,15 @@ class UnifiedLog(@volatile var logStartOffset: Long,
   /**
    * Update high watermark with offset metadata. The new high watermark will be lower
-   * bounded by the log start offset and upper bounded by the log end offset.
+   * bounded by the local-log-start-offset and upper bounded by the log-end-offset.
    *
    * @param highWatermarkMetadata the suggested high watermark with offset metadata
    * @return the updated high watermark offset
    */
  def updateHighWatermark(highWatermarkMetadata: LogOffsetMetadata): Long = {
    val endOffsetMetadata = localLog.logEndOffsetMetadata
-   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < logStartOffset) {
-     new LogOffsetMetadata(logStartOffset)
+   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < _localLogStartOffset) {

Review Comment:

> I'm not clear on this:
>
> 1. Segments are eligible for upload to remote storage only when the lastStableOffset moves beyond the segment-to-be-uploaded-end-offset.
> 2. When all the replicas lose local data (offline partition), we consider the data in remote storage lost as well. Currently, for this case, we don't have a provision to serve the remote data.
> 3. When firstUnstableOffsetMetadata is empty, we return highWatermark. With this patch, the highWatermark lower boundary is set to localLogStartOffset, so there won't be an issue.

That's true. It's just that this is yet another offset that we need to bound. I am also not sure whether there are other side effects of adjusting the HWM and LSO. Left some comments on https://github.com/apache/kafka/pull/15825.
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
kamalcph commented on code in PR #15634: URL: https://github.com/apache/kafka/pull/15634#discussion_r1582149811

## core/src/main/scala/kafka/log/UnifiedLog.scala:
@@ -282,15 +282,15 @@ class UnifiedLog(@volatile var logStartOffset: Long,
   /**
    * Update high watermark with offset metadata. The new high watermark will be lower
-   * bounded by the log start offset and upper bounded by the log end offset.
+   * bounded by the local-log-start-offset and upper bounded by the log-end-offset.
    *
    * @param highWatermarkMetadata the suggested high watermark with offset metadata
    * @return the updated high watermark offset
    */
  def updateHighWatermark(highWatermarkMetadata: LogOffsetMetadata): Long = {
    val endOffsetMetadata = localLog.logEndOffsetMetadata
-   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < logStartOffset) {
-     new LogOffsetMetadata(logStartOffset)
+   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < _localLogStartOffset) {

Review Comment: Opened draft PR #15825 with the suggested approach. PTAL.
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
kamalcph commented on code in PR #15634: URL: https://github.com/apache/kafka/pull/15634#discussion_r1582144306

## core/src/main/scala/kafka/log/UnifiedLog.scala:
@@ -282,15 +282,15 @@ class UnifiedLog(@volatile var logStartOffset: Long,
   /**
    * Update high watermark with offset metadata. The new high watermark will be lower
-   * bounded by the log start offset and upper bounded by the log end offset.
+   * bounded by the local-log-start-offset and upper bounded by the log-end-offset.
    *
    * @param highWatermarkMetadata the suggested high watermark with offset metadata
    * @return the updated high watermark offset
    */
  def updateHighWatermark(highWatermarkMetadata: LogOffsetMetadata): Long = {
    val endOffsetMetadata = localLog.logEndOffsetMetadata
-   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < logStartOffset) {
-     new LogOffsetMetadata(logStartOffset)
+   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < _localLogStartOffset) {

Review Comment:

> For example, if we lose the local data in all replicas, the lastStableOffset could still be in the middle of a tiered segment and moving it to localLogStartOffset immediately will be incorrect.

I'm not clear on this:

1. Segments are eligible for upload to remote storage only when the `lastStableOffset` moves beyond the segment-to-be-uploaded-end-offset.
2. When all the replicas lose local data (offline partition), we consider the data in remote storage lost as well. Currently, for this case, we don't have a provision to serve the remote data.
3. When `firstUnstableOffsetMetadata` is empty, we return `highWatermark`. With this patch, the `highWatermark` lower boundary is set to `localLogStartOffset`, so there won't be an issue.

> Note that OffsetMetadata (segmentBaseOffset and relativePositionInSegment) is only used in DelayedFetch for estimating the amount of available bytes.

The [LogOffsetMetadata#onOlderSegment](https://sourcegraph.com/github.com/apache/kafka@5de5d967adffd864bad3ec729760a430253abf38/-/blob/storage/src/main/java/org/apache/kafka/storage/internals/log/LogOffsetMetadata.java?L54) method is used in the [hot-path](https://sourcegraph.com/github.com/apache/kafka@5de5d967adffd864bad3ec729760a430253abf38/-/blob/core/src/main/scala/kafka/log/UnifiedLog.scala?L324) of incrementing the high-watermark and expects the full metadata; otherwise it throws an error. Is it OK to remove the throw from the LogOffsetMetadata#onOlderSegment method and return `false` when only the message offset (`messageOffsetOnly`) is available?
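[Editor's note] To make the proposal above concrete, here is a minimal Scala sketch of the suggested `onOlderSegment` behavior. It is a hypothetical stand-in (the real `LogOffsetMetadata` is a Java class under `storage/src/main/java`), not the actual implementation:

```scala
// Hypothetical, simplified stand-in for the Java class
// org.apache.kafka.storage.internals.log.LogOffsetMetadata; names mirror the discussion.
final case class LogOffsetMetadata(messageOffset: Long,
                                   segmentBaseOffset: Long = -1L,
                                   relativePositionInSegment: Int = 0) {

  // True when only the message offset is known (no segment position info).
  def messageOffsetOnly: Boolean = segmentBaseOffset < 0L

  // Proposed behavior: instead of throwing when either side lacks segment info,
  // conservatively answer "not known to be on an older segment".
  def onOlderSegment(that: LogOffsetMetadata): Boolean =
    if (messageOffsetOnly || that.messageOffsetOnly) false
    else this.segmentBaseOffset < that.segmentBaseOffset
}
```

Answering `false` here simply means "not known to be on an older segment", which errs on the side of leaving the high-watermark's full metadata unresolved rather than failing the hot path.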
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
kamalcph commented on code in PR #15634: URL: https://github.com/apache/kafka/pull/15634#discussion_r1571126880

## core/src/main/scala/kafka/log/UnifiedLog.scala:
@@ -282,15 +282,15 @@ class UnifiedLog(@volatile var logStartOffset: Long,
   /**
    * Update high watermark with offset metadata. The new high watermark will be lower
-   * bounded by the log start offset and upper bounded by the log end offset.
+   * bounded by the local-log-start-offset and upper bounded by the log-end-offset.
    *
    * @param highWatermarkMetadata the suggested high watermark with offset metadata
    * @return the updated high watermark offset
    */
  def updateHighWatermark(highWatermarkMetadata: LogOffsetMetadata): Long = {
    val endOffsetMetadata = localLog.logEndOffsetMetadata
-   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < logStartOffset) {
-     new LogOffsetMetadata(logStartOffset)
+   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < _localLogStartOffset) {

Review Comment: Thanks for suggesting the alternative approach. I'll check and come back on this.
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
kamalcph commented on PR #15634: URL: https://github.com/apache/kafka/pull/15634#issuecomment-2064647379

Thanks @chia7712 for the review!

> log-start-offset-checkpoint is missing and remote storage is enabled. The logStartOffset will be set to zero, and it seems to be a potential issue since the ListOffsetRequest could get an incorrect result.

Most of the time, when the follower joins the ISR, it updates the log-start-offset and high-watermark from the leader FETCH response. The issue can happen only when the follower gets elected as leader before updating its state, as mentioned in the summary/comments.

When the `log-start-offset-checkpoint` file is missing:

1. For a normal topic, the log-start-offset will be set to the base-offset of the first log segment, so there is no issue. Since the data is there, reads won't fail.
2. For a remote topic, the log-start-offset will be stale for some time until the RemoteLogManager [updates](https://github.com/apache/kafka/blob/trunk/core/src/main/java/kafka/log/remote/RemoteLogManager.java#L671) it, so the issue is intermittent and self-recovers.

> replication-offset-checkpoint is missing and remote storage is enabled. This is what you described. The HWM is pointed to the middle of tiered storage and so it causes an error when fetching records from local segments.

This is not an issue for a normal topic. But for a cluster with remote storage enabled, if the issue happens on even one partition, it starts to affect a *subset* of topics: the controller batches the partitions in the LeaderAndIsr request, and if the broker fails to process the LISR for one partition, the remaining partitions in that batch won't be processed. The producers producing to those topics will start receiving the NOT_LEADER_FOR_PARTITION error.
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
chia7712 commented on PR #15634: URL: https://github.com/apache/kafka/pull/15634#issuecomment-2063727744

Sorry, the story I mentioned above seems to be another issue. Let me summarize my thoughts.

1. `log-start-offset-checkpoint` is missing and remote storage is enabled. The `logStartOffset` will be set to zero, and it seems to be a potential issue since the `ListOffsetRequest` could get an incorrect result.
2. `replication-offset-checkpoint` is missing and remote storage is enabled. This is what you described. The HWM is pointed to the middle of tiered storage and so it causes an error when fetching records from local segments.
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
chia7712 commented on PR #15634: URL: https://github.com/apache/kafka/pull/15634#issuecomment-2063435567

> HWM is set to localLogStartOffset in [UnifiedLog#updateLocalLogStartOffset](https://sourcegraph.com/github.com/apache/kafka@f895ab5145077c5efa10a4a898628d901b01e2c2/-/blob/core/src/main/scala/kafka/log/UnifiedLog.scala?L162), then we load the HWM from the checkpoint file in [Partition#createLog](https://sourcegraph.com/github.com/apache/kafka@f895ab5145077c5efa10a4a898628d901b01e2c2/-/blob/core/src/main/scala/kafka/cluster/Partition.scala?L495). If the HWM checkpoint file is missing / does not contain the entry for the partition, then the default value of 0 is taken. If 0 < LogStartOffset (LSO), then LSO is assumed as HWM. Thus, the non-monotonic update of high-watermark from LLSO to LSO can happen.

Pardon me, I'm a bit confused about this. Please feel free to correct me to help me catch up :smile:

### case 0: the checkpoint file is missing and the remote storage is **disabled**

The LSO is initialized to LLSO (https://github.com/apache/kafka/blob/aee9724ee15ed539ae73c09cc2c2eda83ae3c864/core/src/main/scala/kafka/log/LogLoader.scala#L180), so I don't understand why the non-monotonic update happens. After all, LLSO and LSO are the same in this scenario.

### case 1: the checkpoint file is missing and the remote storage is **enabled**

The LSO is initialized to `logStartOffsetCheckpoint`, which is 0 since there are no checkpoint files (https://github.com/apache/kafka/blob/aee9724ee15ed539ae73c09cc2c2eda83ae3c864/core/src/main/scala/kafka/log/LogLoader.scala#L178). And then the HWM will be updated to LLSO, which is larger than zero (https://github.com/apache/kafka/blob/aee9724ee15ed539ae73c09cc2c2eda83ae3c864/core/src/main/scala/kafka/log/UnifiedLog.scala#L172). This could be a problem when [Partition#createLog](https://sourcegraph.com/github.com/apache/kafka@f895ab5145077c5efa10a4a898628d901b01e2c2/-/blob/core/src/main/scala/kafka/cluster/Partition.scala?L495) gets called, since the HWM is changed from LLSO (non-zero) to LSO (zero). Also, the incorrect HWM causes an error in `convertToOffsetMetadataOrThrow`.

If I understand correctly, it seems the root cause is that "when the checkpoint files are not working, we will initialize a `UnifiedLog` with an incorrect LSO". So could we fix that by re-building `logStartOffsets` from remote storage when the checkpoint is not working (https://github.com/apache/kafka/blob/aee9724ee15ed539ae73c09cc2c2eda83ae3c864/core/src/main/scala/kafka/log/LogManager.scala#L459)?
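[Editor's note] A minimal sketch of the initialization difference the two cases above describe, paraphrasing the linked LogLoader.scala lines; the names are illustrative, not the actual API:

```scala
// Illustrative paraphrase of the LogLoader start-offset initialization referenced above.
def initialLogStartOffset(remoteLogEnabled: Boolean,
                          logStartOffsetCheckpoint: Long, // 0L when the checkpoint file is missing
                          firstLocalSegmentBaseOffset: Long): Long =
  if (remoteLogEnabled)
    logStartOffsetCheckpoint // case 1: stays 0, so LSO diverges from LLSO
  else
    // case 0: clamped up to the first local segment, so LSO == LLSO
    math.max(logStartOffsetCheckpoint, firstLocalSegmentBaseOffset)
```

With remote storage disabled, the clamp keeps LSO and LLSO identical, which is why case 0 cannot produce the non-monotonic update.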
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
junrao commented on code in PR #15634: URL: https://github.com/apache/kafka/pull/15634#discussion_r1569562746

## core/src/main/scala/kafka/log/UnifiedLog.scala:
@@ -282,15 +282,15 @@ class UnifiedLog(@volatile var logStartOffset: Long,
   /**
    * Update high watermark with offset metadata. The new high watermark will be lower
-   * bounded by the log start offset and upper bounded by the log end offset.
+   * bounded by the local-log-start-offset and upper bounded by the log-end-offset.
    *
    * @param highWatermarkMetadata the suggested high watermark with offset metadata
    * @return the updated high watermark offset
    */
  def updateHighWatermark(highWatermarkMetadata: LogOffsetMetadata): Long = {
    val endOffsetMetadata = localLog.logEndOffsetMetadata
-   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < logStartOffset) {
-     new LogOffsetMetadata(logStartOffset)
+   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < _localLogStartOffset) {

Review Comment: @kamalcph : Thanks for the explanation. I understand the problem now.

As for the fix, it seems that it could work for the HWM. However, I am not sure that we could always do the same thing for the LastStableOffset. For example, if we lose the local data in all replicas, the lastStableOffset could still be in the middle of a tiered segment and moving it to localLogStartOffset immediately would be incorrect.

Here is another potential approach. Note that OffsetMetadata (segmentBaseOffset and relativePositionInSegment) is only used in DelayedFetch for estimating the amount of available bytes. If occasionally the OffsetMetadata is not available, we don't have to force an exception in convertToOffsetMetadataOrThrow(). Instead, we can leave the OffsetMetadata empty and just use a conservative 1 byte for estimating the amount of available bytes. This approach would apply to both the HWM and the LSO. The inaccurate byte estimate will be ok as long as it's infrequent. What do you think?
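[Editor's note] A minimal sketch of the fallback described above, under assumed names (this is not the actual DelayedFetch code): when either offset carries only a message offset, estimate one available byte instead of forcing the conversion that can throw.

```scala
// Illustrative sketch only; field and function names are assumptions.
final case class OffsetMeta(messageOffset: Long,
                            segmentBaseOffset: Long = -1L,
                            relativePositionInSegment: Int = 0) {
  def messageOffsetOnly: Boolean = segmentBaseOffset < 0L
}

def estimateAvailableBytes(fetchOffset: OffsetMeta, endOffset: OffsetMeta): Long =
  if (fetchOffset.messageOffsetOnly || endOffset.messageOffsetOnly)
    1L // conservative: assume one byte rather than converting (and possibly throwing)
  else if (fetchOffset.segmentBaseOffset == endOffset.segmentBaseOffset)
    (endOffset.relativePositionInSegment - fetchOffset.relativePositionInSegment).toLong
  else
    Long.MaxValue // offsets sit on different segments; treat as "plenty of bytes"
```

The estimate only influences when a delayed fetch is considered satisfied, so an occasional one-byte guess slightly delays completion rather than causing an error.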
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
kamalcph commented on code in PR #15634: URL: https://github.com/apache/kafka/pull/15634#discussion_r1569356239

## core/src/main/scala/kafka/log/UnifiedLog.scala:
@@ -1223,6 +1223,12 @@ class UnifiedLog(@volatile var logStartOffset: Long,
        s"but we only have log segments starting from offset: $logStartOffset.")
  }

+ private def checkLocalLogStartOffset(offset: Long): Unit = {

Review Comment: Agree on this. The `checkLocalLogStartOffset` is used only in the `convertToOffsetMetadataOrThrow` method, which reads from the local disk.
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
kamalcph commented on code in PR #15634: URL: https://github.com/apache/kafka/pull/15634#discussion_r1563885907

## core/src/main/scala/kafka/log/UnifiedLog.scala:
@@ -282,15 +282,15 @@ class UnifiedLog(@volatile var logStartOffset: Long,
   /**
    * Update high watermark with offset metadata. The new high watermark will be lower
-   * bounded by the log start offset and upper bounded by the log end offset.
+   * bounded by the local-log-start-offset and upper bounded by the log-end-offset.
    *
    * @param highWatermarkMetadata the suggested high watermark with offset metadata
    * @return the updated high watermark offset
    */
  def updateHighWatermark(highWatermarkMetadata: LogOffsetMetadata): Long = {
    val endOffsetMetadata = localLog.logEndOffsetMetadata
-   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < logStartOffset) {
-     new LogOffsetMetadata(logStartOffset)
+   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < _localLogStartOffset) {

Review Comment:

> In the rare case, the restarted broker is elected as the leader before caught up through unclean election. Is this the case that you want to address?

Yes, we want to address this case too. And the issue can also happen during a clean preferred-leader-election:

```
Call stack:

The replica (1002) has full data but the HW is invalid, so the fetch-offset will be equal to LeaderLog(1001).highWatermark.

Leader (1001):
KafkaApis.handleFetchRequest
ReplicaManager.fetchMessages
ReplicaManager.readFromLocalLog
Partition.fetchRecords
Partition.updateFollowerFetchState
Partition.maybeExpandIsr
Partition.submitAlterPartition
...
...
...
# If there is not enough data to respond and there is no remote data, we will let the fetch request wait for new data.
# parks the request in the DelayedFetchPurgatory

Another thread runs Preferred-Leader-Election in the controller (1003); since replica 1002 joined the ISR list, it can be elected as the preferred leader. The controller sends LeaderAndIsr requests to all the brokers.

KafkaController.processReplicaLeaderElection
KafkaController.onReplicaElection
PartitionStateMachine.handleStateChanges
PartitionStateMachine.doHandleStateChanges
PartitionStateMachine.electLeaderForPartitions
ControllerChannelManager.sendRequestsToBrokers

Replica 1002 gets elected as leader but has an invalid highWatermark, since it didn't process the fetch-response from the previous leader 1001, and throws an OFFSET_OUT_OF_RANGE error when processing the LeaderAndIsr request. Note that in a LeaderAndIsr request, even if one partition fails, the remaining partitions in that request won't be processed.

KafkaApis.handleLeaderAndIsrRequest
ReplicaManager.becomeLeaderOrFollower
ReplicaManager.makeLeaders
Partition.makeLeader
Partition.maybeIncrementLeaderHW
UnifiedLog.maybeIncrementHighWatermark (LeaderLog)
UnifiedLog.fetchHighWatermarkMetadata

The controller assumes that the current leader for tp0 is 1002, but broker 1002 couldn't process the LISR. The controller retries the LISR until broker 1002 becomes leader for tp0. During this time, the producers won't be able to send messages, as node 1002 returns the NOT_LEADER_FOR_PARTITION error-code to the producer.

During this time, if a follower sends a FETCH request to read from the current leader 1002, then an OFFSET_OUT_OF_RANGE error will be returned by the leader:

KafkaApis.handleFetchRequest
ReplicaManager.fetchMessages
ReplicaManager.readFromLog
Partition.fetchRecords # readFromLocalLog
Partition.updateFollowerFetchState
Partition.maybeIncrementLeaderHW
LeaderLog.maybeIncrementHighWatermark
UnifiedLog.fetchHighWatermarkMetadata
UnifiedLog.convertToOffsetMetadataOrThrow
LocalLog.convertToOffsetMetadataOrThrow
LocalLog.read # OffsetOutOfRangeException exception
```
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
kamalcph commented on code in PR #15634: URL: https://github.com/apache/kafka/pull/15634#discussion_r1569328393

## core/src/main/scala/kafka/log/UnifiedLog.scala:
@@ -282,15 +282,15 @@ class UnifiedLog(@volatile var logStartOffset: Long,
   /**
    * Update high watermark with offset metadata. The new high watermark will be lower
-   * bounded by the log start offset and upper bounded by the log end offset.
+   * bounded by the local-log-start-offset and upper bounded by the log-end-offset.
    *
    * @param highWatermarkMetadata the suggested high watermark with offset metadata
    * @return the updated high watermark offset
    */
  def updateHighWatermark(highWatermarkMetadata: LogOffsetMetadata): Long = {
    val endOffsetMetadata = localLog.logEndOffsetMetadata
-   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < logStartOffset) {
-     new LogOffsetMetadata(logStartOffset)
+   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < _localLogStartOffset) {

Review Comment:

> For the makeLeaders path, it will call UnifiedLog.convertToOffsetMetadataOrThrow. Within it, checkLogStartOffset(offset) shouldn't throw OFFSET_OUT_OF_RANGE since we are comparing the offset with logStartOffset. Do you know which part throws OFFSET_OUT_OF_RANGE error?

The next line, `localLog.convertToOffsetMetadataOrThrow`, in the [convertToOffsetMetadataOrThrow](https://sourcegraph.com/github.com/apache/kafka@a3dcbd4e28a35f79f75ec1bf316ef0b39c0df164/-/blob/core/src/main/scala/kafka/log/UnifiedLog.scala?L1429) method reads the segment from disk, and that is where it throws the error. When the call [segments.floorSegment(startOffset)](https://sourcegraph.com/github.com/apache/kafka@a3dcbd4e28a35f79f75ec1bf316ef0b39c0df164/-/blob/core/src/main/scala/kafka/log/LocalLog.scala?L366) in LocalLog fails to find the segment for the log-start-offset, an OffsetOutOfRangeException is thrown.

> For the follower fetch path, it's bounded by LogEndOffset. So it shouldn't need to call UnifiedLog.fetchHighWatermarkMetadata, right? The regular consumer will call UnifiedLog.fetchHighWatermarkMetadata.

Yes, you're right. I attached the wrong call stack for handling the follower request. Please find the updated call stack below.

The leader with an invalid high-watermark handles the FETCH request from the follower and throws an OFFSET_OUT_OF_RANGE error:

```
KafkaApis.handleFetchRequest
ReplicaManager.fetchMessages
ReplicaManager.readFromLog
Partition.fetchRecords # readFromLocalLog
Partition.updateFollowerFetchState
Partition.maybeIncrementLeaderHW
LeaderLog.maybeIncrementHighWatermark
UnifiedLog.fetchHighWatermarkMetadata
UnifiedLog.convertToOffsetMetadataOrThrow
LocalLog.convertToOffsetMetadataOrThrow
LocalLog.read # OffsetOutOfRangeException exception
```
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
junrao commented on code in PR #15634: URL: https://github.com/apache/kafka/pull/15634#discussion_r1567796843

## core/src/main/scala/kafka/log/UnifiedLog.scala:
@@ -282,15 +282,15 @@ class UnifiedLog(@volatile var logStartOffset: Long,
   /**
    * Update high watermark with offset metadata. The new high watermark will be lower
-   * bounded by the log start offset and upper bounded by the log end offset.
+   * bounded by the local-log-start-offset and upper bounded by the log-end-offset.
    *
    * @param highWatermarkMetadata the suggested high watermark with offset metadata
    * @return the updated high watermark offset
    */
  def updateHighWatermark(highWatermarkMetadata: LogOffsetMetadata): Long = {
    val endOffsetMetadata = localLog.logEndOffsetMetadata
-   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < logStartOffset) {
-     new LogOffsetMetadata(logStartOffset)
+   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < _localLogStartOffset) {

Review Comment: Thanks for the detailed explanation.

For the `makeLeaders` path, it will call `UnifiedLog.convertToOffsetMetadataOrThrow`. Within it, `checkLogStartOffset(offset)` shouldn't throw OFFSET_OUT_OF_RANGE since we are comparing the offset with logStartOffset. Do you know which part throws the OFFSET_OUT_OF_RANGE error?

For the follower fetch path, it's bounded by `LogEndOffset`. So it shouldn't need to call `UnifiedLog.fetchHighWatermarkMetadata`, right? The regular consumer will call `UnifiedLog.fetchHighWatermarkMetadata`.
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
chia7712 commented on code in PR #15634: URL: https://github.com/apache/kafka/pull/15634#discussion_r1567414802

## core/src/main/scala/kafka/log/UnifiedLog.scala:
@@ -1223,6 +1223,12 @@ class UnifiedLog(@volatile var logStartOffset: Long,
        s"but we only have log segments starting from offset: $logStartOffset.")
  }

+ private def checkLocalLogStartOffset(offset: Long): Unit = {

Review Comment: It seems reading records between [logStartOffset, localLogStartOffset] is dangerous, since the segment won't be on the local disk. That is a bit chaotic to me, as `UnifiedLog` presents a unified view of local and tiered log segments (https://github.com/apache/kafka/blob/fccd7fec666d6570758e0b7891771099240ceee8/core/src/main/scala/kafka/log/UnifiedLog.scala#L59). The check looks like a limitation: we can't "view" data from the tiered log segments.
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
junrao commented on code in PR #15634: URL: https://github.com/apache/kafka/pull/15634#discussion_r1562991673

## core/src/main/scala/kafka/log/UnifiedLog.scala:
@@ -282,15 +282,15 @@ class UnifiedLog(@volatile var logStartOffset: Long,
   /**
    * Update high watermark with offset metadata. The new high watermark will be lower
-   * bounded by the log start offset and upper bounded by the log end offset.
+   * bounded by the local-log-start-offset and upper bounded by the log-end-offset.
    *
    * @param highWatermarkMetadata the suggested high watermark with offset metadata
    * @return the updated high watermark offset
    */
  def updateHighWatermark(highWatermarkMetadata: LogOffsetMetadata): Long = {
    val endOffsetMetadata = localLog.logEndOffsetMetadata
-   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < logStartOffset) {
-     new LogOffsetMetadata(logStartOffset)
+   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < _localLogStartOffset) {

Review Comment: Yes, if replication-offset-checkpoint is corrupted, HWM could temporarily be set to below local-log-start-offset. I am still trying to understand the impact of that. In the common case, the restarted broker can't become the leader or serve reads until it's caught up. At that time, the HWM will be up to date. In the rare case, the restarted broker is elected as the leader before caught up through unclean election. Is this the case that you want to address?

The jira also says:

> If the high watermark is less than the local-log-start-offset, then the [UnifiedLog#fetchHighWatermarkMetadata](https://sourcegraph.com/github.com/apache/kafka@d4caa1c10ec81b9c87eaaf52b73c83d5579b68d3/-/blob/core/src/main/scala/kafka/log/UnifiedLog.scala?L358) method will throw the OFFSET_OUT_OF_RANGE error when it converts the offset to metadata. Once this error happens, the followers will receive out-of-range exceptions and the producers won't be able to produce messages since the leader cannot move the high watermark.

However, the follower read is bounded by logEndOffset, not HWM? Where does the follower read need to convert HWM to metadata?

## core/src/test/scala/unit/kafka/log/UnifiedLogTest.scala:
@@ -318,6 +318,80 @@ class UnifiedLogTest {
    assertHighWatermark(4L)
  }

+  @Test
+  def testHighWatermarkMaintenanceForRemoteTopic(): Unit = {
+    val logConfig = LogTestUtils.createLogConfig(segmentBytes = 1024 * 1024, remoteLogStorageEnable = true)
+    val log = createLog(logDir, logConfig, remoteStorageSystemEnable = true)
+    val leaderEpoch = 0
+
+    def assertHighWatermark(offset: Long): Unit = {
+      assertEquals(offset, log.highWatermark)
+      assertValidLogOffsetMetadata(log, log.fetchOffsetSnapshot.highWatermark)
+    }
+
+    // High watermark initialized to 0
+    assertHighWatermark(0L)
+
+    var offset = 0L
+    for (_ <- 0 until 50) {
+      val records = TestUtils.singletonRecords("test".getBytes())
+      val info = log.appendAsLeader(records, leaderEpoch)
+      offset = info.lastOffset
+      if (offset != 0 && offset % 10 == 0)
+        log.roll()
+    }
+    assertEquals(5, log.logSegments.size)
+
+    // High watermark not changed by append
+    assertHighWatermark(0L)
+
+    // Update high watermark as leader
+    log.maybeIncrementHighWatermark(new LogOffsetMetadata(50L))
+    assertHighWatermark(50L)
+    assertEquals(50L, log.logEndOffset)
+
+    // Cannot update high watermark past the log end offset
+    log.updateHighWatermark(60L)
+    assertHighWatermark(50L)
+
+    // simulate calls to upload 3 segments to remote storage and remove them from local-log.
+    log.updateHighestOffsetInRemoteStorage(30)
+    log.maybeIncrementLocalLogStartOffset(31L, LogStartOffsetIncrementReason.SegmentDeletion)
+    log.deleteOldSegments()
+    assertEquals(2, log.logSegments.size)
+    assertEquals(31L, log.localLogStartOffset())
+    assertHighWatermark(50L)
+
+    // simulate one remote-log segment deletion
+    val logStartOffset = 11L
+    log.maybeIncrementLogStartOffset(logStartOffset, LogStartOffsetIncrementReason.SegmentDeletion)
+    assertEquals(11, log.logStartOffset)
+
+    // Updating the HW below the log-start-offset / local-log-start-offset is not allowed. HW should reset to local-log-start-offset.
+    log.updateHighWatermark(new LogOffsetMetadata(5L))
+    assertHighWatermark(31L)
+    // Updating the HW between log-start-offset and local-log-start-offset is not allowed. HW should reset to local-log-start-offset.

Review Comment: This is moving HW below local-log-start-offset, not log-start-offset.
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
kamalcph commented on PR #15634: URL: https://github.com/apache/kafka/pull/15634#issuecomment-2050934793

@junrao @showuon @divijvaidya Gentle bump to review the diff, thanks!
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
kamalcph commented on code in PR #15634: URL: https://github.com/apache/kafka/pull/15634#discussion_r1557089094

## core/src/main/scala/kafka/log/UnifiedLog.scala:
@@ -282,15 +282,15 @@ class UnifiedLog(@volatile var logStartOffset: Long,
   /**
    * Update high watermark with offset metadata. The new high watermark will be lower
-   * bounded by the log start offset and upper bounded by the log end offset.
+   * bounded by the local-log-start-offset and upper bounded by the log-end-offset.
    *
    * @param highWatermarkMetadata the suggested high watermark with offset metadata
    * @return the updated high watermark offset
    */
  def updateHighWatermark(highWatermarkMetadata: LogOffsetMetadata): Long = {
    val endOffsetMetadata = localLog.logEndOffsetMetadata
-   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < logStartOffset) {
-     new LogOffsetMetadata(logStartOffset)
+   val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < _localLogStartOffset) {

Review Comment:

> when will we set HWM to be lower than _localLogStartOffset?

This can happen when recovering the partition after an ungraceful shutdown and the replication-offset-checkpoint file is missing/corrupted. When the broker comes online, the HWM is set to localLogStartOffset in [UnifiedLog#updateLocalLogStartOffset](https://sourcegraph.com/github.com/apache/kafka@f895ab5145077c5efa10a4a898628d901b01e2c2/-/blob/core/src/main/scala/kafka/log/UnifiedLog.scala?L162); then we load the HWM from the checkpoint file in [Partition#createLog](https://sourcegraph.com/github.com/apache/kafka@f895ab5145077c5efa10a4a898628d901b01e2c2/-/blob/core/src/main/scala/kafka/cluster/Partition.scala?L495). If the HWM checkpoint file is missing / does not contain an entry for the partition, then the default value of 0 is taken. If 0 < LogStartOffset (LSO), then the LSO is assumed as the HWM. Thus, the non-monotonic update of the high-watermark from LLSO to LSO can happen.
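[Editor's note] A toy replay of that recovery sequence, with offsets borrowed from the test in this PR (31 = LLSO after local deletion, 11 = LSO); the variable names and steps are illustrative, not the actual broker code:

```scala
object HwmRecoveryExample extends App {
  val logStartOffset = 11L        // LSO still points into tiered storage
  val localLogStartOffset = 31L   // LLSO after uploaded segments were deleted locally

  // Step 1: on load, UnifiedLog#updateLocalLogStartOffset seeds the HWM with the LLSO.
  var highWatermark = localLogStartOffset // HWM = 31

  // Step 2: Partition#createLog re-reads the HWM from replication-offset-checkpoint.
  // A missing file (or missing entry) defaults to 0, which is then clamped up to the LSO.
  val checkpointedHighWatermark = 0L
  highWatermark = math.max(checkpointedHighWatermark, logStartOffset) // HWM regresses to 11

  assert(highWatermark < localLogStartOffset) // the non-monotonic LLSO -> LSO update
}
```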
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
junrao commented on code in PR #15634: URL: https://github.com/apache/kafka/pull/15634#discussion_r1556593959

## core/src/main/scala/kafka/log/UnifiedLog.scala:
## @@ -282,15 +282,15 @@ class UnifiedLog(@volatile var logStartOffset: Long,
   /**
    * Update high watermark with offset metadata. The new high watermark will be lower
-   * bounded by the log start offset and upper bounded by the log end offset.
+   * bounded by the local-log-start-offset and upper bounded by the log-end-offset.
    *
    * @param highWatermarkMetadata the suggested high watermark with offset metadata
    * @return the updated high watermark offset
    */
   def updateHighWatermark(highWatermarkMetadata: LogOffsetMetadata): Long = {
     val endOffsetMetadata = localLog.logEndOffsetMetadata
-    val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < logStartOffset) {
-      new LogOffsetMetadata(logStartOffset)
+    val newHighWatermarkMetadata = if (highWatermarkMetadata.messageOffset < _localLogStartOffset) {

Review Comment:
   Hmm, when will we set HWM to be lower than _localLogStartOffset?

   In `UnifiedLog.deletableSegments()`, we have the following code that bounds the retention-based deletion by highWatermark. When updating highWatermark, the value typically increases.

   `val predicateResult = highWatermark >= upperBoundOffset && predicate(segment, nextSegmentOpt)`
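Roughly, the bound junrao quotes means a segment is only deletable once the high watermark has passed its last offset. A simplified sketch of that guard (`LogSegment` here is a stand-in for kafka.log.LogSegment, and the function is illustrative, not the actual `deletableSegments` signature):

```scala
final case class LogSegment(baseOffset: Long) // stand-in for kafka.log.LogSegment

def eligibleForDeletion(segment: LogSegment,
                        nextSegmentOpt: Option[LogSegment],
                        highWatermark: Long,
                        logEndOffset: Long,
                        predicate: (LogSegment, Option[LogSegment]) => Boolean): Boolean = {
  // every offset of `segment` lies below the next segment's base offset
  val upperBoundOffset = nextSegmentOpt.map(_.baseOffset).getOrElse(logEndOffset)
  // a segment may only be retention-deleted once the HW has passed all of it,
  // so a regressed (too-low) HW would silently block local retention as well
  highWatermark >= upperBoundOffset && predicate(segment, nextSegmentOpt)
}
```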
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
satishd commented on code in PR #15634: URL: https://github.com/apache/kafka/pull/15634#discussion_r1546167963

## core/src/main/scala/kafka/log/UnifiedLog.scala:
## @@ -136,16 +136,16 @@ class UnifiedLog(@volatile var logStartOffset: Long,
    */
   @volatile private var firstUnstableOffsetMetadata: Option[LogOffsetMetadata] = None
 
+  @volatile var partitionMetadataFile: Option[PartitionMetadataFile] = None

Review Comment:
   nit: You can leave this field at its earlier place, as moving it is not really needed for this change.

## core/src/main/scala/kafka/log/UnifiedLog.scala:
## @@ -136,16 +136,16 @@ class UnifiedLog(@volatile var logStartOffset: Long,
    */
   @volatile private var firstUnstableOffsetMetadata: Option[LogOffsetMetadata] = None
 
+  @volatile var partitionMetadataFile: Option[PartitionMetadataFile] = None
+
+  @volatile private[kafka] var _localLogStartOffset: Long = logStartOffset
+
   /* Keep track of the current high watermark in order to ensure that segments containing offsets at or above it are
    * not eligible for deletion. This means that the active segment is only eligible for deletion if the high watermark
    * equals the log end offset (which may never happen for a partition under consistent load). This is needed to
    * prevent the log start offset (which is exposed in fetch responses) from getting ahead of the high watermark.
    */
-  @volatile private var highWatermarkMetadata: LogOffsetMetadata = new LogOffsetMetadata(logStartOffset)
-
-  @volatile var partitionMetadataFile: Option[PartitionMetadataFile] = None
-
-  @volatile private[kafka] var _localLogStartOffset: Long = logStartOffset
+  @volatile private var highWatermarkMetadata: LogOffsetMetadata = new LogOffsetMetadata(_localLogStartOffset)

Review Comment:
   This change has no effect, as `_localLogStartOffset` is initialized with `logStartOffset`. Still, it is good to use `_localLogStartOffset` here for consistency and for the relevance of this field.

## core/src/test/scala/unit/kafka/log/UnifiedLogTest.scala:
## @@ -318,6 +318,80 @@ class UnifiedLogTest {
     assertHighWatermark(4L)
   }
 
+  @Test
+  def testHighWatermarkMaintenanceForRemoteTopic(): Unit = {
+    val logConfig = LogTestUtils.createLogConfig(segmentBytes = 1024 * 1024, remoteLogStorageEnable = true)
+    val log = createLog(logDir, logConfig, remoteStorageSystemEnable = true)
+    val leaderEpoch = 0
+
+    def assertHighWatermark(offset: Long): Unit = {
+      assertEquals(offset, log.highWatermark)
+      assertValidLogOffsetMetadata(log, log.fetchOffsetSnapshot.highWatermark)
+    }
+
+    // High watermark initialized to 0
+    assertHighWatermark(0L)
+
+    var offset = 0L
+    for(_ <- 0 until 50) {
+      val records = TestUtils.singletonRecords("test".getBytes())
+      val info = log.appendAsLeader(records, leaderEpoch)
+      offset = info.lastOffset
+      if (offset != 0 && offset % 10 == 0)
+        log.roll()
+    }
+    assertEquals(5, log.logSegments.size)
+
+    // High watermark not changed by append
+    assertHighWatermark(0L)
+
+    // Update high watermark as leader
+    log.maybeIncrementHighWatermark(new LogOffsetMetadata(50L))
+    assertHighWatermark(50L)
+    assertEquals(50L, log.logEndOffset)
+
+    // Cannot update high watermark past the log end offset
+    log.updateHighWatermark(60L)
+    assertHighWatermark(50L)
+
+    // simulate calls to upload 3 segments to remote storage and remove them from local-log.
+    log.updateHighestOffsetInRemoteStorage(30)
+    log.maybeIncrementLocalLogStartOffset(31L, LogStartOffsetIncrementReason.SegmentDeletion)
+    log.deleteOldSegments()
+    assertEquals(2, log.logSegments.size)
+    assertEquals(31L, log.localLogStartOffset())
+    assertHighWatermark(50L)
+
+    // simulate one remote-log segment deletion
+    val logStartOffset = 11L
+    log.maybeIncrementLogStartOffset(logStartOffset, LogStartOffsetIncrementReason.SegmentDeletion)
+    assertEquals(11, log.logStartOffset)
+
+    // Updating the HW below the log-start-offset / local-log-start-offset is not allowed. HW should reset to local-log-start-offset.
+    log.updateHighWatermark(new LogOffsetMetadata(5L))
+    assertHighWatermark(31L)
+    // Updating the HW between log-start-offset and local-log-start-offset is not allowed. HW should reset to local-log-start-offset.
+    log.updateHighWatermark(new LogOffsetMetadata(25L))
+    assertHighWatermark(31L)
+    // Updating the HW between local-log-start-offset and log-end-offset is allowed.
+    log.updateHighWatermark(new LogOffsetMetadata(32L))
+    assertHighWatermark(32L)
+    assertEquals(11L, log.logStartOffset)
+    assertEquals(31L, log.localLogStartOffset())
+
+    // Truncating the logs to below the local-log-start-offset, should update the high watermark

Review Comment:
   Good to see the truncation scenarios covered as well.
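To see why satishd notes the initializer swap has no effect at construction time, consider a stripped-down version (a sketch, not the real class):

```scala
// Sketch: at construction, LLSO == LSO, so both initializers yield the same HW.
class MiniLog(@volatile var logStartOffset: Long) {
  // equal to logStartOffset when the log is first created
  @volatile private var _localLogStartOffset: Long = logStartOffset
  // so this produces the same initial value as new LogOffsetMetadata(logStartOffset)
  @volatile private var highWatermark: Long = _localLogStartOffset
  // the two offsets diverge only later, once segments are uploaded to remote
  // storage and deleted locally (LLSO advances past LSO)
}
```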
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
kamalcph commented on PR #15634: URL: https://github.com/apache/kafka/pull/15634#issuecomment-2029474481

   > Just curious. Does it happen only if remote storage is enabled? According to the description:
   >
   > > The follower sends the first FETCH request to the leader; the leader checks whether the follower is in sync (isFollowerInSync), then expands the ISR. It also parks the request in the DelayedFetchPurgatory. If the replica is elected as leader before the fetch response gets processed, then the new leader will have the wrong high-watermark.
   >
   > It looks like the issue exists even when we don't use remote storage.

   For a normal topic, once the replica becomes leader, it is able to [resolve/convert](https://sourcegraph.com/github.com/apache/kafka@40e87ae35beb389d6419d32130174d7c68fa4d19/-/blob/core/src/main/scala/kafka/log/UnifiedLog.scala#L319) the high-watermark offset (log-start-offset) to metadata by reading the segment from disk, and then it updates the high-watermark to either the current leader's log-end-offset (or) the lowest LEO of all the eligible ISR replicas.

   In the case of a remote topic, the replica will fail to [resolve](https://sourcegraph.com/github.com/apache/kafka@40e87ae35beb389d6419d32130174d7c68fa4d19/-/blob/core/src/main/scala/kafka/log/UnifiedLog.scala#L319) the high-watermark offset (log-start-offset) to metadata since the segment won't be on local disk, and will then fail continuously.
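The resolve/convert step being referenced can be pictured with a small illustrative function (not the actual UnifiedLog code; the field names and exception are simplified stand-ins):

```scala
final case class LogOffsetMetadata(messageOffset: Long,
                                   segmentBaseOffset: Long,
                                   relativePositionInSegment: Int)

// Converting a bare offset into full metadata requires reading the owning
// segment from local disk. Below the local-log-start-offset there is no such
// segment for a tiered topic, so the conversion can only fail.
def convertToOffsetMetadata(offset: Long,
                            localLogStartOffset: Long,
                            localSegmentBaseOffsets: Seq[Long]): LogOffsetMetadata = {
  if (offset < localLogStartOffset)
    throw new IllegalStateException( // stands in for OFFSET_OUT_OF_RANGE
      s"offset $offset is below local-log-start-offset $localLogStartOffset")
  val base = localSegmentBaseOffsets.filter(_ <= offset).max // floor lookup of the owning segment
  LogOffsetMetadata(offset, base, (offset - base).toInt)     // byte position simplified for the sketch
}
```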
Re: [PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
chia7712 commented on PR #15634: URL: https://github.com/apache/kafka/pull/15634#issuecomment-2029310080

   Just curious. Does it happen only if remote storage is enabled? According to the description:

   > The follower sends the first FETCH request to the leader; the leader checks whether the follower is in sync (isFollowerInSync), then expands the ISR. It also parks the request in the DelayedFetchPurgatory. If the replica is elected as leader before the fetch response gets processed, then the new leader will have the wrong high-watermark.

   It looks like the issue exists even when we don't use remote storage.
[PR] KAFKA-16452: Bound high-watermark offset to range between LLSO and LEO [kafka]
kamalcph opened a new pull request, #15634: URL: https://github.com/apache/kafka/pull/15634

   Bound the high-watermark offset between the local-log-start-offset and the log-end-offset.

   The high watermark should not go below the local-log-start-offset. If the high watermark is less than the local-log-start-offset, then the UnifiedLog#fetchHighWatermarkMetadata method will throw an OFFSET_OUT_OF_RANGE error when it converts the offset to metadata. Once this error happens, the followers will receive out-of-range exceptions and the producers won't be able to produce messages, since the leader cannot move the high watermark.

   This issue can happen when the partition undergoes recovery due to corruption in the checkpoint file and gets elected as leader before it has a chance to update the HW from the previous leader.

   The follower sends the FETCH request to the leader; the leader checks whether the follower is in sync (isFollowerInSync), then expands the ISR. It also parks the request in the DelayedFetchPurgatory. If the replica is elected as leader before the fetch response gets processed, then the new leader will have the wrong high-watermark.

   ### Committer Checklist (excluded from commit message)
   - [ ] Verify design and implementation
   - [ ] Verify test coverage and CI build status
   - [ ] Verify documentation (including upgrade notes)
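To make the failure sequence concrete, here is a hedged sketch of the race as a commented timeline, with the bounding rule from the PR title applied at the end (offsets are illustrative; `boundHw` is a stand-in, not Kafka's actual method):

```scala
object HwRaceSketch extends App {
  val llso = 31L // local-log-start-offset on the recovering replica
  val leo  = 50L // its log-end-offset

  // After recovery with a corrupted checkpoint, the HW fell back to the
  // log-start-offset (11), which sits below the local-log-start-offset (31).
  var hw = 11L

  // t1: as follower, it sends FETCH; the leader sees it in sync, expands the
  //     ISR, and parks the request in the DelayedFetchPurgatory.
  // t2: it is elected leader before the parked fetch response is processed.
  // t3: as the new leader with hw < llso, resolving the HW to metadata needs a
  //     segment that exists only in remote storage -> OFFSET_OUT_OF_RANGE;
  //     followers get out-of-range errors and producers stall because the HW
  //     cannot advance.

  // The PR's bounding rule prevents t3 by clamping the HW into [LLSO, LEO]:
  def boundHw(proposed: Long): Long = math.min(math.max(proposed, llso), leo)
  hw = boundHw(hw)
  assert(hw == llso) // 11 -> 31
}
```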