[jira] [Commented] (KAFKA-12261) Splitting partition causes message loss for consumers with auto.offset.reset=latest

2024-02-29 Thread zhangzhisheng (Jira)


[ 
https://issues.apache.org/jira/browse/KAFKA-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17822135#comment-17822135
 ] 

zhangzhisheng commented on KAFKA-12261:
---

It is like this, but in my understanding, this is normal.

> Splitting partition causes message loss for consumers with 
> auto.offset.reset=latest
> ---
>
> Key: KAFKA-12261
> URL: https://issues.apache.org/jira/browse/KAFKA-12261
> Project: Kafka
>  Issue Type: Improvement
>Affects Versions: 2.7.0
>Reporter: Haruki Okada
>Assignee: Haruki Okada
>Priority: Minor
> Fix For: 3.6.0
>
>
> As of now, auto.offset.reset of ConsumerConfig is "latest" by default.
>  
> This could be a pitfall that causes message delivery loss when we split 
> topic's partitions like below:
> Say we have a topic-X which have only 1 partition.
>  # split topic-X to 2 partitions by kafka-topics.sh --alter --topic topic-X 
> --partitions 2 (topic-X-1 is added)
>  # producer knows that new partitions are added by refreshing metadata. 
> starts to produce to topic-X-1
>  # bit later, consumer knows that new partitions are added and triggering 
> consumer rebalance, then starts consuming topic-X-1
>  * 
>  ** upon starting consumption, it resets its offset to log-end-offset
> If the producer sent several records before 3, they could be not-delivered to 
> the consumer.
>  
>  
> This behavior isn't preferable in most cases, so it should be documented in 
> AUTO_OFFSET_RESET_DOC at least.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (KAFKA-12261) Splitting partition causes message loss for consumers with auto.offset.reset=latest

2021-02-20 Thread Haruki Okada (Jira)


[ 
https://issues.apache.org/jira/browse/KAFKA-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17287615#comment-17287615
 ] 

Haruki Okada commented on KAFKA-12261:
--

I see, thank you. Seems I don't have permission to set the assignee of the 
ticket, but anyway I've submitted a PR. Please take a look when you have time.

> Splitting partition causes message loss for consumers with 
> auto.offset.reset=latest
> ---
>
> Key: KAFKA-12261
> URL: https://issues.apache.org/jira/browse/KAFKA-12261
> Project: Kafka
>  Issue Type: Improvement
>Affects Versions: 2.7.0
>Reporter: Haruki Okada
>Priority: Minor
>
> As of now, auto.offset.reset of ConsumerConfig is "latest" by default.
>  
> This could be a pitfall that causes message delivery loss when we split 
> topic's partitions like below:
> Say we have a topic-X which have only 1 partition.
>  # split topic-X to 2 partitions by kafka-topics.sh --alter --topic topic-X 
> --partitions 2 (topic-X-1 is added)
>  # producer knows that new partitions are added by refreshing metadata. 
> starts to produce to topic-X-1
>  # bit later, consumer knows that new partitions are added and triggering 
> consumer rebalance, then starts consuming topic-X-1
>  * 
>  ** upon starting consumption, it resets its offset to log-end-offset
> If the producer sent several records before 3, they could be not-delivered to 
> the consumer.
>  
>  
> This behavior isn't preferable in most cases, so it should be documented in 
> AUTO_OFFSET_RESET_DOC at least.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KAFKA-12261) Splitting partition causes message loss for consumers with auto.offset.reset=latest

2021-02-19 Thread Luke Chen (Jira)


[ 
https://issues.apache.org/jira/browse/KAFKA-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17286914#comment-17286914
 ] 

Luke Chen commented on KAFKA-12261:
---

[~ocadaruma], sure, go ahead and assign the ticket to yourselves. THanks.

> Splitting partition causes message loss for consumers with 
> auto.offset.reset=latest
> ---
>
> Key: KAFKA-12261
> URL: https://issues.apache.org/jira/browse/KAFKA-12261
> Project: Kafka
>  Issue Type: Improvement
>Affects Versions: 2.7.0
>Reporter: Haruki Okada
>Priority: Minor
>
> As of now, auto.offset.reset of ConsumerConfig is "latest" by default.
>  
> This could be a pitfall that causes message delivery loss when we split 
> topic's partitions like below:
> Say we have a topic-X which have only 1 partition.
>  # split topic-X to 2 partitions by kafka-topics.sh --alter --topic topic-X 
> --partitions 2 (topic-X-1 is added)
>  # producer knows that new partitions are added by refreshing metadata. 
> starts to produce to topic-X-1
>  # bit later, consumer knows that new partitions are added and triggering 
> consumer rebalance, then starts consuming topic-X-1
>  * 
>  ** upon starting consumption, it resets its offset to log-end-offset
> If the producer sent several records before 3, they could be not-delivered to 
> the consumer.
>  
>  
> This behavior isn't preferable in most cases, so it should be documented in 
> AUTO_OFFSET_RESET_DOC at least.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KAFKA-12261) Splitting partition causes message loss for consumers with auto.offset.reset=latest

2021-02-18 Thread Haruki Okada (Jira)


[ 
https://issues.apache.org/jira/browse/KAFKA-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17286901#comment-17286901
 ] 

Haruki Okada commented on KAFKA-12261:
--

Yeah, actually I've reconsidered about that and changed the description of this 
issue.

 

Agree with keeping the default setting. Then may I submit a patch about 
refining AUTO_OFFSET_RESET_DOC?

> Splitting partition causes message loss for consumers with 
> auto.offset.reset=latest
> ---
>
> Key: KAFKA-12261
> URL: https://issues.apache.org/jira/browse/KAFKA-12261
> Project: Kafka
>  Issue Type: Improvement
>Affects Versions: 2.7.0
>Reporter: Haruki Okada
>Assignee: Luke Chen
>Priority: Minor
>
> As of now, auto.offset.reset of ConsumerConfig is "latest" by default.
>  
> This could be a pitfall that causes message delivery loss when we split 
> topic's partitions like below:
> Say we have a topic-X which have only 1 partition.
>  # split topic-X to 2 partitions by kafka-topics.sh --alter --topic topic-X 
> --partitions 2 (topic-X-1 is added)
>  # producer knows that new partitions are added by refreshing metadata. 
> starts to produce to topic-X-1
>  # bit later, consumer knows that new partitions are added and triggering 
> consumer rebalance, then starts consuming topic-X-1
>  * 
>  ** upon starting consumption, it resets its offset to log-end-offset
> If the producer sent several records before 3, they could be not-delivered to 
> the consumer.
>  
>  
> This behavior isn't preferable in most cases, so it should be documented in 
> AUTO_OFFSET_RESET_DOC at least.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (KAFKA-12261) Splitting partition causes message loss for consumers with auto.offset.reset=latest

2021-02-18 Thread Luke Chen (Jira)


[ 
https://issues.apache.org/jira/browse/KAFKA-12261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17286892#comment-17286892
 ] 

Luke Chen commented on KAFKA-12261:
---

I think we should document it, but not change the default setting to the 
*earlist*

> Splitting partition causes message loss for consumers with 
> auto.offset.reset=latest
> ---
>
> Key: KAFKA-12261
> URL: https://issues.apache.org/jira/browse/KAFKA-12261
> Project: Kafka
>  Issue Type: Improvement
>Affects Versions: 2.7.0
>Reporter: Haruki Okada
>Assignee: Luke Chen
>Priority: Minor
>
> As of now, auto.offset.reset of ConsumerConfig is "latest" by default.
>  
> This could be a pitfall that causes message delivery loss when we split 
> topic's partitions like below:
> Say we have a topic-X which have only 1 partition.
>  # split topic-X to 2 partitions by kafka-topics.sh --alter --topic topic-X 
> --partitions 2 (topic-X-1 is added)
>  # producer knows that new partitions are added by refreshing metadata. 
> starts to produce to topic-X-1
>  # bit later, consumer knows that new partitions are added and triggering 
> consumer rebalance, then starts consuming topic-X-1
>  * 
>  ** upon starting consumption, it resets its offset to log-end-offset
> If the producer sent several records before 3, they could be not-delivered to 
> the consumer.
>  
>  
> This behavior isn't preferable in most cases, so it should be documented in 
> AUTO_OFFSET_RESET_DOC at least.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)