[ 
https://issues.apache.org/jira/browse/HUDI-4575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yue Zhang updated HUDI-4575:
----------------------------
    Fix Version/s: 0.14.0
                       (was: 0.13.1)

> Initial Kafka Global Offsets  in Hudi Kafka Sink Connector
> ----------------------------------------------------------
>
>                 Key: HUDI-4575
>                 URL: https://issues.apache.org/jira/browse/HUDI-4575
>             Project: Apache Hudi
>          Issue Type: Bug
>          Components: kafka-connect
>            Reporter: Vishal Agarwal
>            Priority: Major
>             Fix For: 0.14.0
>
>
> Hi team,
> I am trying to run Hudi Sink Connector with Kafka Connect. When the 
> connectors starts, it starts the Transaction coordinator which initialises 
> the global committed  offsets from the Hudi commit file. When its a first 
> time run, there is no commit file and hence it outputs
> [2022-08-08 19:58:20,529] INFO Hoodie Extra Metadata from latest commit is 
> absent (org.apache.hudi.connect.writers.KafkaConnectTransactionServices:147) 
> But say in first time, the earliest kafka offset is not 0, then the process 
> keeps on running the commit timelines. Ideally, the global offsets, at first 
> run, should be set to the earliest kafka offset.
> As per the current implementation, the participant checks the local offset 
> with coordinator offset and when its a mismatch, it sets to 0. But this 
> breaks, when its a fresh run and the global kafka commited offset is not 0



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to