Hi Spark users, especially Structured Streaming users who are using Kafka
as data source,

I'm pleased to introduce Kafka offset committer, which enables commit
offsets which batch has been processed. The tool is basically an
implementation of streaming query listener, which listens for events and
commit offsets for each batch. Please refer README.md in the repository to
see more details.

Here's Github repository for this tool:
https://github.com/HeartSaVioR/spark-sql-kafka-offset-committer
Currently it hasn't be published to Maven central, so you might need to
build the source and add jar via "--jars" option until artifact is
published.
I'd be happy to hear new ideas of improvements, and much appreciated for
contributions!

Enjoy!

Thanks,
Jungtaek Lim (HeartSaVioR)

Reply via email to