James Thornton created KAFKA-1436:
-------------------------------------
Summary: Idempotent Producer / Duplicate Detection
Key: KAFKA-1436
URL: https://issues.apache.org/jira/browse/KAFKA-1436
Project: Kafka
Issue Type: Improvement
Components: consumer, producer
Affects Versions: 0.8.0, 0.8.1, 0.8.2, 0.9.0, 0.8.1.1
Reporter: James Thornton
Assignee: Neha Narkhede
Dealing with duplicate messages is one of the major issues for teams using
Kafka. Jay Kreps posted a page on implementing an idempotent producer to
address it:
https://cwiki.apache.org/confluence/display/KAFKA/Idempotent+Producer
MapDB 1.0 (https://github.com/jankotek/MapDB) was just released, and either it
or Java Chronicle (https://github.com/OpenHFT/Java-Chronicle/) could be
embedded within each broker to provide a high-performance, random-access,
off-heap store for request IDs.
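To make the idea concrete, here is a minimal sketch of the broker-side check, assuming each produce request carries a string request ID. The class and method names are hypothetical and are not part of Kafka. A bounded on-heap LinkedHashMap stands in for the off-heap store, so no MapDB or Chronicle API is shown or implied.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/**
 * Hypothetical duplicate detector (illustration only, not Kafka code).
 * Assumption: an on-heap LinkedHashMap replaces the off-heap store
 * (MapDB or Java Chronicle) that the broker would actually embed.
 */
class RequestIdDeduplicator {
    private final Map<String, Boolean> seen;

    RequestIdDeduplicator(final int capacity) {
        // Insertion-ordered map that drops the oldest request ID
        // once more than `capacity` IDs are being tracked.
        this.seen = new LinkedHashMap<String, Boolean>(16, 0.75f, false) {
            @Override
            protected boolean removeEldestEntry(Map.Entry<String, Boolean> eldest) {
                return size() > capacity;
            }
        };
    }

    /**
     * Records the request ID.
     * Returns true if the ID was not being tracked (append the message),
     * false if it was already tracked (treat the request as a duplicate).
     */
    synchronized boolean firstTimeSeen(String requestId) {
        return seen.put(requestId, Boolean.TRUE) == null;
    }
}
```

A broker would call firstTimeSeen(requestId) before appending and skip the append when it returns false. Because the window is bounded, a retry that arrives after its ID has been evicted would be accepted again, so the capacity has to cover the longest expected retry interval.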
As Jay points out in his post, globally unique request IDs probably aren't
needed, but if that need should arise, Twitter's Snowflake service
(https://github.com/twitter/snowflake/) might be useful.
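For reference, a rough sketch of how a Snowflake-style 64-bit ID can be generated locally: 41 bits of milliseconds since a custom epoch, 10 bits of worker ID, and 12 bits of per-millisecond sequence. This is an illustration of the scheme, not Twitter's code. The class name and the epoch value are assumptions chosen for the example.

```java
/**
 * Hypothetical Snowflake-style ID generator (illustration only).
 * Bit layout, high to low: timestamp (41) | worker ID (10) | sequence (12).
 */
class SnowflakeStyleIdGenerator {
    // Assumed custom epoch: 2014-01-01T00:00:00Z. Any fixed past instant works.
    private static final long CUSTOM_EPOCH_MS = 1388534400000L;
    private static final int WORKER_BITS = 10;
    private static final int SEQUENCE_BITS = 12;
    private static final long MAX_WORKER_ID = (1L << WORKER_BITS) - 1;
    private static final long SEQUENCE_MASK = (1L << SEQUENCE_BITS) - 1;

    private final long workerId;
    private long lastTimestampMs = -1L;
    private long sequence = 0L;

    SnowflakeStyleIdGenerator(long workerId) {
        if (workerId < 0 || workerId > MAX_WORKER_ID) {
            throw new IllegalArgumentException("workerId must be in [0, " + MAX_WORKER_ID + "]");
        }
        this.workerId = workerId;
    }

    /** Returns an ID that is unique per worker and increases over time. */
    synchronized long nextId() {
        long now = System.currentTimeMillis();
        if (now < lastTimestampMs) {
            // Refuse to issue IDs that could collide with earlier ones.
            throw new IllegalStateException("Clock moved backwards");
        }
        if (now == lastTimestampMs) {
            sequence = (sequence + 1) & SEQUENCE_MASK;
            if (sequence == 0) {
                // Sequence exhausted for this millisecond: wait for the next one.
                while (now <= lastTimestampMs) {
                    now = System.currentTimeMillis();
                }
            }
        } else {
            sequence = 0L;
        }
        lastTimestampMs = now;
        return ((now - CUSTOM_EPOCH_MS) << (WORKER_BITS + SEQUENCE_BITS))
                | (workerId << SEQUENCE_BITS)
                | sequence;
    }
}
```

Each producer or broker would need a distinct worker ID for the IDs to be globally unique, which is the coordination cost that makes per-producer IDs with sequence numbers look simpler.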
--
This message was sent by Atlassian JIRA
(v6.2#6252)