[ https://issues.apache.org/jira/browse/HUDI-1910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17362600#comment-17362600 ]
Nishith Agarwal commented on HUDI-1910: --------------------------------------- [~vinaypatil18] Thanks for sharing your approach. The first level of configs in deltastreamer are meant to be for generic use-cases that apply to general purpose ingestion activities. Whenever we want to add a specific use-case, we add configurable classes and then add a parameter something like this -> [https://github.com/apache/hudi/blob/master/hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/helpers/KafkaOffsetGen.java#L158.] This has 2 advantages # If you want to dynamically enable or disable such configs without code changes or deployment, it can be done if you are keeping your properties file updated with configs. # Keep users away from many custom configs (which most users might not care) by not floating them as top level configs in deltasteramer (they way you suggested). I think we should consider the first approach. > Supporting Kafka based checkpointing for HoodieDeltaStreamer > ------------------------------------------------------------ > > Key: HUDI-1910 > URL: https://issues.apache.org/jira/browse/HUDI-1910 > Project: Apache Hudi > Issue Type: Improvement > Components: DeltaStreamer > Reporter: Nishith Agarwal > Assignee: Vinay > Priority: Major > Labels: sev:normal, triaged > > HoodieDeltaStreamer currently supports commit metadata based checkpoint. Some > users have requested support for Kafka based checkpoints for freshness > auditing purposes. This ticket tracks any implementation for that. -- This message was sent by Atlassian Jira (v8.3.4#803005)