[ https://issues.apache.org/jira/browse/KAFKA-15190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17743311#comment-17743311 ]

A. Sophie Blee-Goldman commented on KAFKA-15190:
------------------------------------------------

I'm all for this, though it will need a KIP. Would you be interested in writing one? Happy to help you with the process if so.

In the meantime, perhaps you can get some relief by writing this processId directly during setup, before starting the Streams app? I believe it just expects a plain UUID at the moment, so you should be able to write a function that hashes the container ID to something of that form and then persists it to disk in exactly the same way as Streams.

> Allow configuring a streams process ID
> --------------------------------------
>
>                 Key: KAFKA-15190
>                 URL: https://issues.apache.org/jira/browse/KAFKA-15190
>             Project: Kafka
>          Issue Type: Wish
>          Components: streams
>            Reporter: Joe Wreschnig
>            Priority: Major
>
> We run our Kafka Streams applications in containers with no persistent
> storage, and therefore the mitigation of persisting the process ID in the
> state directory (KAFKA-10716) does not help us avoid shuffling lots of tasks
> during restarts.
> However, we do have a persistent container ID (from a Kubernetes
> StatefulSet). Would it be possible to expose a configuration option to let us
> set the streams process ID ourselves?
> We are already using this ID as our group.instance.id - would it make sense
> to have the process ID be automatically derived from this (plus
> application/client IDs) if it's set? The two IDs seem to have overlapping
> goals of identifying "this consumer" across restarts.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
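
For anyone trying the workaround suggested in the comment, here is a rough sketch of what "hash the container ID to a UUID and persist it before starting Streams" could look like. It is not taken from the Kafka codebase. The class name ProcessIdSeeder and both method names are made up for illustration. The file name (kafka-streams-process-metadata), its location under <state.dir>/<application.id>/, and the single-field JSON layout are assumptions about what Streams' StateDirectory currently writes, so please check them against the Kafka version you actually run.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.UUID;

// Hypothetical helper, not part of Kafka. Run it before KafkaStreams is started.
class ProcessIdSeeder {

    // ASSUMPTION: name of the file in which Streams persists its process ID.
    // Verify against StateDirectory in your Kafka version.
    static final String PROCESS_FILE_NAME = "kafka-streams-process-metadata";

    // Deterministically maps a stable container ID (e.g. a StatefulSet pod name)
    // to a name-based (version 3) UUID, so every restart yields the same value.
    public static UUID deriveProcessId(String containerId) {
        return UUID.nameUUIDFromBytes(containerId.getBytes(StandardCharsets.UTF_8));
    }

    // Writes the process ID under <stateDir>/<applicationId>/.
    // ASSUMPTION: the file holds a JSON object with a single "processId" field.
    public static Path writeProcessFile(Path stateDir, String applicationId, UUID processId)
            throws IOException {
        Path appDir = stateDir.resolve(applicationId);
        Files.createDirectories(appDir);
        Path file = appDir.resolve(PROCESS_FILE_NAME);
        String json = "{\"processId\":\"" + processId + "\"}";
        Files.write(file, json.getBytes(StandardCharsets.UTF_8));
        return file;
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 3) {
            System.err.println("usage: ProcessIdSeeder <containerId> <stateDir> <applicationId>");
            return;
        }
        UUID processId = deriveProcessId(args[0]);
        Path written = writeProcessFile(Paths.get(args[1]), args[2], processId);
        System.out.println("Wrote " + processId + " to " + written);
    }
}
```

The idea is to invoke this from the container entrypoint (or at the top of the application's own main method) with the same state.dir and application.id that the Streams app is configured with, so the file is already in place when Streams initialises its state directory. Using a name-based UUID keeps the mapping stable without needing any storage, which is the point for containers that lose their disk on restart.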