Nikki Thean created KAFKA-4607: ---------------------------------- Summary: Kafka Streams allows you to provide strings with illegal characters for internal topic names Key: KAFKA-4607 URL: https://issues.apache.org/jira/browse/KAFKA-4607 Project: Kafka Issue Type: Bug Reporter: Nikki Thean Priority: Minor
When using the aggregate function of the Kafka Streams DSL, I supplied the function with a name for the underlying state store that contained whitespace. The stream application ran with no error message but seemed to be hanging. After a lot of troubleshooting, I looked in the logs on the Kafka cluster and noticed that it was complaining about illegal characters in the log line. It turns out that an internal changelog topic had been created that contained whitespace and this was causing both Kafka and ZK to choke. I was unable to use the delete-topic command line utility as it also threw the illegal character error. In the end I had to manually delete the topics out of ZK and do some stuff with the offset logs. I am not well versed in the Kafka or Kafka Streams code, but from looking through it, I notice that [InternalTopicManager|https://github.com/apache/kafka/blob/trunk/streams/src/main/java/org/apache/kafka/streams/processor/internals/InternalTopicManager.java#L170] does not seem to validate the names of auto-generated internal topics beyond [checking for null topic names|https://github.com/apache/kafka/blob/trunk/streams/src/main/java/org/apache/kafka/streams/processor/internals/InternalTopicConfig.java#L38] rather than using Topic.validate. -- This message was sent by Atlassian JIRA (v6.3.4#6332)